GithubHelp home page GithubHelp logo

farallonwest's Projects

academic_web_crawler icon academic_web_crawler

A web crawler that finds resources about authors, institutions, journals, conferences, and research papers.

article_categorization icon article_categorization

This program is used to develop a NLP deep learning model that categorizes articles into 5 categories based on its topic.

carrot2 icon carrot2

Carrot2: Text Clustering Algorithms and Applications

constellation icon constellation

A graph-focused data visualisation and interactive analysis application.

contextualized-topic-models icon contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

datapolitics icon datapolitics

Data and documents for the DataPolitics archive about political marketing companies

digital-gardeners icon digital-gardeners

Resources, links, projects, and ideas for gardeners tending their digital notes on the public interwebs

giveme5w1h icon giveme5w1h

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

huginn icon huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

maigret icon maigret

🕵️‍♂️ Collect a dossier on a person by username from thousands of sites

metaosint.github.io icon metaosint.github.io

A tool to quickly identify relevant, publicly-available open source intelligence ("OSINT") tools and resources, saving valuable time during investigations, research, and analysis.

obsidian-osint-templates icon obsidian-osint-templates

These templates are suggestions of how the Obsidian notetaking tool can be used during an OSINT investigation. The example data in those files should allow you to make some connections (see what I did there?) between how you record your data during an investigation and some of what the tool can offer FOR FREE!

openrefine icon openrefine

OpenRefine is a free, open source power tool for working with messy data and improving it

politics icon politics

Code, data, and models for "POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection"

sirix icon sirix

Sirix facilitates effective and efficient storing and querying of your temporal data through snapshotting (only ever appends changed database pages) and a novel versioning approach called sliding snapshot, which versions at the node level. Currently we support the storage and querying of XML- and JSON-documents in our binary encoding.

sirix-doc icon sirix-doc

Repository which includes all kinds of documentation as well as technical reports, conference papers... for Sirix.

skraper icon skraper

Kotlin/Java library and cli tool for scraping posts and media from various sources with neither authorization nor full page rendering (Facebook, Instagram, Twitter, Youtube, Tiktok, Telegram, Twitch, Reddit, 9GAG, Pinterest, Flickr, Tumblr, Coub, Vimeo, IFunny, VK, Odnoklassniki, Pikabu)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.