xhuang28 Goto Github PK
Name: xiao
Type: User
Name: xiao
Type: User
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11-billion-token unstructured textual data.
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Stanford CoreNLP: A Java suite of core NLP tools.
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling
GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates the word classes necessary for training some of the alignment models.
Python code for training all models in the ICLR paper, "Towards Universal Paraphrastic Sentence Embeddings". These models achieve strong performance on semantic similarity tasks without any training or tuning on the training data for those tasks. They also can produce features that are at least as discriminative as skip-thought vectors for semantic similarity tasks at a minimum. Moreover, this code can achieve state-of-the-art results on entailment and sentiment tasks.
Distinguish Bots from Humans on Twitter
Empower Sequence Labeling with Task-Aware Language Model
MACROSCORE project at ISI - Micro Feature Extraction direction
Massively Multilingual Transfer for NER
The code for "A Unified MRC Framework for Named Entity Recognition"
Code for our AAAI2019 paper "GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition"
NLTK Contrib
A BIO formatted Named Entity Recognition data set extracted from the OntoNotes 5.0 release.
Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"
English word segmentation, written in pure-Python, and based on a trillion-word corpus.
syntactically controlled paraphrase networks
Code for "Strong Baselines for Neural Semi-supervised Learning under Domain Shift" (Ruder & Plank, 2018 ACL)
Vanilla Sequence Labeling w. Char-LSTM-CRF
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.