GithubHelp home page GithubHelp logo

hhy5277 / nlp-1 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from makcedward/nlp

0.0 1.0 0.0 2.23 MB

:memo: This repository recorded my NLP journey.

Home Page: https://makcedward.github.io/

Python 61.82% Shell 1.43% sed 1.59% Jupyter Notebook 35.16%

nlp-1's Introduction

NLP - Tutorial

Repository to show how NLP can tacke real problem. Including the source code, dataset, state-of-the art in NLP

Text Augmentation

Section Sub-Section Research Lab Story Paper & Code
Augmentation Data Augmentation in NLP Medium
Augmentation Data Augmentation library for Text Medium
Augmentation Does your NLP model able to prevent adversarial attack? Medium
Augmentation Data Augmentation library for Speech Recognition Medium
Augmentation Data Augmentation library for Audio Medium
Augmentation Unsupervied Data Augmentation Medium

Text Processing

Section Sub-Section Description Link
Tokenization Subword Tokenization Medium
Tokenization Word Tokenization Medium Github
Tokenization Sentence Tokenization Medium Github
Part of Speech Medium Github
Lemmatization Medium Github
Stemming Medium Github
Stop Words Medium Github
Phrase Word Recognition
Spell Checking Lexicon-based Peter Norvig algorithm Medium Github
Lexicon-based Symspell Medium Github
Machine Translation Statistical Machine Translation Medium
Machine Translation Attention Medium
String Matching Fuzzywuzzy Medium Github

Named Entity Recognition (NER)

Section Sub-Section Research Lab Story Paper & Code
Pattern-based Recognition Medium
Lexicon-based Recognition Medium
Pre-trained NER Spacy Medium Github
Custom NER

Optical Character Recognition (OCR)

Section Sub-Section Research Lab Story Paper & Code
Printed Text Google Cloud Vision API Google Medium Paper
Handwriting LSTM Google Medium Paper

Text Summarization

Section Sub-Section Description Link
Extractive Approach Medium Github
Abstractive Approach

Emotion Recognition

Section Sub-Section Description Link
Audio, Text, Visual 3 Multimodals for Emotion Recognition Medium

Voice

Section Sub-Section Description Link
Feature Representation Unsupervised Learning Introduction to Audio Feature Learning Medium Paper 1 Paper 2 Paper 3
Speech-to-text Introduction to Speeh-to-text Medium

Distance Measurement

Section Sub-Section Description Link Paper
Euclidean Distance, Cosine Similarity and Jaccard Similarity Medium Github
Edit Distance Levenshtein Distance Medium Github
Word Moving Distance (WMD) Medium Github
Supervised Word Moving Distance (S-WMD) Medium
Manhattan LSTM Medium Paper

Text Representation

Section Sub-Section Research Lab Story Paper & Code
Traditional Method Bag-of-words (BoW) Medium Github
Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) Medium Github
Character Level Character Embedding New York University Medium Github Paper
Word Level Negative Sampling and Hierarchical Softmax Medium
Word2Vec, GloVe, fastText Medium Github
Contextualized Word Vectors (CoVe) Salesforce Medium Github Paper Code
Embeddings from Language Models (ELMo) AI2 Medium Github Paper Code
Generative Pre-Training (GPT) OpenAI Medium Paper Code
Contextual String Embeddings Zalando Research Medium Paper Code
Self-Governing Neural Networks (SGNN) Google Medium Paper
Multi-Task Deep Neural Networks (MT-DNN) Microsoft Medium Paper
Generative Pre-Training-2 (GPT-2) OpenAI Medium Paper Code
Universal Language Model Fine-tuning (ULMFiT) OpenAI Medium Paper Code
Sentence Level Skip-thoughts Medium Github Paper Code
InferSent Medium Github Paper Code
Quick-Thoughts Google Medium Paper Code
General Purpose Sentence (GenSen) Medium Paper Code
Bidirectional Encoder Representations from Transformers (BERT) Google Medium Paper Code
BERT in Science Domain Medium SciBERT Paper BioBERT Paper
BERT in Clinical Domain Medium Clincical BERT Embeddings Paper ClinicalBert Paper
Unified Language Model for NLP and NLU Medium Paper
Cross-lingual Language Model Medium Paper
Document Level lda2vec Medium Paper
doc2vec Google Medium Github Paper

Model Interpretation

Section Sub-Section Description Link
ELI5, LIME and Skater Medium Github
SHapley Additive exPlanations (SHAP) Medium Github
Anchors Medium Github

Myth

Section Sub-Section Description Link
Using Deep Learning can resolve all problem? Medium Kaggle

Source Code

Section Sub-Section Description Link
Spellcheck Github
InferSent Github

nlp-1's People

Contributors

makcedward avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.