GithubHelp home page GithubHelp logo

drelhaj / nlp_ml_visualization_tutorial Goto Github PK

View Code? Open in Web Editor NEW
19.0 19.0 11.0 3.96 MB

This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine Learning, Topic Modelling and corpus Linguistics. The tutorial is pat of the "Data Visualisation Workshop for Critical Computational Discourse" at the Data Science Institute at Lancaster University, UK. Presented by Dr Mahmoud El-Haj https://www.lancaster.ac.uk/staff/elhaj

Jupyter Notebook 100.00%

nlp_ml_visualization_tutorial's Introduction

NLP_ML_Visualization_Tutorial

This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine Learning, Topic Modelling and Corpus Linguistics. The tutorial is pat of the "Visualise My Corpus" UCREL and DSG Seminar and Tutorial as well as the "Data Visualisation Workshop for Critical Computational Discourse" at the Data Science Institute at Lancaster University, UK.

Arabic Version of this Tutorial:

You can find the Arabic-customised version of this tutorial here: https://github.com/drelhaj/NLP_ML_Visualization_Tutorial/tree/master/Arabic_Tutorial

Author and Presenter:

Dr Mahmoud El-Haj https://www.lancaster.ac.uk/staff/elhaj

Presentation Slides:

If you have attended the 'Visualise My Corpus' talk before here are the introductory slides: https://www.lancaster.ac.uk/staff/elhaj/docs/visualise_my%20_corpus.pdf

Presentation YouTube Video:

A step by step presentation of the tutorials: https://youtu.be/g6tUQxIVesA

Tutorials

The repository is made up of 6 tutorials as follow:

  • 1- Visualaization using SpaCy: a basic introduction to using SpaCy and to visualise part of speech tagging and named entity recognition.
  • 2- Topic Modelling: Using LDA and LDAvis to display an interactive topic model.
  • 3- Word Clouds: an introduction to creating word clouds using basic word frequency and more towards focusing on other part of speech tags.
  • 4- Machine Learning: a basic introduction to SVM and Naive Bayse, this a simple classifier and the results are shown in a confusion matrix.
  • 5- Word Usage: show word usage in terms of frequency over a period of time
  • 6- Word Embeddings: a gentle start to word embeddings using gensim and visualising the vectors using TSNE and PCA.

Installation

You need Jupyter to run the notebooks https://jupyter.org/. Check the 0_Visualisation_Setup.ipynb for the required python packages. (https://github.com/drelhaj/NLP_ML_Visualization_Tutorial/blob/master/0_Visualisation_Setup.ipynb)

nlp_ml_visualization_tutorial's People

Contributors

chiamaka249 avatar dependabot[bot] avatar drelhaj avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.