
mangipudiprashanth7 / twitter_sentiment_analysis-using-ml-and-nlp


The objective of this project is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hate speech if it has a racist or sexist sentiment associated with it. So, the task is to distinguish racist or sexist tweets from other tweets.

License: MIT License

Jupyter Notebook 100.00%
machine-learning natural-language-processing bagging xgboost-algorithm sentiment-analysis


Twitter Sentiment Analysis

Python 3.6 NLTK

If you like my work, do leave a star ✨. I'd be more than happy to receive any suggestions to improve my work!

This project uses a range of techniques. Basic concepts such as data cleaning and data transformation are also explained clearly.

Summary

Word2Vec features turned out to be the most useful, and XGBoost with Word2Vec features was the best model for this problem. This shows the power of word embeddings in dealing with NLP problems.
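To make "Word2Vec features" concrete, here is a minimal sketch of one common way to turn per-word embeddings into a fixed-length feature vector for a tweet: averaging the vectors of its tokens. Whether the notebook builds its features exactly this way is an assumption. The function name `tweet_vector` and the tiny `toy_embeddings` table are hypothetical and made up for illustration; in practice the vectors would come from a trained Word2Vec model.

```python
from typing import Dict, List


def tweet_vector(tokens: List[str],
                 embeddings: Dict[str, List[float]],
                 dim: int) -> List[float]:
    """Average the embedding vectors of all tokens that have one."""
    total = [0.0] * dim
    known = 0
    for token in tokens:
        vec = embeddings.get(token)
        if vec is None:
            continue  # skip out-of-vocabulary tokens
        for i, value in enumerate(vec):
            total[i] += value
        known += 1
    if known == 0:
        return total  # no known tokens: fall back to the all-zero vector
    return [value / known for value in total]


# Toy 2-dimensional embeddings, invented for this example only.
toy_embeddings = {"good": [1.0, 0.0], "day": [0.0, 1.0]}

features = tweet_vector(["good", "day", "unknownword"], toy_embeddings, dim=2)
print(features)
```

The resulting vectors have the same length for every tweet, so they can be passed to a classifier such as XGBoost as ordinary numeric features.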

WHAT ELSE CAN BE TRIED?

We have covered a lot in this Sentiment Analysis project, but there is still plenty of room to try other things. Below is a list of tasks that you can try with this data.

1. We have built many models in this project, so model ensembling is a natural next step. A simple ensemble of all the submission files (maximum voting) yielded an F1 score of 0.55 on the public leaderboard.

2. Use Parts-of-Speech tagging to create new features.

3. Use stemming and/or lemmatization. It might help by collapsing inflected variants of the same word.

4. Use bi-grams or tri-grams (sequences of 2 or 3 words, respectively) for Bag-of-Words and TF-IDF.

5. Give pretrained word-embedding models a try.
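Two of the ideas above, maximum voting and n-grams, are simple enough to sketch in plain Python. This is only an illustration under stated assumptions: the helper names `majority_vote` and `ngrams` are hypothetical, labels are assumed to be 1 for hate speech and 0 otherwise, and ties in the vote are resolved toward the smaller label, which is an arbitrary choice.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-model label lists by taking the most common label per tweet."""
    if not predictions:
        return []
    n = len(predictions[0])
    if any(len(p) != n for p in predictions):
        raise ValueError("all models must predict the same number of tweets")
    combined = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        top = max(counts.values())
        # Tie-break toward the smallest label (assumed 0 = not hate speech).
        combined.append(min(label for label, c in counts.items() if c == top))
    return combined


def ngrams(tokens: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous runs of n tokens (bi-grams for n=2, tri-grams for n=3)."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Made-up predictions from three hypothetical models on four tweets.
model_a = [0, 1, 1, 0]
model_b = [0, 1, 0, 0]
model_c = [1, 1, 0, 0]
ensemble = majority_vote([model_a, model_b, model_c])
print(ensemble)

bigrams = ngrams(["use", "bi", "grams", "here"], 2)
print(bigrams)
```

With an odd number of models and binary labels, ties cannot occur, so the tie-break rule only matters for an even number of models.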

Credits✨

Analytics Vidhya Sentiment Analysis Project! 🎈
