floraxinru / nlp_hotelreviews Goto Github PK
View Code? Open in Web Editor NEWUsing sentiment analysis to analyze 515K European hotel reviews (NLP, Naive Bayes Classifier, seaborn)
Using sentiment analysis to analyze 515K European hotel reviews (NLP, Naive Bayes Classifier, seaborn)
(when revisiting this project in May 2019) Realized there is no correlation because the column Reviewer_Score contains scores for all the hotels from all reviewers, plus the ones that only left a score without reviewing.
To take care of that discrepancy, would probably need to filter out the scores posted by those who did not leave a review, and then find a way to match score with reviewer (while the purpose of the project is mainly to use the Naive Bayes Classifier for sentiment analysis).
At this point it might be better to go in the other direction of using the newly-developed ULMFiT and training a language model for hotel reviews. Also considering "diminishing returns", it might be better to explore ULMFiT with a larger, more nuanced data set.
A new approach for NLP was developed in 2018, using neural networks and inductive transfer learning for text classification. It would be very interesting to apply it here (there's already 1 related kernel on Kaggle), and train a language model and use it to classify hotel reviews (the result might also be applied to reviews for airbnb or other rentals)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.