GithubHelp home page GithubHelp logo

playground's Introduction

Playground

This depository stores some of my experimented projects. Welcome to check it out and give any feedback, so that we can improve the cases even better.

01-Classification-Modeling-on-Hotel-Scoring
Comparing the performance of models among MLP, Xgboost, and Logistic, and choose Logistic Model as the final choice. The final model reaches 0.62 in accuracy and 0.61 in F1 score in three classes scenario. If we look deeper, the share of each class are 35%, 29%, 36%, which compared to the precision from the predicted result for each class 68%, 47%, 65%, we can say that Logistic Model doubles the precision generally.

02-Mass-Transit-Analysis
By the end of this analysis, I am able to come up with a list of suggested stations for ads placement. Adhering to the stations, I also obtain a table of expected commute time for various subway routes.

04-Model-Comparison-Word2vec-Doc2vec-TfIdfWeighted
In this post, I'd like to test out the effect of different kinds of wordembeddings on text classifier. The wordembeddings investigated here include word2vec, TF-IDF weighted word2vec, pre-train GloVe word2vec and doc2vec. It turns out that there is no significant difference among their performance, in the meantime, pre-train Glove and doc2vec alone seem to under-perform a bit compared with others.

05-Try-out-Spacy-Pretrain
This post focuses on trying out spaCy's new BERT-style language pre-training feature. First, I set up a classifier without pre-training and then compare it with one initiated with pre-training. It turns out the classifier without pre-training has higher score on both f1 and accuracy. Nonetheless, take it with a grain of salt. If we look closely, the one with pre-training performs better at identifying rare classes. It still has its own merits if identifying rare classes matter.

06-Recsys-Variants-SVD-Recommender
This post consists of 5 jupyter notebookes, beginning from data preprocessing to model training. In this post, we focus on CF, particularly SVD-based algorithms. We'll practice on building not only SVD based model, but also its variations, such as one with regularization and the other, neural network adopted. The final founding is that recommender built on neural network(NCF) has the best performance in terms of RMSE. Nevertheless, most SVD based models can roughly achieve quality result as long as each item and user of the dataset having enough interaction records, meaning no significant problem of cold-start.

playground's People

Contributors

tomlin avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.