GithubHelp home page GithubHelp logo

amy-leaf's Projects

adult-data-set-classification-and-classifier-comparison icon adult-data-set-classification-and-classifier-comparison

In this program we apply machine learning principals to predict weather income exceeds $50k per year on the Adult data set. We use the four techniques to achieve better performance which includes choosing appropriate classifier, preprocessing techniques, parallel infrastructure and external libraries. This program will focus on proper use of each classifier by fine tuning the hyperparameter to achieve the best results, the classifiers include SVM, KNN, Random Forest, Gaussian Naïve Bayes. The preprocessing techniques used to eliminate noise and inconsistency of data are standard scaler, label Encoder and quantile transformer. The best accuracy was achieved by random forest, this Classifier outperforms every other classifier as it makes multiple decision tree which prevents overfitting.

amazonproductrecommendation-cf-als-spark icon amazonproductrecommendation-cf-als-spark

We have chosen Amazon product sales data set comprising of sales activity and user ratings for each product. The idea is to create a product suggestion/recommendation system for each user based on his previous purchases and his rating for each one. A Collaborative Filtering model is built to predict the virtual ratings for the product that the user did not purchase. The system predicts the user rating for all the items and we display the products which user may be like, buy and rate higher.

apriori icon apriori

Python Implementation of Apriori Algorithm for finding Frequent sets and Association Rules

big-data-automation icon big-data-automation

This is a Movie Recommender System built on both Hadoop and Spark using MovieLens 10 million ratings dataset. The project's deployment has been automated using Ansible.

boston icon boston

Predicting prices using linear regression on The Boston Housing Dataset

cf-nade icon cf-nade

A implementation of CF-NADE. Yin Zheng, et. al. "A Neural Autoregressive Approach to Collaborative Filtering", accepted by ICML 2016.

movie_recommendation_system icon movie_recommendation_system

Launched a distributed application using Spark and MLlib ALS recommendation engine to analyze a complex dataset of 10 million movie ratings from MovieLens.

movielens icon movielens

使用 Spark MLlib 的 ALS 算法的电影推荐系统

movielensrecommendation icon movielensrecommendation

Built recommender engine for MovieLens dataset using Spark to learn user preferences based on various factors like ratings, users and items

parllel-fp-growth icon parllel-fp-growth

this project contains the implementation of the parllel fp growth using spark

similarity-user-based icon similarity-user-based

Code used in paper Analysis of Similarity Measures for Collaborative Filtering Recommendation, presented in ISCCDA 2017, NIE, Mysuru.

spark-movie-lens icon spark-movie-lens

An on-line movie recommender using Spark, Python, and the MovieLens dataset

ts-ppr icon ts-ppr

Codes for IEEE TKDE 2016 paper: Recommendation for repeat consumption from user implicit feedback

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.