GithubHelp home page GithubHelp logo

artanic30 / cs150-database-and-data-mining-project Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 21 MB

Final project for CS150 database and data minning course in fall 2020

TeX 31.11% Jupyter Notebook 45.25% Python 23.64%

cs150-database-and-data-mining-project's Introduction

CS150 Database and Data Mining Project

Logistics

Please type your Chinese name and ID.

  • Your Name: 邱龙田

  • Your ID: 2018533107

  • Your Name: 施芊靖

  • Your ID: 2018533194

If you are a team, please write names and IDs of both people.

Due date: 23:59,January, 10th, 2020.

You need to finish an entire machine learning system on provided dataset. You do not need to implement a machine learning algorithm from scratch, you are free to call any existing libraries for data science.

Submission

You need to submit three parts.

  • Submit the report to gradescope
    • To form a team, remember to select your teammate when you are submitting at gradescope.
  • Submit the completed test.csv to gradescope
    • To form a team, remember to select your teammate when you are submitting at gradescope.
  • Submit your codes to the github classroom repository.

Report

A report at most 4-page to describle the entire pipeline of your work. You should use the provided the report template, follow the guideline and instructions given in the template and fill into the corresponding part.

Answers of the rest in testset

We'll only offer a subset of correct answers for test data. To submit your results, you should complete the missing values of Correct First Attempt in test.csv, which means replace NaN with the value your model predicts. Then you need to submit your completed test.csv. (Don't submit train.csv.)

Note: For those who don't obey our submission rules, we'll give it 0 point. If you have any question about this, post it on Piazza.

Codes

You also need to upload your codes with an introduction file. We'll do duplicate checking for all the submitted codes, so don't copy other people's codes.

Bonus: We'll offer additional points for those using PySpark to implement the algorithms. To earn the bonus, state clearly in the report about your implementation.

cs150-database-and-data-mining-project's People

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.