GithubHelp home page GithubHelp logo

adkwn1 / numerical_methods Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 5.71 MB

Trains, tunes, and evaluates different regression models to develop a time-efficient, high-quality model for predicting car prices based on RMSE and CPU runtime.

Jupyter Notebook 100.00%
catboostregressor gridsearchcv lightgbm-regressor pandas performance-analysis python regression-algorithms rmse sklearn time

numerical_methods's Introduction

Numerical Methods

Author: Andrew Kwon

Description

This project trains and evaluates different prediction models for a regression task. Scope of the project is to balance runtime performance with model quality. Disclaimer: this project runs on CPU only support in its current state to measure CPU runtime performance; users can expect long runtimes if starting the notebook from a fresh kernel.

Introduction

A used car sales service is developing an app to attract new customers where users can quickly find out the market value of their car. In this project, we will need to build the model that predicts the car values. The company is interested in the quality (evaluated on RMSE), speed of the prediction, and time required for model training.

Dataset

We have access to historical data such as technical specifications, trim versions, and prices, which are further detailed in the file car_data.csv. Due to upload size limitations, users will need to extract the 7zip archive into the appropriate directory.

Features

  • DateCrawled: date profile was downloaded from the database
  • VehicleType: vehicle body type
  • RegistrationYear: vehicle registration year
  • Gearbox: gearbox type
  • Power: engine power in horsepower
  • Model: vehicle model
  • Mileage: mileage (in kilometers)
  • RegistrationMonth: vehicle registration month
  • FuelType: fuel type
  • Brand: vehicle brand
  • NotRepaired: vehicle repaired or not
  • DateCreated: date of profile creation
  • NumberOfPictures: number of vehicle pictures
  • PostalCode: postal code of profile user
  • LastSeen: date of the last activity of the user

Target

  • Price: price in euro

Requirements

  • pandas
  • numpy
  • time
  • matplotlib.pyplot
  • sklearn.ensemble
  • sklearn.linear_model
  • sklearn.metrics
  • sklearn.model_selection
  • lightgbm
  • catboost

numerical_methods's People

Contributors

adkwn1 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.