GithubHelp home page GithubHelp logo

project-2's Introduction

Project 2: NYC TAXI FARE PREDICTION PROJECT

NJIT Data Science Program Course CS634 Data Mining - Summer 2020 Instructor Pantelis Monogioudis, Ph.D Professor of Practice, NJIT & Adjunct NYU Teaching Assistant Michael Lam (PhD student)

Students:
Fernando Rios
Hassan Ouanir
Ian Kavuma

Introduction:

In this project, the task is to predict the taxi fare in New York City given the information about number of passengers, pickup and dropoff locations, and the pickup datetime. The NYC TAXI FARE dataset consists of 55 million training records since 2009 and 9914 test records which represent taxi trips in New York.To achieve this goal predicting NYC Cab fare, we split this project into three parts:

Part I : Data pre-processing and Exploratory Data Analysis (EDA)

  • Data cleaning, data Exploration
  • Data visualization, and
  • Feature engineering, Feature Selection
**Part II**: Predictive Modelling Î del Evaluation del Evaluation

Part III: Submission

###Setup

This notebook is prepared to run easily, just copy the files: kaggle.json, nyc_mask.png and, boxdefined.png in your google drive, after that, open the file NYC TAXI FARE PROJECT/NJIT.ipynb

select option run all in google colab. In its lines this notebook do these steps to have ready the dataset.

# Import Libraries
from google.colab import files
import os
import sys
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from IPython.display import Markdown, display
import datetime as dt

from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error
from sklearn.linear_model import LinearRegression

# Installing kaggle
!pip install kaggle

# Mounting google drive
drive.mount('/content/drive/')

# Move kaggle.json from Google Drive to folder Kaggle
!mkdir -p ~/.kaggle
!cp /content/drive/My\ Drive/kaggle.json  ~/.kaggle/kaggle.json

# Downloading dataset from Kaggle
!kaggle competitions download -c new-york-city-taxi-fare-prediction

# List all files in the folder where the data was downloaded
print(os.listdir())

After this, you can go through the all parts of the project: Part I: Data pre-processing and Exploratory Data Analysis (EDA) Part II: Predictive Modelling Part III: Submission

All contributions are welcome! In this project, the task is to predict the taxi fare in New York City given the information about number of passengers, pickup and dropoff locations, and the pickup datetime. The NYC TAXI FARE dataset consists of 55 million training records since 2009 and 9914 test records which represent taxi trips in New York.

To achieve this goal predicting NYC Cab fare, we split this project into three parts:

Part I: Data pre-processing and Exploratory Data Analysis (EDA) ta cleaning, Data Exploration ta visualization, and Îture engineering, Feature Selection

Part II: Predictive Modelling Î Taxi Fare Prediction

project-2's People

Contributors

frios2020 avatar ssempax1 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.