benwaldner / ecmlpkdd2015-challenge

This project forked from achm6174/ecmlpkdd2015-challenge

[2015, ECMLPKDD] 4th & 22nd place solution - Taxi trajectory prediction and trip time Prediction

Home Page: https://www.kaggle.com/c/pkdd-15-taxi-trip-time-prediction-ii

License: MIT License

Shell 1.07% Python 55.91% R 43.02%

Summary

This is my solution for the ECML/PKDD 2015 Discovery Challenge: Taxi Trajectory Prediction and Taxi Trip Time Prediction.

Problem

To improve the efficiency of electronic taxi dispatching systems it is important to be able to predict the final destination of a taxi while it is in service. Particularly during periods of high demand, there is often a taxi whose current ride will end near or exactly at a requested pick up location from a new rider. If a dispatcher knew approximately where their taxi drivers would be ending their current rides, they would be able to identify which taxi to assign to each pickup request.

The spatial trajectory of an occupied taxi could provide some hints as to where it is going. Similarly, given the taxi id, it might be possible to predict its final destination based on the regularity of pre-hired services. In a significant number of taxi rides (approximately 25%), the taxi has been called through the taxi call-center, and the passenger’s telephone id can be used to narrow the destination prediction based on historical ride data connected to their telephone id.

Our solution

The approach is summarized as follows:

  • First, perform trip matching by extracting the trip with the closest starting point on the map, based on Haversine distance.
  • Match similar trips using multivariate Dynamic Time Warping.
  • Extract driver features and train a machine learning model on them.
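The first two steps can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the code in this repository: the function names, the `k` parameter, the Earth-radius constant, and the choice of Haversine distance as the local cost inside DTW are all hypothetical. Trajectories are assumed to be lists of (latitude, longitude) pairs in degrees.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, p)
    lat2, lon2 = map(math.radians, q)
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def closest_starts(query, candidates, k=1):
    """Return the k candidate trips whose first point is nearest the query's first point."""
    return sorted(candidates, key=lambda trip: haversine_km(query[0], trip[0]))[:k]


def dtw_distance(a, b):
    """Dynamic Time Warping cost between two trajectories.

    Each point is a 2-D vector (lat, lon), so the local cost is computed on
    both dimensions jointly; here that cost is the Haversine distance.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = haversine_km(a[i - 1], b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Hypothetical example data: a partial trip and three historical trips.
query = [(41.15, -8.61), (41.16, -8.62)]
candidates = [
    [(41.15, -8.61), (41.16, -8.62), (41.17, -8.63)],
    [(41.20, -8.70), (41.21, -8.71)],
    [(41.151, -8.611), (41.14, -8.60)],
]
near = closest_starts(query, candidates, k=2)              # filter by start point
best = min(near, key=lambda trip: dtw_distance(query, trip))  # rank by DTW cost
```

Filtering by starting point first keeps the quadratic-time DTW comparison limited to a small set of candidate trips.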

The model ranked 4th for trip time prediction and 22nd for trajectory prediction.

Instructions

Download Data

  • Download train.csv.zip, test.csv.zip, train.csv, and test.csv, and put them in the ./input folder.
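To inspect the downloaded data, a trajectory can be parsed as sketched below. This assumes the format given in the competition's data description, which is not restated in this README: each trip has a POLYLINE field holding a JSON-style list of [longitude, latitude] pairs, sampled every 15 seconds. The helper names and the sample string are hypothetical.

```python
import json

SAMPLE_INTERVAL_S = 15  # assumed GPS sampling interval, per the competition's data description


def parse_polyline(raw):
    """Parse a POLYLINE string into a list of (lat, lon) tuples.

    The input is assumed to store each point as [longitude, latitude],
    so the order is swapped here.
    """
    return [(lat, lon) for lon, lat in json.loads(raw)]


def trip_time_seconds(points):
    """Trip duration implied by the number of sampled points."""
    return max(len(points) - 1, 0) * SAMPLE_INTERVAL_S


# Hypothetical sample value of a POLYLINE field.
raw = "[[-8.618643,41.141412],[-8.618499,41.141376],[-8.620326,41.14251]]"
points = parse_polyline(raw)
duration = trip_time_seconds(points)
```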

Generate prediction

  • Run ./main.sh to generate the submission. This takes roughly 1 day; the result will be written to ./train/output.

Sample submission

  • ./sample_prediction

Contributors

  • achm6174
