GithubHelp home page GithubHelp logo

tabithaks / capstone-ge-asset-tracking Goto Github PK

View Code? Open in Web Editor NEW
2.0 2.0 0.0 8.4 MB

Columbia University Data Science Master Capstone Project. The goal of this project was to cluster trajectories by shape for later optimization.

Jupyter Notebook 99.82% Python 0.18%
spatiotemporal-data-analysis distance-measures unsupervised-clustering neural-networks autoencoder-neural-network lstm-neural-networks

capstone-ge-asset-tracking's Introduction

Capstone-GE-Asset-Tracking

Trajectories are one way to think about how things move, from medical equipment within hospitals to taxis in cities to planes across the globe. Although each of these things move very differently in scale and pattern, their movements can all be represented as a series of x,y points.

For our masters capstone as Data Science students at Columbia University, my team and I worked with a mentor from GE Research to cluster those series of points, those trajectories, by shape, because before deciding how things should move, to minimize costs of transport, cost of inventory, etc. we need to understand how they do move. This repository contains a selection of our work.

Sitemap

Autoencoders: Neural network autoencoder results

  • LSTM_model_shifted_Tabitha.ipynb: Compares results of training autoencoders to predict the original trajectory versus the trajectory shifted forward by one
  • EvaluationDataset_Tabitha.ipynb: Results of best performing model (as trained on simulated data) on another dataset

DistanceMetrics: Clustering algorithms and distance metrics designed for spaciotemporal data

  • clustereditdist.py: Functions for distance metrics and clustering algorithm (dbscan)
  • EditDistance.py: Functions for edit distance algorithm, designed for spaciotemporal data
  • ClusteringTrajectoriesDemo.ipynb: Demonstration of clustering data with dbscan
  • Clustering Trajectories with Edit Distance.ipynb: Results of clustering simulated datasets with edit distance as distance metrics and dbscan as clustering algorithm

Visualization: Visualization using LineCollection

  • visualizationclusters.py: Script for visualization using python LineCollection library

GE Asset Tracking_Final Report.pdf: Report describing project concept, development process, and results

capstone-ge-asset-tracking's People

Contributors

tabithaks avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.