This project forked from gpuopenanalytics/demo-docker

Demo notebooks inside a docker for end-to-end examples

Dockerfile 0.17% Jupyter Notebook 99.77% Shell 0.06%

This repository contains demo notebooks for the GPU DataFrame (GDF).

Demos

Getting Started with GDF

Notebook: Getting Started with GDF

Uses: pygdf, h2o4gpu

A quick start that loads data into a pandas DataFrame, converts it to a GDF, and then manipulates it.

GDF End-to-End Example on US Census Dataset

Notebook: MapD to PyGDF to H2OAIGLM

Uses: mapd, pymapd, pygdf, h2oaiglm

In this demo, we train 4000 regularized linear regression models on the U.S. Census dataset, with the goal of predicting a person's income from approximately 447 data points (such as age, occupation, and zip code).

By using multiple GPUs, we speed up this process significantly and can train about 40 models per second on a DGX-1 with 8 GPUs.
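As a rough back-of-the-envelope check, the two figures quoted above imply a total training time of under two minutes. The sketch below only restates that arithmetic. The per-GPU rate assumes the work is split evenly across the 8 GPUs, which is an assumption and not something the notebook states.

```python
# Figures quoted in the text above (both are approximate).
num_models = 4000          # regularized linear regression models to train
models_per_second = 40     # reported throughput on a DGX-1
num_gpus = 8               # GPUs in the DGX-1

# Total wall-clock time implied by the quoted throughput.
total_seconds = num_models / models_per_second

# Per-GPU throughput, ASSUMING an even split across GPUs (not stated in the source).
per_gpu_rate = models_per_second / num_gpus

print(f"about {total_seconds:.0f} s in total, about {per_gpu_rate:.0f} models/s per GPU")
```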

PyMapD and PyGDF Demo on NY Taxi Dataset

Notebook: PyMapD and PyGDF Demo on NY Taxi Data Subset

Uses: mapd, pymapd, pygdf

This is a simple example that demonstrates how to use PyMapD to create a table, populate it, and fetch a query result as a GDF. We then show some common PyGDF dataframe operations on the GDF, for example groupby, join, and transforming columns with custom Python code that is just-in-time compiled into GPU code.

Human Activity Recognition using GDF and GPU KMeans

Notebook: Human Activity Recognition using GPU DataFrame and GPU KMeans

Uses: pygdf, kmeans, h2o4gpu

Analyzing smartphone sensor data to determine the activity a person is engaged in.

Our approach uses KMeans from the h2o4gpu package to form the initial clusters. We then label each cluster with its dominant class, i.e. the most common class among the points assigned to that cluster. To classify a new sample, we assign it the class of the closest cluster center.
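The cluster-then-label idea described above can be sketched on the CPU in plain Python. This is a minimal illustration, not the notebook's code: it swaps in a small pure-Python Lloyd's k-means instead of the h2o4gpu KMeans, and the function names (`kmeans`, `label_clusters`, `classify`) and the toy data are hypothetical.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centers):
    """Index of the cluster center closest to point."""
    return min(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's algorithm; a CPU stand-in for the GPU KMeans (assumption)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Move each center to the mean of its members; keep it if the cluster is empty.
        centers = [
            tuple(sum(c) / len(g) for c in zip(*g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return centers


def label_clusters(points, labels, centers):
    """Label each cluster with the most common class among its assigned points."""
    votes = [Counter() for _ in centers]
    for p, y in zip(points, labels):
        votes[nearest(p, centers)][y] += 1
    return [v.most_common(1)[0][0] if v else None for v in votes]


def classify(point, centers, cluster_labels):
    """Predict the class of the closest cluster center."""
    return cluster_labels[nearest(point, centers)]


# Toy data (hypothetical): two well-separated groups of 2-D sensor features.
train_x = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3), (5.0, 5.1), (5.2, 4.9), (4.9, 5.3)]
train_y = ["sitting", "sitting", "sitting", "walking", "walking", "walking"]

centers = kmeans(train_x, k=2)
cluster_labels = label_clusters(train_x, train_y, centers)
print(classify((0.1, 0.1), centers, cluster_labels))
print(classify((5.1, 5.0), centers, cluster_labels))
```

In practice one would typically use more clusters than classes, so that each activity can be covered by several cluster centers.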

PyMapD to H2OGPUML to MapD on FIFA Football Dataset

Notebook: PyMapD to H2OGPUML to MapD on FIFA Football Dataset

Uses: mapd, h2o4gpu

In this demo, we train 4000 regularized linear regression models on the FIFA Football dataset, with the goal of predicting a player's overall rating from different feature sets (such as potential, finishing, and strength).

Setup

Docker Build

To build the Docker images, go into the ./notebook-demo-docker directory and run:

docker build -t goai/base:latest ./base
docker build -t goai/demo:latest ./demo

Run Docker

nvidia-docker run -p 8888:8888 -ti goai/demo:latest

This launches MapD and the notebook server automatically.

Log in to the notebook with your browser by following the URL printed on the terminal.

Open mapd_to_pygdf_to_h2oaiglm.ipynb and hit "Run All" to test. This notebook should run to the end without error.

Diagnostic

To run on specific GPUs, set the NV_GPU environment variable.

For example:

NV_GPU=0 nvidia-docker run -p 8888:8888 -ti goai/demo:latest
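The same launch can be scripted. The sketch below builds the environment and argument list for the command above and supports more than one GPU. It assumes that NV_GPU accepts a comma-separated list of GPU indices, as in nvidia-docker 1.x. The helper name `nvidia_docker_command` is hypothetical, and nothing is executed unless you uncomment the last line.

```python
import os
import subprocess  # only needed if you uncomment the launch line below


def nvidia_docker_command(gpu_ids, image="goai/demo:latest", port=8888):
    """Return (env, argv) for running the demo image on the selected GPUs.

    Hypothetical helper: it mirrors the command shown in this README.
    """
    env = dict(os.environ)
    # ASSUMPTION: NV_GPU takes a comma-separated list of GPU indices.
    env["NV_GPU"] = ",".join(str(g) for g in gpu_ids)
    argv = ["nvidia-docker", "run", "-p", f"{port}:{port}", "-ti", image]
    return env, argv


env, argv = nvidia_docker_command([0, 1])
print("NV_GPU=" + env["NV_GPU"], " ".join(argv))

# To actually start the container (requires nvidia-docker and the built image):
# subprocess.run(argv, env=env, check=True)
```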

Contributors

sklam, vindows, michael-balint, pseudotensor, seibert, andrewseidl, arnocandel, mike-wendt, wamsiv, kkraus14, tomkraljevic
