GithubHelp home page GithubHelp logo

vilmara / notebooks-extended Goto Github PK

View Code? Open in Web Editor NEW

This project forked from rapidsai-community/notebooks-contrib

0.0 1.0 0.0 405.29 MB

RAPIDS Community Notebooks

Dockerfile 0.03% Makefile 0.01% Jupyter Notebook 94.92% Shell 0.01% Python 0.36% HTML 4.68%

notebooks-extended's Introduction

RAPIDS Extended Notebooks

Intro

Welcome to the extended notebooks repo!

The purpose of this collection of notebooks is to help users understand what RAPIDS has to offer, learn how, why, and when including it in data science pipelines makes sense, and help get started using RAPIDS libraries by example.

Many of these notebooks use additional PyData ecosystem packages, and include code for downloading datasets, thus they require network connectivity. If running on a system with no network access, please use the core notebooks repo.

Exploring the Repo

Notebooks live under two subfolders:

  • cpu_comparisons - these notebooks demonstrate "why RAPIDS" by directly comparing compute time between single and multi threaded CPU implementations vs GPU (RAPIDS library) implementations. Of note here is the similarity of RAPIDS APIs to common PyData ecosystem packages like Pandas and scikit-learn. Notebooks in here include:
    • DBScan Demo Full
    • XGBoost
    • Linear Regression
    • Ridge Regression
    • PCA
  • tutorials - contains notebooks showing “how [to master] RAPIDS”:
    • getting_started - to help you quickly learn the basic RAPIDS APIs. It contains these notebooks
      • Dask Hello World
      • Getting Started with cuDF
    • examples - which will show you how to set up and implement your RAPIDS accelerated data science pipelines
      • Dask with cuDF and XGBoost
      • Dask with cuDF and XGBoost from Disk
    • advanced - these notebooks show you the power of the RAPIDS libraries unleashed to solve real world problems.
      • PLASTICC

/data contains small data samples used for purely functional demonstrations. Some notebooks include cells that download larger datasets from external websites.

The /data folder is also symlinked into /rapids/notebooks/extended/data so you can browse it from JupyterLab's UI.

Lastly, a Dockerfile is provided for installing pre-requisite packages & launching JupyterLab.

Installation

Please ensure you meet the pre-requisites

Our container extends the base notebooks container. To build:

git clone https://github.com/rapidsai/notebooks-extended
cd notebooks-extended
docker build -t notebooks-extended .

To run:

docker run -p 8888:8888 -it notebooks-extended

To use previous versions of the notebooks, do a git checkout to the relevant commit.

Contributing

You can contribute to this repo in 5 ways:

  1. Finding bugs in existing notebooks: Sometimes things unexpectedly break in our notebooks. Please raise an issue so we can fix it!
  2. Peer reviewing and benchmarking new and existing notebooks: Both new and existing notebooks need to be checked against current and new RAPIDS library releases. Your help is truly appreciated in making sure that those notebooks not just work, but are efficient, effective and, well, run rapidly
  3. Suggesting notebook content that would interest you: We create notebooks that we think are useful to you, but what do we know? You know what’s useful to you. Please tell us by suggesting a new notebook or upvoting an existing suggestion!
  4. Creating and submitting a new notebook: A good notebooks takes time and effort so it would be great if you would share the load and enhance the community’s knowledge and capabilities
  5. Writing a tutorial blog: Show your expertise in RAPIDS while teaching people how to use RAPIDS in their data science pipeline.

notebooks-extended's People

Contributors

bartleyr avatar beckernick avatar bradreeswork avatar cjnolet avatar genesisfactor avatar krajendrannv avatar mtjrider avatar paulhendricks avatar randerzander avatar raydouglass avatar robocopnixon avatar salonijain27 avatar taureandyernv avatar vibhujawa avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.