GithubHelp home page GithubHelp logo

ahjota / examples-for-data-scientists Goto Github PK

View Code? Open in Web Editor NEW

This project forked from datarobot-community/examples-for-data-scientists

0.0 0.0 0.0 41.36 MB

License: Apache License 2.0

Jupyter Notebook 76.91% Python 1.57% HTML 21.41% R 0.11%

examples-for-data-scientists's Introduction

API Samples for Data Scientists

This repository contains Python notebooks and R Markdown guides for achieving specific tasks using the API.

Start learning with the API Training module.

Usage

For each respective guide, follow the instructions in its own .ipynb or .Rmd file.

Please pay attention to the different DataRobot API Endpoints

The API endpoint you specify for accessing DataRobot is dependent on the deployment environment, as follows:

The DataRobot API Endpoint is used to connect your IDE to DataRobot.

Important Links

Contents

Advanced Tuning

  • Advanced Tuning: how to do advanced tuning. Python R

  • Datetime Partitioning: how to do datetime partitioning. Python R

  • AdvancedOptions object: how to use AdvancedOptions object. Python

Compliance Documentation

  • Getting Compliance Documentation: how to get Compliance Documentation documents. Python R

Feature Lists Manipulation

  • Feature Selection using Feature Importance Rank Ensembling: This notebook shows the benefits of advanced feature selection that uses median rank agggregation of feature impacts across several models created during a run of DataRobot autopilot. Python

  • Advanced Feature Selection: how to do advanced feature selection using all of the models created during a run of DataRobot autopilot. Python R

  • Feature Lists Manipulation: how to create and manipulate custom feature lists and use it for training. Python R

  • Transforming Feature Type: how to transform feature types. Python R

Helper Functions

  • Modeling/Python: A function that helps you search for specific blueprints within a DataRobot's project's repository and then initiates all of these models. Python

  • Time Series/Python: a set of custom functions for AutoTS projects (advanced settings, data quality, filling dates, preprocessing, brute force, cloning, accuracy metrics, modeling, project lists). Python

Initiating Projects

  • Starting a Binary Classification Project: how to initiate a DataRobot project for a Binary Classification target. Python R

  • Starting a Multiclass Project: how to initiate a DataRobot project for a Multiclass Classification target. Python R

  • Starting a Project with Selected Blueprints: how to initiate a DataRobot project manually where the user has the option to choose which models/blueprints to initiate. Python R

  • Starting a Regression Project: how to initiate a DataRobot project for a numerical target. Python R

  • Starting a Time Series Project: how to initiate a DataRobot project for a Time Series problem. This notebook also covers calendars and feature settings for time series projects. Python R

Making Predictions

  • Getting Predictions and Prediction Explanations: how to get predictions and prediction explanations out of a trained model. Python R

  • Batch Prediction API: how to use DataRobot's batch prediction API to get predictions out of a DataRobot deployed model. Python

  • Prediction Explanation Clustering: creating clusters of prediction explanations to better understand patterns in your data. R

Model Evaluation

  • Getting Confusion Chart: how to get the Confusion Matrix Chart. Python R

  • Getting Feature Impact: how to get the Feature Impact scores. Python R

  • Getting Lift Chart: how to get the lift chart. Python R

  • Getting Partial Dependence: how to get partial dependence.Python R

  • Getting ROC Curve: how to get the ROC Curve data. Python R

  • Getting SHAP Values: how to get SHAP values. Python

  • Getting Word Cloud: how to pull the word cloud data. Python R

  • Plotting Prediction Intervals: how to plot prediction intervals for time series projects (single and multi series). Python

Model Management

  • Model Management and Monitoring: how to manage models through the API. This includes deployment, replacement, deletion, and monitoring capabilities. Python R

  • Sharing Projects: how to share projects with colleagues. Python

  • Uploading Actuals to a DataRobot Deployment: how to upload actuals into the DataRobot platform in order to calculate accuracy metrics Python

AI Catalog

  • AI Catalog API Demo: how to create and share datasets in AI Catalog and use them to create projects and run predictions. Python

DataRobot Data Prep

  • DataRobot Data Prep: A collection of functions to interact with DataRobot Data Prep. Python

Rating Tables

    • Rating Tables*: A script that allows you to transform rating tables coming from Generalized Additive 2 Models to Rating Tables. Python

Development and Contributing

If you'd like to report an issue or bug, suggest improvements, or contribute code to this project, please refer to CONTRIBUTING.md.

Code of Conduct

This project has adopted the Contributor Covenant for its Code of Conduct. See CODE_OF_CONDUCT.md to read it in full.

License

Licensed under the Apache License 2.0. See LICENSE to read it in full.

examples-for-data-scientists's People

Contributors

theopetropoulos avatar ammathis avatar ismayc avatar dalilab000 avatar vitali93 avatar emilyswebber avatar fhuthmacher avatar globalmin avatar lindahaviland avatar saracoop avatar timsetsfire avatar mattyanselmo avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.