GithubHelp home page GithubHelp logo

aws-samples / sagemaker-end-to-end-workshop Goto Github PK

View Code? Open in Web Editor NEW
44.0 3.0 13.0 65.47 MB

Hands-on end-to-end workshop to explore Amazon SageMaker.

License: MIT No Attribution

Jupyter Notebook 91.42% Python 8.58%
sagemaker machine-learning aws immersion-day workshops amazon-sagemaker amazon-sagemaker-studio

sagemaker-end-to-end-workshop's Introduction

Amazon SageMaker End to End Workshop

This project was designed to provide an end to end experience on Amazon SageMaker.

It has been adapted from an AWS blog post.

Losing customers is costly for any business. Identifying unhappy customers early on gives you a chance to offer them incentives to stay. In this workshop we'll use machine learning (ML) for automated identification of unhappy customers, also known as customer churn prediction.

In this workshop we will use Gradient Boosted Trees (XGBoost) to Predict Mobile Customer Departure.

The Data

Mobile operators have historical records that tell them which customers ended up churning and which continued using the service. We can use this historical information to train an ML model that can predict customer churn. After training the model, we can pass the profile information of an arbitrary customer (the same profile information that we used to train the model) to the model to have the model predict whether this customer will churn.

The dataset we use is publicly available and was mentioned in Discovering Knowledge in Data by Daniel T. Larose. It is attributed by the author to the University of California Irvine Repository of Machine Learning Datasets. The Data sets folder that came with this notebook contains the churn dataset.

The dataset can be downloaded here.

Resources (Workshop Structure)

To put our model in production we will use some features of SageMaker. Workshop is structured as following:

  1. Introduction: Initial setup on Amazon SageMaker Studio environment;
  2. DataPrep: Load churn dataset, tranform it on Amazon SageMaker Data Wrangler, and export it to S3;
  3. Modeling: Create a XGBoost model using Amazon SageMaker Training Jobs and keep track of each training job with Amazon SageMaker Experiments and also debug our model with Amazon SageMaker Debugger;
  4. Evaluation: Check model accuracy with Amazon SageMaker Processing and explainability using Amazon SageMaker Clarify;
  5. Deployment: Host our model on Model hosting and batch inference on Batch Transform;
  6. Monitoring: Monitor our model for concept drift with SageMaker Model Monitor;
  7. Pipelines: Create a Amazon SageMaker Pipelines to run our entire process.

Getting Started

Although we recommend that you follow and run the Labs in order, this workshop was built in a way that you can skip labs or just do those that interest you the most (e.g. you can just run the last Lab, or just run labs 4 an 5, or lab 1 and 4, etc.). Running the labs in order help us understand the natural flow of an ML project and may make more sense.

This is only possible because we leverage the design of SageMaker where each component is independent from each other (e.g. training jobs, hosting, processing) and customers have the freedom to use those that fit better to their use-case.

The 0-Introduction lab is the only Lab that is strictly required to setup some basic things like creating S3 buckets, installing packages, etc.)


Run any module independently

Remember that the 0-Introduction lab is mandatory, no matter which module you will run. Following ones, can be executed independently (just follow the instructions for setup in each lab):

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.

Questions / Issues?

Please raise an issue on this repo.

sagemaker-end-to-end-workshop's People

Contributors

amazon-auto avatar evandrofranco avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

sagemaker-end-to-end-workshop's Issues

Couse outdate

This workshop was made 3 Years ago. Version of many tools/services are not longer appropriate.
Is there any workshop with the same content recently made?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.