GithubHelp home page GithubHelp logo

hhy5277 / mlops-zoomcamp Goto Github PK

View Code? Open in Web Editor NEW

This project forked from datatalksclub/mlops-zoomcamp

0.0 1.0 0.0 546 KB

Free MLOps course from DataTalks.Club

Python 2.16% Jupyter Notebook 97.26% Dockerfile 0.58%

mlops-zoomcamp's Introduction

MLOps Zoomcamp

Our MLOps Zoomcamp course

Overview

Objective

Teach practical aspects of productionizing ML services โ€” from collecting requirements to model deployment and monitoring.

Target audience

Data scientists and ML engineers. Also software engineers and data engineers interested in learning about putting ML in production

Pre-requisites

  • Python
  • Docker
  • Being comfortable with command line
  • Prior exposure to machine learning (at work or from other courses, e.g. from ML Zoomcamp)
  • Prior programming experience (at least 1+ year)

Timeline

Course start: 16 of May

Syllabus

There are five modules in the course and one project at the end. Each module is 1-2 lessons and homework. One lesson is 60-90 minutes long.

This is a draft and will change.

Module 1: Introduction

  • What is MLOps
  • MLOps maturity model
  • Running example: NY Taxi trips dataset
  • Why do we need MLOps
  • Course overview
  • Environment preparation
  • Homework

Instructors: Alexey Grigorev

Module 2: Experiment tracking

  • Experiment tracking intro
  • Getting started with MLflow
  • Experiment tracking with MLflow
  • Saving and loading models with MLflow
  • Model registry
  • MLflow in practice
  • Homework

Instructors: Cristian Martinez

Module 3: Orchestration and ML Pipelines

  • ML Pipelines: introduction
  • Prefect
  • Turning a notebook into a pipeline
  • Kubeflow Pipelines
  • Homework

Instructors: Theofilos Papapanagiotou

Module 4: Serving

  • Batch vs online
  • For online: web services vs streaming
  • Serving models in Batch mode
  • Web services
  • Streaming (Kinesis/SQS + AWS Lambda)
  • Homework

Instructors: Alexey Grigorev

Module 5: Monitoring

  • ML monitoring VS software monitoring
  • Data quality monitoring
  • Data drift / concept drift
  • Batch VS real-time monitoring
  • Tools: Evidently, Prometheus and Grafana
  • Homework

Instructors: Emeli Dral

Module 6: Best practices

  • Devops
  • Virtual environments and Docker
  • Python: logging, linting
  • Testing: unit, integration, regression
  • CI/CD (github actions)
  • Infrastructure as code (terraform, cloudformation)
  • Cookiecutter
  • Makefiles
  • Homework

Instructors: Alexey Grigorev, Sejal Vaidya

Module 7: Processes

  • CRISP-DM, CRISP-ML
  • ML Canvas
  • Data Landscape canvas
  • MLOps Stack Canvas
  • Documentation practices in ML projects (Model Cards Toolkit)

Instructors: Larysa Visengeriyeva

Project

  • End-to-end project with all the things above

Running example

To make it easier to connect different modules together, weโ€™d like to use the same running example throughout the course.

Possible candidates:

Instructors

  • Larysa Visengeriyeva
  • Cristian Martinez
  • Theofilos Papapanagiotou
  • Alexey Grigorev
  • Emeli Dral
  • Sejal Vaidya

Other courses from DataTalks.Club:

FAQ

I want to start preparing for the course. What can I do?

If you haven't used Flask or Docker

If you have no previous experience with ML

  • Check Module 1 from ML Zoomcamp for an overview
  • Module 3 will also be helpful if you want to learn Scikit-Learn (we'll use it in this course)
  • We'll also use XGBoost. You don't have to know it well, but if you want to learn more about it, refer to module 6 of ML Zoomcamp

I registered but haven't received an invite link. Is it normal?

Yes, we haven't automated it. You'll get a mail from us eventually, don't worry.

If you want to make sure you don't miss anythign:

Is it going to be live?

No and yes. There will be two parts:

  • Lectures: Pre-recorded, you can watch them when it's convenient for you.
  • Office hours: Live on Mondays, but recorded, so you can watch later.

Partners

Thanks to our friends for spreading the word about the course

mlops-zoomcamp's People

Contributors

alexeygrigorev avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.