
ML model

This solution serves a scikit-learn model, trained on the UCI Wine Quality dataset, via FastAPI. The dataset is multivariate and covers red and white variants of the Portuguese “Vinho Verde” wine. The input variables are physicochemical measurements such as pH, chlorides, and residual sugar; the output variable is a quality score between 0 and 10. The services are deployed on a Kubernetes cluster, metrics are collected using Prometheus, and the results are visualised in a Grafana dashboard.

How it works at a high level

  1. Create a containerized REST service to expose the model via a prediction endpoint.
  2. Instrument the server to collect metrics which are exposed via a separate metrics endpoint.
  3. Deploy Prometheus to collect and store metrics.
  4. Deploy Grafana to visualize the collected metrics.
  5. Finally, we’ll simulate production traffic using Locust so that we have some data to see in our dashboard.

Deploying the model with FastAPI

  1. All files are contained within the model/ directory.
  2. train.py – a simple script that produces a serialized model artifact.
  3. app/api.py – defines the routes for the model service, including a model prediction endpoint and a health-check endpoint (a minimal sketch follows this list).
  4. app/schemas.py – defines the expected request and response body schemas for the model prediction endpoint.
  5. The Dockerfile lists the instructions to package the REST server as a container.
  6. The server is deployed on the Kubernetes cluster using the manifests defined in kubernetes/models/.
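
A minimal sketch of what app/schemas.py and app/api.py might contain. The field names, route paths, and artifact filename (model.joblib) are illustrative assumptions, not the repository's exact code:

```python
# Combined sketch of app/schemas.py and app/api.py.
import joblib
from fastapi import FastAPI
from pydantic import BaseModel


# --- schemas.py (sketch): request/response bodies for the prediction endpoint ---
class Wine(BaseModel):
    # Illustrative subset of the UCI Wine Quality features; the real schema
    # would list all eleven physicochemical inputs.
    fixed_acidity: float
    volatile_acidity: float
    residual_sugar: float
    chlorides: float
    pH: float
    alcohol: float


class Rating(BaseModel):
    quality: float  # predicted quality score (0-10)


# --- api.py (sketch): prediction and health-check routes ---
app = FastAPI()
model = joblib.load("model.joblib")  # artifact produced by train.py (filename assumed)


@app.get("/healthcheck")
def healthcheck() -> dict:
    return {"status": "ok"}


@app.post("/predict", response_model=Rating)
def predict(wine: Wine) -> Rating:
    features = [[
        wine.fixed_acidity, wine.volatile_acidity, wine.residual_sugar,
        wine.chlorides, wine.pH, wine.alcohol,
    ]]
    return Rating(quality=float(model.predict(features)[0]))
```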

Instrumenting the service with metrics

  1. Goal – capture metrics and expose them via a /metrics endpoint on the server. This is done with prometheus-fastapi-instrumentator, a library whose FastAPI middleware collects metrics for each request and exposes the data at a specified endpoint (sketch below).
  2. After deploying the model service on the Kubernetes cluster, port-forward to a pod running the server and check the metrics endpoint at 127.0.0.1:3000/metrics.
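
The wiring itself is short. A minimal sketch using the prometheus-fastapi-instrumentator API, where /metrics is the library's default path:

```python
from fastapi import FastAPI
from prometheus_fastapi_instrumentator import Instrumentator

app = FastAPI()

# Attach the request-collecting middleware to the app and expose the
# gathered metrics at GET /metrics (the library's default endpoint).
Instrumentator().instrument(app).expose(app)
```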

Capturing metrics with Prometheus

  1. Once metrics are exposed at the specified endpoint, Prometheus is used to collect and store this data.
  2. helm is used to deploy Prometheus onto our Kubernetes cluster.
  3. Prometheus scrapes the metric data from these endpoints at a configurable interval (15 seconds by default).
  4. This metric data can then be queried by other services, which make requests to Prometheus's HTTP API (see the example below).
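
As an example of point 4, any service (or a quick script) can query Prometheus's HTTP API directly. The host/port and the PromQL expression below are illustrative and assume Prometheus has been port-forwarded to localhost:9090:

```python
import requests

# Ask Prometheus for the per-second request rate over the last 5 minutes.
# The metric name comes from the instrumented FastAPI service and is assumed here.
resp = requests.get(
    "http://127.0.0.1:9090/api/v1/query",
    params={"query": "rate(http_requests_total[5m])"},
)
for series in resp.json()["data"]["result"]:
    print(series["metric"], series["value"])
```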

Visualising results in Grafana

  1. Grafana, hosted at 127.0.0.1:8000, pulls metric data from Prometheus and visualises it.
  2. Queries are made to the Prometheus data source using a query language called PromQL.
  3. A pre-built dashboard is provided in dashboards/model.json.

Simulating production traffic with Locust

  • Locust, a Python load-testing framework, is used to make requests to the model service and simulate production traffic. This behaviour is defined in load_tests/locustfile.py, where three tasks are defined (a sketch follows this list):
    • Make a request to the health-check endpoint.
    • Choose a random example from the wine quality dataset and make a request to the prediction service.
    • Choose a random example from the wine quality dataset, corrupt the data, and make a bad request to the prediction service.
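
A sketch of how load_tests/locustfile.py might define those three tasks. The endpoint paths, payload construction, and dataset filename are assumptions for illustration:

```python
import random

import pandas as pd
from locust import HttpUser, between, task

# Assumed local copy of the UCI wine quality data used to build request payloads.
WINE_DATA = pd.read_csv("winequality-red.csv", sep=";")
FEATURES = [c for c in WINE_DATA.columns if c != "quality"]


class WineServiceUser(HttpUser):
    wait_time = between(1, 5)

    @task
    def health_check(self):
        # Task 1: hit the health-check endpoint.
        self.client.get("/healthcheck")

    @task(5)
    def good_prediction(self):
        # Task 2: send a randomly chosen, well-formed example to the prediction endpoint.
        row = WINE_DATA.sample(1).iloc[0]
        payload = {c.replace(" ", "_"): float(row[c]) for c in FEATURES}
        self.client.post("/predict", json=payload)

    @task
    def bad_prediction(self):
        # Task 3: corrupt one field so the request fails schema validation.
        row = WINE_DATA.sample(1).iloc[0]
        payload = {c.replace(" ", "_"): float(row[c]) for c in FEATURES}
        payload[random.choice(list(payload))] = "not-a-number"
        self.client.post("/predict", json=payload)
```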

Setup instructions

  1. Install kubectl, helm, docker, and minikube.
  2. Spin up a Kubernetes cluster on your local machine using minikube
    • minikube start --driver=docker --memory 4g --nodes 2
  3. Deploy Prometheus and Grafana onto the cluster using helm (example commands after this list)
  4. Connect to the Grafana dashboard
    • kubectl port-forward svc/prometheus-stack-grafana 8000:80 -n monitoring
  5. Visit http://127.0.0.1:8000/
  6. Log in with credentials – username: admin, password: prom-operator
  7. Import the model dashboard – click the ‘+’ sign, select “Import”, and import the JSON file dashboards/model.json.
  8. Deploy the model
    • kubectl apply -f kubernetes/models/
  9. Begin the load test and simulate production traffic
    • kubectl apply -f kubernetes/load_tests/
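
Step 3 does not spell out the helm commands. Assuming the standard kube-prometheus-stack chart (consistent with the prometheus-stack release name and monitoring namespace used elsewhere in this README), they would look roughly like:

    • helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
    • helm repo update
    • helm install prometheus-stack prometheus-community/kube-prometheus-stack -n monitoring --create-namespace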

Tear down instructions

  1. Stop the model REST server
    • kubectl delete -f kubernetes/models/
    • kubectl delete -f kubernetes/load_tests/
  2. Remove the Prometheus stack
    • helm uninstall prometheus-stack -n monitoring
