🩺 Diabetes Prediction Model – MLOps Project

An end-to-end MLOps walkthrough: train a model locally, serve it with FastAPI, containerize it, deploy to Kubernetes, then promote training to a Kubeflow Pipeline and close the loop so the serving pods pull each new model from object storage.

The serving model predicts whether a patient is diabetic (Pima Indians dataset) from five health metrics. The ML side is intentionally simple — the point of the project is the MLOps glue around it.

✅ Model training (local + Kubeflow Pipelines)
✅ FastAPI serving
✅ Dockerized image
✅ Kubernetes deployment (kind for local, manifest for real clusters)
✅ Kubeflow Pipeline (tracked runs, artifacts, metrics)
✅ Closed train → serve loop (pipeline publishes model to seaweedfs; API pulls it on rollout)

🏗️ Architecture

Two paths through the same code: a local dev path for fast iteration, and a closed-loop in-cluster path that's the real MLOps story.

flowchart TB
    subgraph LOCAL["💻 Local dev (laptop)"]
      direction LR
      L1["python train.py"] -->|joblib.dump| L2["diabetes_model.pkl"]
      L2 -->|joblib.load| L3["uvicorn main:app"]
      L3 -->|POST /predict| L4["curl response"]
    end

    subgraph CLUSTER["☸️ kind cluster"]
      direction LR
      subgraph KFP["🔬 kubeflow namespace"]
        direction TB
        K1["train-op<br/>(KFP component)"] -->|joblib.dump| K2["KFP Model artifact"]
        K1 -->|boto3.upload| S[("seaweedfs<br/>s3://mlpipeline/<br/>models/diabetes/latest.pkl")]
        K2 --> K3["evaluate-op"]
        K3 --> K4["KFP Metrics<br/>(accuracy, F1, ...)"]
      end
      subgraph DEFAULT["📦 default namespace"]
        direction TB
        D1["diabetes-api pods"] -->|kubectl rollout restart| D1
        D1 -->|boto3.download<br/>on startup| D2["/tmp/diabetes_model.pkl"]
        D2 -->|joblib.load| D3["FastAPI<br/>POST /predict"]
      end
      S -.->|cross-ns<br/>NetworkPolicy| D1
    end

    LOCAL -.->|same code, different<br/>entry points| CLUSTER

🔄 MLOps Lifecycle

How the classic MLOps lifecycle stages map onto concrete pieces of this repo. The grey edges back from Serve are the iteration loops: a new model triggers a rollout, a drift signal (out of scope here) triggers retraining.

flowchart LR
    classDef done fill:#e6f4ea,stroke:#137333,color:#137333
    classDef stub fill:#fef7e0,stroke:#b06000,color:#b06000
    classDef todo fill:#fce8e6,stroke:#a50e0e,color:#a50e0e

    D["📊 Data<br/>Pima Indians CSV<br/><i>hosted dataset URL</i>"]
    T["🧠 Train<br/>train.py · pipeline.py:train_op<br/><i>RandomForestClassifier</i>"]
    E["📈 Evaluate<br/>pipeline.py:evaluate_op<br/><i>accuracy, precision, recall, F1</i>"]
    R["💾 Register<br/>boto3 upload in train_op<br/><i>s3://mlpipeline/models/diabetes/latest.pkl</i>"]
    P["📦 Package<br/>Dockerfile<br/><i>python:3.12-slim · ~250MB</i>"]
    DP["🚀 Deploy<br/>k8s-deploy-kind.yml<br/><i>kubectl apply</i>"]
    AR["🔁 Auto-rollout<br/>pipeline.py:deploy_op<br/><i>kubernetes client patch · restartedAt annotation</i>"]
    S["🌐 Serve<br/>main.py + FastAPI<br/><i>boto3 download on startup · POST /predict</i>"]
    M["👀 Monitor / drift detection<br/><i>not implemented;<br/>no shadow eval, no CI/CD</i>"]

    D --> T --> E --> R --> AR --> S
    P --> DP --> S
    S -.->|drift / new data| T

    class D,T,E,R,P,DP,AR,S done
    class M todo

Status per stage:

#	Stage	Implementation	Status
1	Data	Hosted Pima Indians CSV (`plotly/datasets`)	✅
2	Train	`train.py` locally, or `kubeflow/pipeline.py:train_op` in KFP	✅
3	Evaluate	`kubeflow/pipeline.py:evaluate_op` logs metrics as KFP artifacts	✅
4	Register	`boto3.upload_file` to seaweedfs inside `train_op`	✅
5	Package	`Dockerfile` builds `diabetes-prediction-model:latest`	✅
6	Deploy	`k8s-deploy-kind.yml` (NodePort) + `k8s-deploy.yml` (LB for real clusters)	✅
7	Serve	`main.py` + FastAPI; pulls model from S3 on startup if `MODEL_S3_URI` set	✅
8	Auto-rollout (deploy trigger)	`kubeflow/pipeline.py:deploy_op` patches the Deployment's `restartedAt` annotation via the k8s API at the end of every pipeline run	✅
9	CI/CD	Not implemented	❌
10	Drift detection / shadow eval	Not implemented	❌

📊 Problem statement

Predict whether a person is diabetic based on:

Pregnancies, Glucose, Blood Pressure, BMI, Age

Trained on the Pima Indians Diabetes Dataset (hosted CSV) with a RandomForestClassifier.

📋 Prerequisites

Tool	Why
Python 3.12	`scikit-learn==1.9.0` requires Python ≥3.11
Docker Desktop	Build/run the serving image
`kind` ≥ 0.32	Local Kubernetes cluster
`kubectl`	Talk to the cluster
Kubeflow Pipelines (KFP)	Only needed for the pipeline + closed-loop sections — install in your kind cluster following kfp docs

🚀 Quick Start (local)

git clone https://github.com/yashyaadav/first-mlops-project.git
cd first-mlops-project

python3.12 -m venv .mlops
source .mlops/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

python train.py                   # produces diabetes_model.pkl
uvicorn main:app --reload         # http://localhost:8000/docs

Sample request:

curl -X POST http://localhost:8000/predict \
  -H 'Content-Type: application/json' \
  -d '{"Pregnancies":2,"Glucose":130,"BloodPressure":70,"BMI":28.5,"Age":45}'
# → {"diabetic": true}

🐳 Dockerize

docker build -t diabetes-prediction-model .
docker run -p 8000:8000 diabetes-prediction-model

The image is python:3.12-slim based (~250MB) and falls back to the locally-baked diabetes_model.pkl when no MODEL_S3_URI env var is set.

☸️ Deploy to Kubernetes

Real cluster

Push the image to a registry, then:

kubectl apply -f k8s-deploy.yml

Uses a LoadBalancer service — replace the image reference with your registry path first.

Local kind cluster

k8s-deploy-kind.yml uses the locally-built image and a NodePort service, so no registry or cloud LB is needed.

kind create cluster --name kubeflow
kind load docker-image diabetes-prediction-model:latest --name kubeflow
kubectl apply -f k8s-deploy-kind.yml

Verify the image landed on the node (kind nodes run containerd, so use crictl):

docker exec -it kubeflow-control-plane crictl images | grep diabetes

Port-forward and hit it at http://localhost:8000:

kubectl port-forward svc/diabetes-api-service 8000:80

Tear down when done: kind delete cluster --name kubeflow.

🔬 Kubeflow Pipeline

Run training as a tracked Kubeflow Pipeline instead of python train.py on your laptop. See kubeflow/ for the pipeline source.

1. Port-forward the KFP UI:

kubectl port-forward -n kubeflow svc/ml-pipeline-ui 8080:80

http://localhost:8080.

2. Set up a dedicated venv for the KFP SDK (~40 transitive deps you don't want mixing with .mlops):

python3.12 -m venv .kfp
source .kfp/bin/activate
pip install --upgrade pip
pip install -r kubeflow/requirements.txt

3. Compile the pipeline:

python kubeflow/pipeline.py       # produces diabetes_pipeline.yaml
deactivate                         # reactivate .mlops when you go back to serving

4. In the KFP UI:

Pipelines → Upload pipeline → pick diabetes_pipeline.yaml, name it diabetes-pipeline-kubeflow.
After the first upload, future iterations use + Upload version (not a new pipeline) — give each version a name like v3-closed-loop.
+ Create run, leave the pipeline params at their defaults (they match the in-cluster seaweedfs), submit.

A successful run produces metrics on evaluate-op and uploads the trained model to seaweedfs:

🔁 Closing the Train → Serve Loop

The pipeline has three components in series:

train_op trains a RandomForestClassifier, joblib-dumps it as a KFP Model artifact, and uploads it to s3://mlpipeline/models/diabetes/latest.pkl on seaweedfs.
evaluate_op loads the model + held-out test set and logs accuracy, precision, recall, f1 as KFP Metrics.
deploy_op patches the diabetes-api Deployment's restartedAt annotation via the in-cluster Kubernetes API, triggering a rolling restart. Fresh pods read MODEL_S3_URI and download the new model on startup.

End result: one successful pipeline run = new model on every API pod, no human in the loop.

One-time setup

1. Mirror the seaweedfs credentials into the default namespace. The serving deployment lives in default but the secret lives in kubeflow:

kubectl create secret generic s3-creds \
  --from-literal=access-key=minio \
  --from-literal=secret-key=minio123

(These are the demo creds Kubeflow Pipelines ships with — don't reuse outside a local cluster.)

2. Allow the API pods to reach seaweedfs across namespaces. Kubeflow's default seaweedfs NetworkPolicy blocks ingress from outside kubeflow, which would otherwise hang the API at startup:

kubectl apply -f k8s-allow-api-to-seaweedfs.yml

3. Grant the pipeline permission to roll out the Deployment. deploy_op runs as the pipeline-runner ServiceAccount in kubeflow; it needs patch on the diabetes-api Deployment in default:

kubectl apply -f k8s-pipeline-rbac.yml

4. (Re)deploy the API with the env-var wiring from k8s-deploy-kind.yml:

kubectl apply -f k8s-deploy-kind.yml

Per training-run workflow

# 1. Run the pipeline in the KFP UI → wait until train-op, evaluate-op,
#    AND deploy-op are all green. deploy-op logs:
#      ✅ Triggered rollout of default/diabetes-api at <timestamp>

# 2. Confirm the rollout completed (optional — happens automatically)
kubectl rollout status deployment diabetes-api

# 3. Test
kubectl port-forward svc/diabetes-api-service 8000:80
curl -X POST http://localhost:8000/predict \
  -H 'Content-Type: application/json' \
  -d '{"Pregnancies":2,"Glucose":130,"BloodPressure":70,"BMI":28.5,"Age":45}'

Troubleshooting

Symptom	Likely cause	Fix
`curl: (52) Empty reply from server`; empty pod logs	API hung downloading from seaweedfs — NetworkPolicy not applied	`kubectl apply -f k8s-allow-api-to-seaweedfs.yml` then `kubectl rollout restart deployment diabetes-api`
Pod `CreateContainerConfigError` on `S3_ACCESS_KEY`	`s3-creds` secret missing in `default` ns	Run the `kubectl create secret` from step 1
`deploy-op` fails with `HTTP 403 Forbidden`	Pipeline RBAC missing	`kubectl apply -f k8s-pipeline-rbac.yml`
`deploy-op` fails with `HTTP 404 Not Found`	`diabetes-api` Deployment doesn't exist in `default` ns	Apply `k8s-deploy-kind.yml` before re-running the pipeline
Pipeline `train-op` fails with `No matching distribution for scikit-learn==1.9.0`	Component base image is pre-3.11 Python	Pipeline already uses `python:3.12-slim`; recompile `pipeline.py` if you edited it
`InconsistentVersionWarning` on API startup	sklearn version drift between training and serving	Both are pinned to 1.9.0 — rebuild + reload the serving image

📁 Project layout

.
├── train.py                       Local training: pulls CSV, trains RF, dumps pkl
├── main.py                        FastAPI app; loads model from S3 if MODEL_S3_URI set, else local pkl
├── requirements.txt               Serving deps (fastapi, sklearn, boto3, ...)
├── Dockerfile                     python:3.12-slim + uvicorn entrypoint
├── k8s-deploy.yml                 Deployment + LoadBalancer Service (real cluster)
├── k8s-deploy-kind.yml            Deployment + NodePort Service + S3 env vars (local kind)
├── k8s-allow-api-to-seaweedfs.yml NetworkPolicy: lets API in `default` reach seaweedfs in `kubeflow`
├── k8s-pipeline-rbac.yml          Role + RoleBinding: lets pipeline-runner patch diabetes-api
├── kubeflow/
│   ├── pipeline.py                KFP v2 pipeline: train_op → evaluate_op → deploy_op (auto-rollout)
│   ├── requirements.txt           kfp SDK
│   └── README.md                  Pipeline-specific walkthrough
└── docs/
    └── kfp-run-success.png        Screenshot of a successful KFP run

🙌 Credits

This project started from the "Build Your First MLOps Project" tutorial by Abhishek Veeramalla — check out his YouTube channel Abhishek.Veeramalla for great DevOps + MLOps content.

I extended it with the Kubeflow Pipelines integration and the closed train → serve loop as a hands-on learning exercise.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🩺 Diabetes Prediction Model – MLOps Project

🏗️ Architecture

🔄 MLOps Lifecycle

📊 Problem statement

📋 Prerequisites

🚀 Quick Start (local)

🐳 Dockerize

☸️ Deploy to Kubernetes

Real cluster

Local kind cluster

🔬 Kubeflow Pipeline

🔁 Closing the Train → Serve Loop

One-time setup

Per training-run workflow

Troubleshooting

📁 Project layout

🙌 Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
docs		docs
kubeflow		kubeflow
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
k8s-allow-api-to-seaweedfs.yml		k8s-allow-api-to-seaweedfs.yml
k8s-deploy-kind.yml		k8s-deploy-kind.yml
k8s-deploy.yml		k8s-deploy.yml
k8s-pipeline-rbac.yml		k8s-pipeline-rbac.yml
main.py		main.py
requirements.txt		requirements.txt
train.py		train.py

Folders and files

Latest commit

History

Repository files navigation

🩺 Diabetes Prediction Model – MLOps Project

🏗️ Architecture

🔄 MLOps Lifecycle

📊 Problem statement

📋 Prerequisites

🚀 Quick Start (local)

🐳 Dockerize

☸️ Deploy to Kubernetes

Real cluster

Local kind cluster

🔬 Kubeflow Pipeline

🔁 Closing the Train → Serve Loop

One-time setup

Per training-run workflow

Troubleshooting

📁 Project layout

🙌 Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages