Serving App (FastAPI + scikit-learn)

A tiny, production-style ML serving skeleton.
It trains a scikit-learn classifier on the Iris demo dataset and serves predictions through FastAPI.

🚀 FastAPI HTTP API (/predict, /predict_batch)
🩺 Health & version endpoints
🧪 Simple training script + reproducible model artifact
🐳 Dockerfile for containerized deploys
🤖 GitHub Actions CI (smoke test)

Quickstart

1) Environment

Conda (recommended)

conda create -n serve_env python=3.11 -y
conda activate serve_env
pip install -r requirements.txt

Or venv

python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

2) Train the model

python -m training.train
# expected: models/model.pkl and models/meta.json
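
Before starting the API, it can help to confirm that training produced both files. The following is a minimal sketch using only the standard library; the helper name `missing_artifacts` is hypothetical and not part of the repository.

```python
from pathlib import Path

# File names taken from the expected output of the training step.
EXPECTED_ARTIFACTS = ("model.pkl", "meta.json")


def missing_artifacts(models_dir="models"):
    """Return the expected artifact names that are absent from models_dir."""
    base = Path(models_dir)
    return [name for name in EXPECTED_ARTIFACTS if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_artifacts()
    if missing:
        print("missing artifacts:", ", ".join(missing))
        print("run: python -m training.train")
    else:
        print("all artifacts present")
```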

3) Run the API

uvicorn serving_app.main:app --host 0.0.0.0 --port 8011
# docs: http://localhost:8011/docs

Endpoints

GET /openapi.json → OpenAPI schema
GET /health → {"ok": true, "model_loaded": true, "version": "0.1.0"}
GET /version → {"version": "0.1.0"}
POST /predict → predict a single row
POST /predict_batch → predict many rows

Requests & Responses

`POST /predict` — single row

Request

{ "features": [5.1, 3.5, 1.4, 0.2], "return_proba": true }

Response

{ "prediction": 0, "proba": [1.0, 0.0, 0.0], "latency_ms": 4.7 }

`POST /predict_batch` — many rows

Request

{ "items": [[5.1,3.5,1.4,0.2],[6.7,3.0,5.2,2.3]], "return_proba": true }

Response

{
  "predictions": [0, 2],
  "proba": [[1.0,0.0,0.0],[0.0,0.0,1.0]],
  "latency_ms": 6.0
}
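
The request shapes above suggest a simple client-side check before sending data: each row is a list of four finite numbers (the Iris dataset has four features), and a batch is a non-empty list of such rows. The sketch below is one possible way to enforce that. The function names are hypothetical, and the server's own validation may differ.

```python
import math
from numbers import Real

N_FEATURES = 4  # the Iris dataset has four features per row


def validate_row(row):
    """Return the row as a list of floats, or raise ValueError."""
    if not isinstance(row, (list, tuple)) or len(row) != N_FEATURES:
        raise ValueError(f"expected {N_FEATURES} features, got {row!r}")
    for value in row:
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, Real):
            raise ValueError(f"non-numeric feature: {value!r}")
        if not math.isfinite(value):
            raise ValueError(f"non-finite feature: {value!r}")
    return [float(v) for v in row]


def validate_batch(items):
    """Validate every row of a batch; an empty batch is rejected."""
    if not isinstance(items, (list, tuple)) or not items:
        raise ValueError("items must be a non-empty list of rows")
    return [validate_row(row) for row in items]
```

A validated row can be used as the "features" value of a `/predict` request, and a validated batch as the "items" value of a `/predict_batch` request.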

Curl Examples

# single
curl -s -X POST http://localhost:8011/predict \
  -H 'Content-Type: application/json' \
  -d '{"features":[5.1,3.5,1.4,0.2], "return_proba": true}' | python -m json.tool

# batch
curl -s -X POST http://localhost:8011/predict_batch \
  -H 'Content-Type: application/json' \
  -d '{"items":[[5.1,3.5,1.4,0.2],[6.7,3.0,5.2,2.3]], "return_proba": true}' | python -m json.tool

# health / version
curl -s http://localhost:8011/health | python -m json.tool
curl -s http://localhost:8011/version | python -m json.tool
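
The same POST calls can be made from Python without extra dependencies. This is a minimal sketch using `urllib.request` from the standard library; `build_request` and `post_json` are hypothetical helper names, and the base URL assumes the port used in the Quickstart.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8011"  # assumption: port from the Quickstart


def build_request(path, payload, base_url=BASE_URL):
    """Build a JSON POST request without sending it."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        base_url + path,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_json(path, payload, base_url=BASE_URL, timeout=5.0):
    """Send the request and decode the JSON response body."""
    request = build_request(path, payload, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)
```

With the server running, `post_json("/predict", {"features": [5.1, 3.5, 1.4, 0.2], "return_proba": True})` sends the same body as the single-row curl command.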

Configuration

MODEL_PATH: overrides the model file location. If unset, the app uses its baked-in default path.

MODEL_PATH=models/model.pkl uvicorn serving_app.main:app --port 8011
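
An override like this is typically resolved once at startup. The sketch below shows one way it could be done. The default value `models/model.pkl` is an assumption for illustration (the actual baked-in path is defined in the app), and `resolve_model_path` is a hypothetical name.

```python
import os
from pathlib import Path

# Assumed default for illustration; the real baked-in path may differ.
DEFAULT_MODEL_PATH = Path("models/model.pkl")


def resolve_model_path(env=None):
    """Return MODEL_PATH from the environment, falling back to the default."""
    env = os.environ if env is None else env
    override = env.get("MODEL_PATH", "").strip()
    # An empty or whitespace-only value is treated as unset.
    return Path(override) if override else DEFAULT_MODEL_PATH
```

Accepting the environment as a parameter keeps the function easy to test without touching `os.environ`.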

Project layout

serving_app/
├─ serving_app/
│  └─ main.py            # FastAPI app: health/version/predict/predict_batch
├─ training/
│  └─ train.py           # trains scikit-learn model, saves to models/
├─ models/               # model artifacts (created by training)
├─ requirements.txt
├─ Dockerfile
├─ Makefile              # optional shortcuts (train/run/predict)
├─ .github/workflows/ci.yml
└─ README.md

Docker

# build (train locally first so that models/ exists)
docker build -t serving-app .

# run (publish container port 8000 on host port 8011)
docker run --rm -p 8011:8000 serving-app
# docs: http://localhost:8011/docs

CI

A lightweight GitHub Actions workflow (.github/workflows/ci.yml) installs dependencies, boots the API, and smoke-tests /health. Extend it with linting, unit tests, or load tests as the project grows.
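
A smoke test of this kind usually has to wait for the server to finish starting. The following sketch polls /health until it reports both "ok" and "model_loaded", the fields shown in the Endpoints section. It is not the workflow's actual script; the function names and retry defaults are assumptions.

```python
import json
import time
import urllib.request


def fetch_health(url, timeout=2.0):
    """GET the health endpoint and decode its JSON body."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def wait_for_health(url, attempts=20, delay=0.5, fetch=fetch_health, sleep=time.sleep):
    """Poll url until it reports ok and model_loaded, or attempts run out."""
    for attempt in range(attempts):
        try:
            body = fetch(url)
            if body.get("ok") and body.get("model_loaded"):
                return body
        except (OSError, ValueError):
            # OSError covers refused connections and timeouts (URLError is a
            # subclass); ValueError covers a body that is not valid JSON.
            pass
        if attempt < attempts - 1:
            sleep(delay)
    raise RuntimeError(f"{url} did not become healthy after {attempts} attempts")
```

In a CI job this could be called as `wait_for_health("http://localhost:8011/health")` after launching uvicorn in the background. The `fetch` and `sleep` parameters exist so the retry logic can be tested without a running server.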

Notes / Next steps

Replace the demo Iris model with your own data and pipeline.
Add stricter input validation as features evolve.
Add logging and metrics (e.g., request IDs, Prometheus) for production.
If you need authentication or rate limiting, add a header check and a token bucket.
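
As a starting point for the last item, here is a minimal, framework-independent sketch of a token bucket and a constant-time token check. It is not wired into the FastAPI app; the class and function names are hypothetical, and a real deployment would likely keep one bucket per client and load the expected token from configuration.

```python
import hmac
import time


class TokenBucket:
    """Allow bursts of up to `capacity` requests, refilled at `rate` tokens per second."""

    def __init__(self, capacity, rate, clock=time.monotonic):
        self.capacity = float(capacity)
        self.rate = float(rate)
        self.clock = clock
        self.tokens = float(capacity)  # start with a full bucket
        self.updated = clock()

    def allow(self, cost=1.0):
        """Consume `cost` tokens and return True, or return False if too few remain."""
        now = self.clock()
        # Refill in proportion to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


def token_is_valid(header_value, expected_token):
    """Compare a presented token with the expected one in constant time."""
    if not header_value:
        return False
    return hmac.compare_digest(header_value.encode(), expected_token.encode())
```

A request handler would reject the call when `token_is_valid` returns False, and respond with a rate-limit error when `allow()` returns False.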
