GitHub - tejas-koliyoor/NYC_project

🚕 NYC Taxi Trip Duration Prediction — Production ML Service

📌 Problem

Taxi trip duration in NYC varies significantly by time, location, and traffic
Inaccurate estimates impact passenger ETAs, fleet utilization, and pricing decisions
Rule-based or manual estimation does not scale to city-level operations

🧠 Approach

Trained a regression model on historical NYC Taxi trip data
Built an API-first inference service using FastAPI
Enforced schema validation to block invalid inputs
Reused the same preprocessing pipeline at training and inference (no train–serve skew)
Deployed as a Dockerized service for reproducibility and consistency

📊 Metric

Target: Trip duration (minutes)
Evaluated using regression metrics (MAE, RMSE) on held-out data
Focused on stable, reliable predictions rather than overfitting for benchmark scores

💼 Business Impact

Enables real-time ETA estimation for passengers
Supports driver allocation and fleet planning
Helps detect abnormal or inefficient trips
Architecture reusable for other prediction services (ETA, demand, churn, fraud)

🚀 Key Takeaway

Transforms real-world NYC taxi data into a reliable, production-ready machine-learning prediction service.

Workflow

⚡ Quickstart

# Create virtual environment 
python -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Start API
uvicorn app.main:app --reload
# Open
http://localhost:8000

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
api		api
data_contracts		data_contracts
docker		docker
experiments		experiments
governance		governance
k8s		k8s
mlruns		mlruns
models		models
monitoring		monitoring
runbooks		runbooks
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
README.md		README.md
SPRINT_DETAILS.docx		SPRINT_DETAILS.docx
model_card.md		model_card.md
nyc_taxi_2025-03_updated_1200_rows.csv		nyc_taxi_2025-03_updated_1200_rows.csv
pytest.ini		pytest.ini
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚕 NYC Taxi Trip Duration Prediction — Production ML Service

📌 Problem

🧠 Approach

📊 Metric

💼 Business Impact

🚀 Key Takeaway

Workflow

⚡ Quickstart

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚕 NYC Taxi Trip Duration Prediction — Production ML Service

📌 Problem

🧠 Approach

📊 Metric

💼 Business Impact

🚀 Key Takeaway

Workflow

⚡ Quickstart

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages