π MS in Data Science β Montclair State University Β |Β π United States
I build end-to-end machine learning systems, real-time data pipelines, and business intelligence solutions.
My work spans predictive modeling, streaming data engineering, LLM research, and production ML deployment.
πΌ Actively seeking full-time roles in Data Science Β· Data Engineering Β· Data Analytics Β· AI/ML Engineer
- π Currently researching on low-resource NLP (Telugu BabyLM)
- π± Deepening expertise in MLOps, RAG pipelines, and distributed data systems
- π― I enjoy turning messy data into clear decisions β whether through a model, a dashboard, or a pipeline
|
End-to-end churn prediction system with statistical validation, ML modeling, explainability, and production deployment.
|
Production-grade e-commerce streaming pipeline ingesting, processing, and storing live events at scale.
|
Segmented 9,943 SaaS customers into 4 actionable groups using RFM scoring and K-Means clustering.
|
Large-scale evaluation of bias and fairness drift across LLM families (GPT, Claude, Gemini, LLaMA, Gemma)
- 50K+ model evaluation runs Β· 5+ LLM families benchmarked
- Distributed experimentation pipelines via SLURM
- Longitudinal statistical analysis for fairness benchmarking
Transformer model trained for Telugu under the BabyLM framework
- GPT-2 style model via HuggingFace Transformers on A100 GPUs
- Custom preprocessing pipelines for low-resource corpora
- Investigated challenges unique to morphologically rich, low-resource scripts
Open to full-time roles in Data Science Β· Data Engineering Β· Data Analytics β feel free to reach out!