Data Scientist based in Singapore with an MSc in Data Science from SUTD. I build end-to-end data pipelines, analytical tools and machine learning models with a focus on real-world, production-realistic design.
Previously a Programmer Analyst at Cognizant, working on payment processing systems using Temenos T24 β giving me a strong foundation in financial data and production software discipline.
- Building a portfolio of data science projects using Singapore-specific datasets
- Extending HDB resale analysis with a predictive pricing model
- Exploring RAG-based GenAI applications for document Q&A
Languages: Python Β· SQL Β· R
Libraries: pandas Β· scikit-learn Β· matplotlib Β· NumPy
Tools: Tableau Β· Jupyter Β· Git Β· Streamlit
Domains: Financial data Β· Fraud detection Β· Predictive modelling Β· EDA
RAG-based application for querying financial documents using natural language.
Upload any PDF β MAS circulars, annual reports, loan agreements β and ask questions.
Returns cited answers with exact page numbers. Includes MLOps monitoring dashboard.
Python LangChain FAISS Groq Streamlit Docker
End-to-end analysis of 228,633 HDB resale transactions (2017β2026) using
official Singapore government data. Covers data cleaning, feature engineering,
exploratory analysis, and an interactive Tableau dashboard.
Python pandas Tableau data.gov.sg
Fraud detection pipeline on 6.3M PaySim transactions using Random Forest,
with anomaly scatter plots, trend charts, risk tables, and a live prediction
form built in Streamlit.
Python scikit-learn Streamlit Random Forest
Open to Data Scientist, ML Engineer, Data Analyst, and Junior AI/GenAI roles in Singapore. Available immediately.
