Hi, I'm Madhushree 👋

Data Scientist based in Singapore with an MSc in Data Science from SUTD. I build end-to-end data pipelines, analytical tools and machine learning models with a focus on real-world, production-realistic design.

Previously a Programmer Analyst at Cognizant, where I worked on payment processing systems using Temenos T24, which gave me a strong foundation in financial data and production software discipline.


🔭 What I'm working on

  • Building a portfolio of data science projects using Singapore-specific datasets
  • Extending HDB resale analysis with a predictive pricing model
  • Exploring RAG-based GenAI applications for document Q&A

🛠️ Skills

Languages: Python · SQL · R
Libraries: pandas · scikit-learn · matplotlib · NumPy
Tools: Tableau · Jupyter · Git · Streamlit
Domains: Financial data · Fraud detection · Predictive modelling · EDA


📂 Featured Projects

RAG-based application for querying financial documents using natural language. Upload any PDF (MAS circulars, annual reports, loan agreements) and ask questions. Returns cited answers with exact page numbers. Includes an MLOps monitoring dashboard.
Python LangChain FAISS Groq Streamlit Docker

End-to-end analysis of 228,633 HDB resale transactions (2017–2026) using official Singapore government data. Covers data cleaning, feature engineering, exploratory analysis, and an interactive Tableau dashboard.
Python pandas Tableau data.gov.sg

Fraud detection pipeline on 6.3M PaySim transactions using Random Forest, with anomaly scatter plots, trend charts, risk tables, and a live prediction form built in Streamlit.
Python scikit-learn Streamlit Random Forest


📫 Let's connect

LinkedIn


Open to Data Scientist, ML Engineer, Data Analyst, and Junior AI/GenAI roles in Singapore. Available immediately.

Popular repositories

  1. fraud-anomaly-detector Public

    Fraud detection pipeline on 6.3M PaySim transactions using Random Forest and Streamlit. Python · scikit-learn · Streamlit

    Python

  2. hdb-resale-explorer Public

    End-to-end analysis of 228,633 Singapore HDB resale transactions (2017–2026). Python · pandas · Tableau Public

    Jupyter Notebook

  3. MadhushreeVaratharaj Public

  4. finsight-rag Public

    RAG-based financial document Q&A assistant with MLOps monitoring. Built for banking and regulatory document analysis.

    Python