I'm a Data Science Master's candidate (expected June 2026) with hands-on experience building end-to-end analytics pipelines at scale β including a 16M+ row machine learning project using U.S. Census data. I specialize in predictive modeling, fairness-aware ML, and translating complex findings into actionable insights for business and policy stakeholders. My background in legal operations and academic administration gives me an edge in data governance, compliance, and communicating with non-technical audiences.
- π Rent Burden Prediction β Fairness & ML analysis on 16M+ ACS PUMS household records (Logistic Regression, Random Forest, Gradient Boosting); equity analysis across race, sex, and geography for HUD policy context
- π Customer Attrition Prediction β XGBoost churn model in R achieving 96% accuracy & AUC 0.99; SHAP values used to surface top business drivers
- π¦ Home Loan Approval Prediction β ML pipeline on 4.25M real HMDA 2023 mortgage applications; XGBoost ROC-AUC 0.9932, 96.3% accuracy across 121 features
- π Marketing Campaign Effectiveness β End-to-end ROI analysis for Nike Inc. using real Google Trends (pytrends API) + SEC EDGAR 10-K filings; ROAS modeling, lag correlation, and 6-panel dashboard in Python
- ποΈ Construction Project Management Dashboard β PostgreSQL analytics + interactive Tableau dashboard with KPI cards, risk scores, and delay trend analysis
- β CafΓ© Sales Data Cleaning & Analysis β SQL pipeline to clean and standardize transaction records; findings visualized in Tableau & Power BI