Ng Xin Yie yievia

Hi there, I'm Xin Yie 👋

🎓 MSc Data Science, Universiti Malaya
🌏 Based in Malaysia
📊 Data science, machine learning, and analytics for real-world decision making

🚀 About Me

With a background in biotechnology and scientific research, I apply data science methods to analyze complex systems and support data-driven decisions.

My work spans time-series modeling, machine learning, and data analytics, combining structured analysis with domain knowledge. I focus on building reproducible workflows, interpretable models, and practical analytics solutions.

🧠 Core Skills

Languages & Tools: Python, R, SQL
Libraries: Pandas, NumPy, Scikit-learn, Statsmodels, Matplotlib, Seaborn
Data & BI Tools: Power BI, Excel, Jupyter Notebook, MySQL Workbench
Tools & Platforms: Git, VS Code
Techniques: Data Cleaning, Feature Engineering, Machine Learning (Classification), Time Series Forecasting (ARIMA, ETS), Econometric Modeling (ARDL, VAR), SQL Window Functions, Data Visualization
Other: Agile, Google-Certified Project Management, Scientific Documentation

📌 Featured Projects

🔹 Fuel Price Pass-Through & Inflation in Malaysia

Research project analyzing how global oil prices and exchange rates influence Malaysian fuel prices and transport inflation. Built multi-source dataset integrating oil prices, FX rates, fuel prices, and CPI Developed time-series forecasting models (ARIMA, ETS) Estimated pass-through effects using ARDL and VAR models Evaluated models using RMSE, MAE, and MAPE

🔹 Logistics Inventory Data Analysis (SQL + PowerBI)

SQL and Power BI analysis of shipment lead times, delay rates, inventory days, and SKU performance for a retail logistics context.

🔹 Recipe Site Traffic Prediction (Machine Learning + KPI)

Classified high-traffic recipes using Logistic Regression, Decision Tree, and Random Forest. Defined a business KPI — High Traffic Conversion Rate (HTCR) — to align model precision with strategy. → Best Model: Logistic Regression (Precision = 0.88, HTCR = 7.13)

🔹 Telecom Customer Churn Analysis