I build end-to-end data pipelines, machine learning models and AI-powered automation systems. Currently working at NI Water as a Data Analyst, where I apply analytics and automation to real-world water utility challenges.
I'm actively seeking new opportunities in Data Engineering, Analytics Engineering or Data Science roles.
Languages
Data Engineering & Analytics
AI & Machine Learning
Automation & Tools
AI-powered water leakage anomaly detection system built for water utilities. Detects pipe bursts and anomalies across 10 DMA zones using Z-score analysis and Groq Llama 3.3 70B for automated plain-English field reports. Deployed as a live Streamlit dashboard with n8n + Gmail automated delivery.
Python Streamlit LangChain dbt SQLite Groq AI n8n
End-to-end ELT analytics engineering pipeline processing 3.5M rows across 112 partitions. Built on a Modern Data Stack with Snowflake, dbt and Power BI. Implements Kimball Star Schema, automated dbt testing and chunked Python ingestion with zero credential leakage.
Snowflake dbt Python Power BI SQL DAX
Forecasting S&P 500 and FTSE All-Share indices across 30 years of data using five time series models — ARIMA, SES, Moving Averages, Simple Forecasts and STL Decomposition. ARIMA achieved RMSE of 82.26 and MAPE of 6.37% on the 1990–2000 horizon.
R ARIMA Time Series ggplot2 Bloomberg Data
Capital Market Approach OLS regression quantifying GBP/USD and GBP/EUR exposure for a FTSE-listed oil & gas company. Model explains 87% of stock return variance. Full econometric diagnostics including Breusch-Pagan, Newey-West robust SEs and Jarque-Bera normality tests.
R OLS Regression Econometrics Bloomberg Terminal Robust SE
Binary classification of product safety outcomes across 50,646 records using Lasso feature selection, Logistic Regression and KNN. Achieved 99.08% accuracy on Subset 2 with AUC of 0.918 on Subset 3. Full 10-fold cross-validation pipeline.
R Lasso Logistic Regression KNN caret glmnet
Classifying online business scores using GAMs and Decision Trees across three field-segmented subsets. Decision Trees outperformed GAMs with up to 94% sensitivity. Lasso used as a two-stage feature selector feeding into both models.
R tidymodels GAM Decision Tree Lasso mgcv
🎓 MSc Financial Analytics — Queen's University Belfast (2023–24)
📜 Bloomberg Market Concepts (BMC) — Bloomberg certification
Open to Data Analyst, Analytics Engineer and Data Science roles — based in Belfast, open to remote.

