GPredict is a Python framework for Gaussian Process (GP) regression, designed for predictive modeling and principled uncertainty quantification.
It implements non-parametric Bayesian regression using customizable mean functions and kernel methods, providing posterior predictions with calibrated confidence intervals and predictive sampling.
Author: Kevin Mota da Costa
Portfolio: https://costakevinn.github.io
LinkedIn: https://linkedin.com/in/costakevinnn
GPredict was built to explore regression from a fully probabilistic perspective.
Unlike parametric models such as neural networks, Gaussian Processes:
- Place a distribution directly over functions
- Provide closed-form posterior inference
- Naturally quantify predictive uncertainty
- Adapt complexity to the data
This project reflects a statistical-first engineering approach to regression modeling.
The model assumes:
f(x) ~ GaussianProcess( m(x), k(x, x') )
Where:
- m(x) → mean function (constant or linear)
- k(x, x') → covariance kernel (RBF or Matern)
- Observational noise is modeled via a diagonal noise matrix
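A minimal sketch of the two kernel families mentioned above (function names and signatures here are illustrative, not GPredict's actual API):

```python
import numpy as np

def rbf_kernel(x1, x2, lengthscale=1.0, variance=1.0):
    """Squared-exponential (RBF): k(x, x') = s^2 * exp(-(x - x')^2 / (2 l^2))."""
    d = x1[:, None] - x2[None, :]          # pairwise differences
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def matern32_kernel(x1, x2, lengthscale=1.0, variance=1.0):
    """Matern 3/2: k(r) = s^2 * (1 + sqrt(3) r / l) * exp(-sqrt(3) r / l)."""
    r = np.abs(x1[:, None] - x2[None, :])  # pairwise distances
    a = np.sqrt(3.0) * r / lengthscale
    return variance * (1.0 + a) * np.exp(-a)
```

Both produce symmetric positive semi-definite Gram matrices with `variance` on the diagonal; the Matérn 3/2 kernel yields rougher sample paths than the infinitely smooth RBF.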
Posterior prediction yields:
- Predictive mean
- Predictive variance
- Full covariance structure
- Posterior samples
This enables non-parametric regression with uncertainty-aware predictions.
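The closed-form posterior can be sketched as follows (a zero prior mean and an inline RBF kernel are assumed for brevity; this is not GPredict's exact implementation):

```python
import numpy as np

def rbf(x1, x2, lengthscale=1.0):
    d = x1[:, None] - x2[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_posterior(x_train, y_train, x_test, kernel=rbf, noise_var=1e-2):
    """Closed-form GP posterior under a zero prior mean.

    mean = K_*n (K_nn + sigma^2 I)^{-1} y
    cov  = K_** - K_*n (K_nn + sigma^2 I)^{-1} K_n*
    """
    K_nn = kernel(x_train, x_train) + noise_var * np.eye(len(x_train))
    K_sn = kernel(x_test, x_train)
    K_ss = kernel(x_test, x_test)
    # Cholesky factorization is cheaper and more stable than explicit inversion
    L = np.linalg.cholesky(K_nn)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = K_sn @ alpha
    v = np.linalg.solve(L, K_sn.T)
    cov = K_ss - v.T @ v
    return mean, cov
```

As the noise variance shrinks, the posterior mean interpolates the training targets and the posterior variance collapses at the observed inputs.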
The framework is modular:
- Mean functions (constant, linear)
- Kernel functions (RBF, Matern)
- Covariance matrix construction
- Noise modeling (heteroscedastic support)
- Posterior computation
- Predictive sampling
- Visualization & result export
All components are cleanly separated for extensibility and experimentation.
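To illustrate the mean-function module, here is one way the constant and linear means could look, plus how a non-zero prior mean folds into the posterior via residuals (names are hypothetical, chosen for the sketch):

```python
import numpy as np

def constant_mean(x, c=0.0):
    """m(x) = c."""
    return np.full(np.shape(x), c, dtype=float)

def linear_mean(x, a=1.0, b=0.0):
    """m(x) = a*x + b."""
    return a * np.asarray(x, dtype=float) + b

def rbf(x1, x2, lengthscale=1.0):
    d = x1[:, None] - x2[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def posterior_mean(x_train, y_train, x_test, mean_fn, noise_var=1e-6):
    """A non-zero prior mean enters through the residuals:
    mu* = m(x*) + K_*n (K_nn + sigma^2 I)^{-1} (y - m(X))."""
    K = rbf(x_train, x_train) + noise_var * np.eye(len(x_train))
    resid = y_train - mean_fn(x_train)
    return mean_fn(x_test) + rbf(x_test, x_train) @ np.linalg.solve(K, resid)
```

Keeping means, kernels, and inference in separate functions is what makes swapping one component (e.g. RBF for Matérn) a one-line change.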
Example: test function sin(x), 20 observations with heteroscedastic noise.
Prior and posterior plots are generated in `plots/`.
- Prior: expresses assumptions before observing data
- Posterior: updates mean and uncertainty using Bayesian inference
The posterior captures:
- Global structure
- Local smoothness
- Increased uncertainty away from data points
Given training data, GPredict computes:
- Predictive mean via covariance-weighted interpolation
- Predictive covariance via posterior update
- Predictive trajectories via sampling from multivariate normal
Noise is incorporated explicitly in the covariance matrix, allowing realistic predictive intervals.
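A sketch of predictive sampling with per-observation noise (the function names are illustrative; heteroscedastic noise is modeled, as described above, by a diagonal noise matrix rather than a single shared variance):

```python
import numpy as np

rng = np.random.default_rng(42)  # fixed seed, mirroring the reproducibility goal

def rbf(x1, x2, lengthscale=0.5):
    d = x1[:, None] - x2[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def sample_posterior(x_train, y_train, noise_var, x_test, n_samples=5):
    """Draw trajectories from the posterior N(mu*, Sigma*).

    `noise_var` holds one variance per observation, so the noise matrix
    is diag(noise_var) instead of sigma^2 I.
    """
    K = rbf(x_train, x_train) + np.diag(noise_var)
    K_sn = rbf(x_test, x_train)
    mean = K_sn @ np.linalg.solve(K, y_train)
    cov = rbf(x_test, x_test) - K_sn @ np.linalg.solve(K, K_sn.T)
    cov += 1e-8 * np.eye(len(x_test))  # jitter keeps sampling numerically stable
    return rng.multivariate_normal(mean, cov, size=n_samples)
```

Each row of the returned array is one plausible trajectory; their spread widens away from the training inputs, which is exactly the uncertainty band the plots visualize.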
- Non-parametric regression modeling
- Kernel engineering (RBF, Matern)
- Bayesian posterior computation
- Heteroscedastic noise handling
- Predictive sampling
- Visualization of uncertainty bands
- Reproducible probabilistic workflows
- Modular separation of kernels, means, and inference
- Explicit covariance construction for transparency
- Analytical posterior computation (no black-box frameworks)
- Fixed random seed for reproducibility
- Structured output generation (data, plots, results)
- Python
- NumPy
- Linear algebra (matrix inversion & covariance operations)
- Bayesian inference
- Kernel methods
- Scientific visualization
Run `python3 main.py`.
Outputs:
- `data/` → observational datasets
- `plots/` → GP prior & posterior visualizations
- `results/` → numerical predictive summaries
GPredict/
├── data/ # Observations
├── plots/ # Prior and posterior plots
├── results/ # Predictive summaries
├── gp.py # GP fitting and prediction
├── kernels.py # Kernel definitions
├── means.py # Mean functions
├── utils.py # Plotting utilities
├── examples.py # Example workflows
└── main.py # Entry point
This project is part of my Machine Learning portfolio: 👉 https://costakevinn.github.io
MIT License — see LICENSE for details.

