Energy Price Prediction Project

Overview

This project aims to predict U.S. monthly energy prices using machine learning and time series forecasting techniques. The dataset is sourced from the U.S. Energy Information Administration (EIA). The project follows a structured approach, including data wrangling, exploratory data analysis (EDA), preprocessing, and modeling.

Repository Structure

📂 springboard_dsc_capstone3
│-- 📂 data                    # Raw and processed datasets
│-- 📂 figures                 # Model forecast figures
│-- 📂 notebooks               # Jupyter notebooks with analysis and model development
│-- 📂 scripts                 # Python scripts for converting data and defining functions
|-- Capstone Three Report.pdf  # Project report with final findings
|-- glossary.pdf               # Explanation from EIA of terms used in data
|-- model-metrics.pdf          # Information on final models
│-- README.md                  # Project documentation

Data Sources

Data is collected from the U.S. Energy Information Administration (EIA).
Includes historical monthly energy prices and related economic indicators.

Methodology

The project is divided into four main phases:

1. Data Wrangling

Load and clean the dataset.
Handle missing values and outliers.

2. Exploratory Data Analysis (EDA)

Visualize time series trends.
Perform stationarity tests (ADF, KPSS).
Examine correlations using heatmaps and scatter plots.

3. Data Preprocessing

Scale and normalize features.
Create lag features for machine learning models.

4. Modeling

ARIMA/SARIMAX for traditional time series forecasting.
VAR (Vector Autoregression) for multi-variable forecasting.
Facebook Prophet for automated time series forecasting.
Exponential Smoothing for trend-based forecasting.
Cross-validation using one-step-ahead forecasting.

Evaluation Metrics

Mean Absolute Error (MAE)
Root Mean Squared Error (RMSE)
Akaike Information Criterion (AIC)

Results & Findings

Best performing models were SARIMAX and Exponential Smoothing.

Future Work

Include additional economic indicators as exogenous variables.
Experiment with deep learning models such as LSTMs.
Improve feature engineering for better predictive power.

Contact

For any questions, feel free to reach out or open an issue in this repository.

Author: [Ben Takacs]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Energy Price Prediction Project

Overview

Repository Structure

Data Sources

Methodology

1. Data Wrangling

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Modeling

Evaluation Metrics

Results & Findings

Future Work

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
__pycache__		__pycache__
data		data
figures		figures
notebooks		notebooks
scripts		scripts
Capstone Three Report.pdf		Capstone Three Report.pdf
README.md		README.md
capstone-three-presentation.key		capstone-three-presentation.key
dw-eda-assignment.pdf		dw-eda-assignment.pdf
glossary.pdf		glossary.pdf
model-metrics.pdf		model-metrics.pdf
prepro-modeling-assignment.pdf		prepro-modeling-assignment.pdf

Folders and files

Latest commit

History

Repository files navigation

Energy Price Prediction Project

Overview

Repository Structure

Data Sources

Methodology

1. Data Wrangling

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Modeling

Evaluation Metrics

Results & Findings

Future Work

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages