Capstone project for Udacity Machine Learning Nanodegree
I came across the following dataset during a online coding challenge.
This led me to explore it further and led to the following:
Author's best R2 scores (benchmark):
- Training Data : 0.97
- Testing Data : 0.58
My model's scores :
- Training Data : 1.00
- Testing Data : 0.60
Python version: 3.6
The software requirements can be met by installing the Anaconda distribution of python.
The main libraries used are:
- numpy
- pandas
- matplotlib
- seaborn
- scikit-learn
- scipy
To run the notebook, navigate to the directory and type
jupyter notebook predict_energy_consumption.ipynb
