Surrogate Modeling

Surrogate modeling is an umbrella term for approximations of building energy models using machine learning (ML) or algorithmic approaches, as compared to (slower) simulations. This repository contains utilities and models to replicate the ResStock dataset using surrogate modeling techniques.

This code was developed in and depends on Databricks.

Documentation

The steps run the training pipeline can be found here, and the details on versioning and model artifacts can be found here.

More technical documentation is available in the following locations:

Deprecation Notice

There are two deprecated versions of the model stored in deprecated/ that are no longer maintained.

Installation

This repository is designed to be run on Databricks and follows the conventions of the dml-sample-transmission-line repository. Please review its README for details on setup and usage patterns.

We currently run this project on clusters with DB 14.3 LTS runtime (Python 3.10).

Cluster Setup

To configure the cluster:

Upload install-db-requirements.sh to Advanced Options > Init Script in your cluster settings.
Restart the cluster for changes to take effect.

Updating Requirements

Whenever you add a requirement to pyproject.toml, follow these steps:

Run poetry update.
Generate requirements files with dml-gen-requirements as described in the dml-sample-transmission-line README.

Spell-checker

This repo has cspell configured in cspell.json for optional (highly recommended) spell-checking. If you're using VSCode, all you need is to install the cspell extension and spelling issues will be highlighted. If you're using another IDE, install cspell then run spell-checking in your command line using cspell .. To add new word(s) to the dictionary in VSCode, select some text > Right click > Spelling > Add Words to Dictionary. In other IDEs, words may need to be added manually to /.cspell/sumo_dict.txt. For more details, see the cspell docs.

Repository Structure

├── LICENSE
├── README.md
├── deprecated/                   # Old, unmaintained models
├── docs/                         # Documentation
│   ├── Building_towards_an_MVP.pdf  # Model iteration notes pre-v1.0.0, now this is in release notes
│   ├── architecture.md
│   └── features_upgrades.md
├── images/                       # Architecture diagrams and visuals
├── install-db-requirements.sh    # Cluster init file, used to install `requirements-db-14.3.txt` on databricks
├── model_artifacts/              # Stored model artifacts, including data params and evaluation results
├── notebooks/                    # Jupyter notebooks for analysis
├── poetry.lock                   # Poetry files
├── pyproject.toml                # 
├── scripts                       # Data extraction, training, and evaluation scripts
│   ├── megastock/                # Megastock-specific scripts (See scripts/megastock/README.md)
│   └── deprecated/               # Old scripts, no longer used
├── src/                          # Source code for the surrogate model
│   ├── utils/                    # General utility functions
│   ├── globals.py                # Global variables
│   ├── surrogate_model.py        # Main NN model implementation
│   ├── datagen.py                # Generates training data to feed into NN
│   ├── feature_utils.py          # Feature transformation utilities, used by main training pipeline and megastock
│   ├── versioning.py             # Version control utilities
├── tests/                        # Unit tests
└── requirements-*.txt            # Dependencies

License

This project is licensed under the terms specified in LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Surrogate Modeling

Documentation

Deprecation Notice

Installation

Cluster Setup

Updating Requirements

Spell-checker

Repository Structure

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 9

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
.cspell		.cspell
.databricks		.databricks
.github/workflows		.github/workflows
deprecated		deprecated
docs		docs
images		images
model_artifacts		model_artifacts
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.env		.env
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cspell.json		cspell.json
install-db-requirements.sh		install-db-requirements.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements-db-14.3.txt		requirements-db-14.3.txt
requirements-test-14.3.txt		requirements-test-14.3.txt

License

rewiringamerica/surrogate_modeling

Folders and files

Latest commit

History

Repository files navigation

Surrogate Modeling

Documentation

Deprecation Notice

Installation

Cluster Setup

Updating Requirements

Spell-checker

Repository Structure

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 9

Uh oh!

Languages

Packages