GitHub - jhuebotter/predictive_control

Predictive Control Project

This work is developed by Justus Huebotter in 2022 as part of the SPIKEFERENCE project, co-founded by the Human Brain Project (HBP) Specific Grant Agreement 3 (ID: 945539) and the Donders Institute for Brain, Cognition and Behaviour.

In this project, we revisit policy optimization for low-level continuous control tasks and derive our methods from deep Active Inference (dAIF) In particular, we use prediction errors to learn the dynamics of the system in a recurrent transition model (see figure left). We show that we can then use this model to drive the learning of an amortized policy model (see figure right) for goal reaching by imagining state trajectory rollouts, even when interaction with the real environment is limited.

The exact method used in this code base is explained in more detail in:

J. Huebotter, S. Thill, M. van Gerven, P. Lanillos (2022): Learning Policies for Continuous Control via Transition Models, 3rd International Workshop on Active Inference

This publication is also available here.

Using the code

To use the code please clone this git via:

git clone https://github.com/jhuebotter/predictive_control.git

To install the required packages create a new local environment and run:

pip install -r requirements.txt

To enable wandb logging you will have to sign up at https://wandb.ai and call

wandb login

After this, the code should be executed by running :

python pretrain_adaptive_model.py

If desired, the parameters for the experiments can be changed in the config.yaml file. There are two environments currently supported: plane and reacher2. Please see below for example results for both environments with either static or moving targets.

Example Results

Continuous control in a planar linear environment

Continuous control of a planar robot arm

The auto-regressive prediction model learns to accurately forecast the state trajectory based on control inputs:

Name		Name	Last commit message	Last commit date
Latest commit History 136 Commits
baselines		baselines
figures		figures
notebooks		notebooks
src		src
.gitconfig		.gitconfig
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
config_snn.yaml		config_snn.yaml
config_snn_pol.yaml		config_snn_pol.yaml
config_snn_pol_cstork.yaml		config_snn_pol_cstork.yaml
config_snn_trans_cstork.yaml		config_snn_trans_cstork.yaml
evalue_adaptive_models.py		evalue_adaptive_models.py
policynet_baseline.cpt		policynet_baseline.cpt
pretrain_adaptive_models.py		pretrain_adaptive_models.py
pretrain_spiking_policy.py		pretrain_spiking_policy.py
pretrain_spiking_transition.py		pretrain_spiking_transition.py
requirements.txt		requirements.txt
transitionnet_baseline.cpt		transitionnet_baseline.cpt
two_d_plane_AIF_control.py		two_d_plane_AIF_control.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predictive Control Project

Using the code

Example Results

Continuous control in a planar linear environment

Continuous control of a planar robot arm

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Predictive Control Project

Using the code

Example Results

Continuous control in a planar linear environment

Continuous control of a planar robot arm

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages