    Q-learning with Adjoint Matching


    [Paper]  |  [Website]

(Teaser figure: bar plot)

Overview

Q-learning with Adjoint Matching (QAM) uses Adjoint Matching to efficiently fine-tune a flow policy toward the optimal behavior-regularized policy.

Installation

pip install -r requirements.txt

Reproducing paper results

We include example commands below for all three variants of our method on cube-triple-play-singletask-task2-v0. We also release our experiment data (see exp_data/README.md) and include scripts for generating experiment commands in experiments/*.py. We hope this helps facilitate and speed up future research!

# QAM_EDIT
MUJOCO_GL=egl python main.py --run_group=reproduce --agent=agents/qam.py --tags=QAM_EDIT --seed=10001  --env_name=cube-triple-play-singletask-task2-v0 --sparse=False --horizon_length=5 --agent.action_chunking=True --agent.inv_temp=3.0 --agent.fql_alpha=0.0 --agent.edit_scale=0.1

# QAM_FQL
MUJOCO_GL=egl python main.py --run_group=reproduce --agent=agents/qam.py --tags=QAM_FQL --seed=10001 --env_name=cube-triple-play-singletask-task2-v0 --sparse=False --horizon_length=5 --agent.action_chunking=True --agent.inv_temp=10.0 --agent.fql_alpha=300.0 --agent.edit_scale=0.0

# QAM
MUJOCO_GL=egl python main.py --run_group=reproduce --agent=agents/qam.py --tags=QAM --seed=10001 --env_name=cube-triple-play-singletask-task2-v0 --sparse=False --horizon_length=5 --agent.action_chunking=True --agent.inv_temp=3.0 --agent.fql_alpha=0.0 --agent.edit_scale=0.0
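The three variants above differ only in a few agent hyperparameters (`inv_temp`, `fql_alpha`, `edit_scale`). As a minimal sketch of how such command lists can be generated (this is a hypothetical helper, not the actual `experiments/*.py` scripts, and the `make_commands` name is our own), one could sweep variants and seeds like so:

```python
# Hypothetical command generator mirroring the three example commands above.
# Shared flags, copied verbatim from the README commands.
BASE = (
    "MUJOCO_GL=egl python main.py --run_group=reproduce --agent=agents/qam.py "
    "--env_name=cube-triple-play-singletask-task2-v0 --sparse=False "
    "--horizon_length=5 --agent.action_chunking=True"
)

# Variant-specific hyperparameters, as listed in the example commands.
VARIANTS = {
    "QAM_EDIT": {"inv_temp": 3.0, "fql_alpha": 0.0, "edit_scale": 0.1},
    "QAM_FQL":  {"inv_temp": 10.0, "fql_alpha": 300.0, "edit_scale": 0.0},
    "QAM":      {"inv_temp": 3.0, "fql_alpha": 0.0, "edit_scale": 0.0},
}

def make_commands(seeds):
    """Return one shell command string per (variant, seed) pair."""
    cmds = []
    for tag, hp in VARIANTS.items():
        for seed in seeds:
            cmds.append(
                f"{BASE} --tags={tag} --seed={seed} "
                f"--agent.inv_temp={hp['inv_temp']} "
                f"--agent.fql_alpha={hp['fql_alpha']} "
                f"--agent.edit_scale={hp['edit_scale']}"
            )
    return cmds

if __name__ == "__main__":
    for cmd in make_commands(seeds=[10001]):
        print(cmd)
```

Printing the generated commands (rather than launching them directly) makes it easy to pipe them into a job scheduler or a shell script.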

How do I obtain the 100M datasets for puzzle-4x4 and cube-quadruple?

Please follow the instructions here to obtain the large datasets.

Acknowledgments

This codebase is built on top of QC.

BibTeX

@article{li2026qam,
  author  = {Qiyang Li and Sergey Levine},
  title   = {Q-learning with Adjoint Matching},
  journal = {arXiv preprint arXiv:2601.14234},
  year    = {2026},
  url     = {http://arxiv.org/abs/2601.14234},
}
