HRM-Mini

A minimal implementation of the Hierarchical Recurrent Model (HRM).

Install requirements

Ensure Python and PyTorch are installed and that your machine has at least 1 GPU with at least 40 GiB of VRAM in total. Then install the pip dependencies, which should take about 10 minutes:

pip install -r requirements.txt
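As a pre-flight check for the hardware requirement above, the sketch below sums the memory of all visible CUDA devices and compares the total with 40 GiB. It assumes PyTorch is importable; the helper names (`total_vram_gib`, `meets_requirement`, `visible_gpu_memory`) are illustrative and not part of this repository.

```python
GIB = 1024 ** 3
REQUIRED_GIB = 40  # total VRAM stated in the requirement above


def total_vram_gib(per_gpu_bytes):
    """Sum per-GPU memory sizes (in bytes) and convert to GiB."""
    return sum(per_gpu_bytes) / GIB


def meets_requirement(per_gpu_bytes, required_gib=REQUIRED_GIB):
    """True if there is at least one GPU and the total VRAM is sufficient."""
    return len(per_gpu_bytes) >= 1 and total_vram_gib(per_gpu_bytes) >= required_gib


def visible_gpu_memory():
    """Return total memory in bytes for each visible CUDA device (empty if none)."""
    try:
        import torch  # imported lazily so the arithmetic helpers work without it
    except ImportError:
        return []
    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_properties(i).total_memory
        for i in range(torch.cuda.device_count())
    ]


if __name__ == "__main__":
    sizes = visible_gpu_memory()
    print(f"{len(sizes)} GPU(s), {total_vram_gib(sizes):.1f} GiB total")
    print("OK" if meets_requirement(sizes) else "Below the stated requirement")
```

The memory a driver reports can be slightly below a card's nominal size, so a near miss is better treated as a warning than as a hard failure.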

W&B Integration

This project uses Weights & Biases for experiment tracking and metric visualization. Ensure you're logged in:

wandb login

Download datasets

The following commands pull the required datasets from Hugging Face repositories.

mkdir downloaded-datasets
hf download --repo-type dataset --local-dir ./downloaded-datasets/maze-30x30-hard-1k sapientinc/maze-30x30-hard-1k
hf download --repo-type dataset --local-dir ./downloaded-datasets/sudoku-extreme-1k sapientinc/sudoku-extreme-1k
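If you prefer to fetch the datasets from Python, the same downloads can be sketched with `snapshot_download` from the `huggingface_hub` package. This assumes `huggingface_hub` is installed; `local_dir_for` and `download_all` are hypothetical helper names, and the directory layout simply mirrors the commands above.

```python
from pathlib import Path

# Dataset repositories named in the commands above.
DATASETS = [
    "sapientinc/maze-30x30-hard-1k",
    "sapientinc/sudoku-extreme-1k",
]


def local_dir_for(repo_id, root="downloaded-datasets"):
    """Mirror the CLI layout: <root>/<repository name>."""
    return Path(root) / repo_id.split("/")[-1]


def download_all(root="downloaded-datasets"):
    """Download every dataset repository into its own local directory."""
    # Imported lazily so the path helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    for repo_id in DATASETS:
        target = local_dir_for(repo_id, root)
        target.mkdir(parents=True, exist_ok=True)
        snapshot_download(repo_id=repo_id, repo_type="dataset", local_dir=str(target))
```

Calling `download_all()` performs the downloads and needs network access; nothing is fetched when the module is merely imported.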

Download checkpoints (optional)

Run the command below to download the trained Sudoku checkpoint used for the dynamics analysis.

hf download --repo-type model --local-dir ./checkpoints/1000_tuned_hrm_new cl-agi/hrm-mini

Note: Running on a single GPU

The original experiments ran on one node with 8 H100 GPUs, where Sudoku takes about 30 minutes. To run on a single GPU, set --nproc-per-node 1 on the command line and multiply the local batch size by 8, e.g. local_batch_size=768. Sudoku will then take ~4 hours per experiment on a single H100. The script runs 3 seeds by default; append seeds=[1] to run a single seed.
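The arithmetic behind this note can be sketched as follows: the global batch (local batch size times GPU count) is kept constant, and wall-clock time is estimated by assuming GPU-hours stay constant. The per-GPU value of 96 is inferred from the example above (768 / 8) and is an assumption, not a value read from the config files; the function names are illustrative.

```python
def scaled_local_batch_size(local_batch_size, from_gpus, to_gpus):
    """Keep local_batch_size * num_gpus (the global batch) constant."""
    global_batch = local_batch_size * from_gpus
    if global_batch % to_gpus != 0:
        raise ValueError("global batch is not divisible by the target GPU count")
    return global_batch // to_gpus


def estimated_hours(hours_on_from, from_gpus, to_gpus):
    """Wall-clock estimate assuming GPU-hours stay constant (linear scaling)."""
    return hours_on_from * from_gpus / to_gpus


# 96 per GPU on 8 GPUs is inferred from the example above (768 / 8);
# it is an assumption, not a value read from the config files.
local_bs = scaled_local_batch_size(96, from_gpus=8, to_gpus=1)
hours = estimated_hours(0.5, from_gpus=8, to_gpus=1)

command = [
    "torchrun", "--nproc-per-node", "1", "train.py",
    "--config-name", "tuned_hrm",
    f"local_batch_size={local_bs}",
    "seeds=[1]",  # single seed instead of the default three
]
print(" ".join(command))
print(f"estimated wall-clock: ~{hours:g} h")
```

In shells that treat square brackets as glob patterns (zsh, for example), quote the seed override as 'seeds=[1]' when typing the command by hand.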

Launch main experiment

Sudoku-Extreme, 1000 examples. This should take about 4 GPU-hours on H100 (~30 min on 8 H100 GPUs, ~4 hr on 1 H100 GPU).

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm

Ablation studies

HRM Full: See above

Recurrent Transformer

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_rt

No dual timescale

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm arch.name=hrm_ablations@HRM arch.L_cycles=1 arch.H_cycles=7

Tied H-L parameters (TRM-style)

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm arch.name=hrm_ablations@HRM +arch.dual_module=False

No H-H links

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm arch.name=hrm_ablations@HRM +arch.hh_link=False

MLP Mixer

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm +arch.is_mlp_mixer=True
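To script the ablations rather than paste each command, the variants in this section can be collected in one table and rendered into command lines. This is a sketch: the dictionary keys are hypothetical labels chosen here, while the config names and overrides are copied from the commands above. The overrides appear to use Hydra syntax, where a leading + adds a key that is absent from the base config; that reading is an assumption.

```python
import shlex

# Thread limits used by every command in this README.
ENV_PREFIX = "OMP_NUM_THREADS=1 MKL_NUM_THREADS=1"

# label -> (config name, extra overrides); labels are illustrative only.
ABLATIONS = {
    "hrm_full": ("tuned_hrm", []),
    "recurrent_transformer": ("tuned_rt", []),
    "no_dual_timescale": (
        "tuned_hrm",
        ["arch.name=hrm_ablations@HRM", "arch.L_cycles=1", "arch.H_cycles=7"],
    ),
    "tied_hl_parameters": (
        "tuned_hrm",
        ["arch.name=hrm_ablations@HRM", "+arch.dual_module=False"],
    ),
    "no_hh_links": (
        "tuned_hrm",
        ["arch.name=hrm_ablations@HRM", "+arch.hh_link=False"],
    ),
    "mlp_mixer": ("tuned_hrm", ["+arch.is_mlp_mixer=True"]),
}


def command_for(name, nproc=8):
    """Render the shell command line for one ablation."""
    config, overrides = ABLATIONS[name]
    argv = [
        "torchrun", "--nproc-per-node", str(nproc),
        "train.py", "--config-name", config, *overrides,
    ]
    return f"{ENV_PREFIX} {shlex.join(argv)}"


if __name__ == "__main__":
    for label in ABLATIONS:
        print(f"# {label}")
        print(command_for(label))
```

Passing `nproc=1` to `command_for` gives the single-GPU form; the batch-size and seed overrides from the single-GPU note still have to be appended separately.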

Dynamics and Visualization

Install Jupyter and open visualizations.ipynb. To evaluate a different checkpoint, change the checkpoint path in the first cell. Running the notebook should take several minutes.

Other tasks

Maze 30x30

OMP_NUM_THREADS=1 MKL_NUM_THREADS=1 torchrun --nproc-per-node 8 train.py --config-name tuned_hrm data=maze

For 3-SAT, switch to the SAT branch to train.

Docker support (optional)

We use this Docker image for our experiments. You can use it to reproduce our setup exactly and to check the exact software versions.

docker pull sapientai/pytorch-docker:26.02.14.hopper

