MARLAX

JAX-first cooperative multi-agent reinforcement learning.

The first target is a small, fast tabular Q-learning stack:

batched cooperative gridworld environments
independent Q-learning agents with shared team rewards
JAX scan-friendly training loops

The longer-term direction is a method zoo and environment zoo for cooperative MARL.

Environment

conda env create -f environment.yml
conda run -n marlax uv pip install --python /home/dev/miniconda3/envs/marlax/bin/python -e ".[gpu,dev,storage,viz]"

Checks

conda run -n marlax python -m pytest -q
XLA_PYTHON_CLIENT_PREALLOCATE=false conda run -n marlax python experiments/coop_grid_q_learning/run.py

Gallery

python -m http.server 8000 --directory site

Figure Style

Use STYLE.md for diagnostic plot styling.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
experiments/coop_grid_q_learning		experiments/coop_grid_q_learning
plots/coop_grid_q_learning/latest		plots/coop_grid_q_learning/latest
site		site
src/marlax		src/marlax
stores/coop_grid_q_learning/latest		stores/coop_grid_q_learning/latest
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
STYLE.md		STYLE.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MARLAX

Environment

Checks

Gallery

Figure Style

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MARLAX

Environment

Checks

Gallery

Figure Style

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages