Describe the bug
When I run
python mava/systems/q_learning/anakin/rec_qmix.py env=smax env.scenario.task_name=2s3z system.seed=1 system.total_timesteps=2000000
multiple times, the results differ from run to run.
However, since the random seed is fixed (system.seed=1), I would expect identical results across runs. How can I make the results reproducible?