Hi, thanks for the great work.
I would like to ask whether there is an available implementation or example for offline SRPO training. The current training scripts appear to support only online RL with rollouts in the LIBERO environment.
Is there any code, configuration, or recommended setup for running SRPO purely in an offline setting without environment interaction?
Thanks!
Hi, thanks for the great work.
I would like to ask whether there is an available implementation or example for offline SRPO training. The current training scripts appear to support only online RL with rollouts in the LIBERO environment.
Is there any code, configuration, or recommended setup for running SRPO purely in an offline setting without environment interaction?
Thanks!