Skip to content

fix(pu): use normalize_advantages_cross_batch for opd

5446e4b
Select commit
Loading
Failed to load commit list.
Open

feature(pu): add on_policy_distillation #43

fix(pu): use normalize_advantages_cross_batch for opd
5446e4b
Select commit
Loading
Failed to load commit list.