Pull requests: opendilab/LightRFT
Pull requests list
fix(wzn): fix format reward contamination from prompt template in gsm8k/geo3k
bug
Something isn't working
#52
opened Mar 14, 2026 by
zunian-wan
3 of 13 tasks
feature(sunjx): implement dynamic sampling strategy in DAPO
enhancement
New feature or request
#51
opened Mar 7, 2026 by
Jiaxuan-Sun
feature(sunjx): add GSPO and GMPO algorithms support
enhancement
New feature or request
#50
opened Mar 4, 2026 by
Jiaxuan-Sun
feature(nyz): transfer meme rl training demo
enhancement
New feature or request
#49
opened Feb 26, 2026 by
PaParaZz1
feature(sunjx): fix fire sampling bugs in generate_fn
bug
Something isn't working
#48
opened Feb 26, 2026 by
Jiaxuan-Sun
dev(hansbug): add math PRM code
documentation
Improvements or additions to documentation
enhancement
New feature or request
feature(pu): add init version of on_policy_distillation
enhancement
New feature or request
#43
opened Feb 10, 2026 by
puyuan1996
feature(sunjx): add rejection sampling in grm_training
#38
opened Feb 6, 2026 by
Jiaxuan-Sun
doc(pu): add init version of fast_exp_maker best practice
documentation
Improvements or additions to documentation
#37
opened Feb 3, 2026 by
puyuan1996
feature(luyd): add partial rollout in training process
enhancement
New feature or request
#29
opened Jan 22, 2026 by
AltmanD
refactor(sunjx): refactor loss-filter implementation
enhancement
New feature or request
refactor
Cleanup, formatting, or restructuring of existing code.
#17
opened Jan 1, 2026 by
Jiaxuan-Sun
refactor(sunjx): refactor dataset and reward module
refactor
Cleanup, formatting, or restructuring of existing code.
#13
opened Dec 31, 2025 by
Jiaxuan-Sun
feature(sunjx): add rejective sampling pipeline in t2i demo
enhancement
New feature or request
#3
opened Dec 25, 2025 by
Jiaxuan-Sun