Skip to content

Pull requests: opendilab/LightRFT

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(wzn): fix format reward contamination from prompt template in gsm8k/geo3k bug Something isn't working
#52 opened Mar 14, 2026 by zunian-wan Loading…
3 of 13 tasks
feature(sunjx): implement dynamic sampling strategy in DAPO enhancement New feature or request
#51 opened Mar 7, 2026 by Jiaxuan-Sun Loading…
feature(sunjx): add GSPO and GMPO algorithms support enhancement New feature or request
#50 opened Mar 4, 2026 by Jiaxuan-Sun Loading…
feature(nyz): transfer meme rl training demo enhancement New feature or request
#49 opened Feb 26, 2026 by PaParaZz1 Loading…
feature(sunjx): fix fire sampling bugs in generate_fn bug Something isn't working
#48 opened Feb 26, 2026 by Jiaxuan-Sun Loading…
dev(hansbug): add math PRM code documentation Improvements or additions to documentation enhancement New feature or request
#47 opened Feb 24, 2026 by HansBug Draft
feature(pu): add init version of on_policy_distillation enhancement New feature or request
#43 opened Feb 10, 2026 by puyuan1996 Loading…
WIP: feature(pu): adapt to npu device
#39 opened Feb 9, 2026 by puyuan1996 Loading…
doc(pu): add init version of fast_exp_maker best practice documentation Improvements or additions to documentation
#37 opened Feb 3, 2026 by puyuan1996 Loading…
feature(luyd): add partial rollout in training process enhancement New feature or request
#29 opened Jan 22, 2026 by AltmanD Loading…
refactor(sunjx): refactor loss-filter implementation enhancement New feature or request refactor Cleanup, formatting, or restructuring of existing code.
#17 opened Jan 1, 2026 by Jiaxuan-Sun Loading…
refactor(sunjx): refactor dataset and reward module refactor Cleanup, formatting, or restructuring of existing code.
#13 opened Dec 31, 2025 by Jiaxuan-Sun Loading…
feature(sunjx): add rejective sampling pipeline in t2i demo enhancement New feature or request
#3 opened Dec 25, 2025 by Jiaxuan-Sun Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.