Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: RL support for custom moe models in dtensor v2 CI:L1 Run doctests, unit tests, and functional tests
#1695 opened Dec 24, 2025 by hemildesai Loading…
docs: Add doc for nano-v3 CI:docs Run doctest documentation Improvements or additions to documentation
#1694 opened Dec 24, 2025 by yfw Loading…
4 tasks
[don't merge] support multiple datasets for response dataset CI:L1 Run doctests, unit tests, and functional tests
#1691 opened Dec 23, 2025 by yuki-97 Draft
fix: Fix DTensor slice crash after PyTorch 2.9 bump CI:L2 Run doctests, unit tests, functional tests, and convergence tests r0.5.0
#1689 opened Dec 23, 2025 by zpqiu Loading…
4 tasks
Nano v3 lora
#1669 opened Dec 20, 2025 by arendu Draft
4 tasks
refactor: Order node by IP in GRPO CI:L2 Run doctests, unit tests, functional tests, and convergence tests GB200
#1655 opened Dec 18, 2025 by guyueh1 Loading…
4 tasks
feat: refactor mcore train/forward utilities
#1654 opened Dec 17, 2025 by ashors1 Draft
4 tasks
chore: update Megatron-LM submodule to ed804b4
#1653 opened Dec 17, 2025 by yaoyu-33 Loading…
4 tasks
feat: refactor megatron data utils
#1651 opened Dec 17, 2025 by ashors1 Draft
4 tasks
refactor: split train and val dataset in response dataset CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1649 opened Dec 17, 2025 by yuki-97 Loading…
feat: refactor megatron init
#1646 opened Dec 17, 2025 by ashors1 Draft
4 tasks
perf: Add DeepEP support to Megatron Policy CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1645 opened Dec 17, 2025 by parthmannan Loading…
4 tasks
tests: add nanov3 nightly/release tests
#1644 opened Dec 16, 2025 by terrykong Draft
4 tasks
fix: split dtensorv1 vllm dependency CI:L1 Run doctests, unit tests, and functional tests
#1638 opened Dec 15, 2025 by yuki-97 Loading…
feat: Megatron SFT LoRA CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation r0.5.0
#1629 opened Dec 12, 2025 by arendu Loading…
4 tasks
fix: allow zero grad norm in dtensor policies for consistency with Megatron CI:L1 Run doctests, unit tests, and functional tests
#1618 opened Dec 9, 2025 by smahdavi4 Loading…
feat: Support for Ray spinup within Gym
#1613 opened Dec 9, 2025 by pjin-nvidia Draft
4 tasks
feat: Support Ray Compiled Graph for SFT
#1612 opened Dec 9, 2025 by katec846 Loading…
5 of 10 tasks
ProTip! Add no:assignee to see everything that’s not assigned.