feature(pu): add init version of off-policy grpo/ppo enhancement with list and http buffer #4

Open
puyuan1996 wants to merge 36 commits into main from dev-offpolicy-buffer
Conversation

@puyuan1996
Owner

No description provided.

puyuan1996 pushed a commit that referenced this pull request Jan 8, 2026
Fix FSDP load planner to keep model tensors

1 participant