Skip to content

Add Megatron-FSDP training script for LLama3 recipe#1468

Open
pstjohn wants to merge 2 commits intoNVIDIA:mainfrom
pstjohn:pstjohn/bio-240-add-megatron-fsdp-to-llama3-recipe
Open

Add Megatron-FSDP training script for LLama3 recipe#1468
pstjohn wants to merge 2 commits intoNVIDIA:mainfrom
pstjohn:pstjohn/bio-240-add-megatron-fsdp-to-llama3-recipe

Conversation

@pstjohn
Copy link
Collaborator

@pstjohn pstjohn commented Feb 17, 2026

Adds megatron-FSDP and context-parallel training script for llama3. Convergence testing may be blocked until we can ensure the clip_grad_norm_ works appropriately

Signed-off-by: Peter St. John <pstjohn@nvidia.com>
Signed-off-by: Peter St. John <pstjohn@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant