Skip to content

feature(wzn): add LoRA training demo for Geo3K#41

Merged
puyuan1996 merged 8 commits intoopendilab:mainfrom
zunian-wan:dev-geo3k-lora
Mar 16, 2026
Merged

feature(wzn): add LoRA training demo for Geo3K#41
puyuan1996 merged 8 commits intoopendilab:mainfrom
zunian-wan:dev-geo3k-lora

Conversation

@zunian-wan
Copy link
Contributor

@zunian-wan zunian-wan commented Feb 10, 2026

📋 Summary

Purpose:
Add a demo for Geo3k training using LoRA with FSDP and SGLang

Type of Change:

  • 🐛 Bug fix (non-breaking change which fixes an issue)
  • ✨ New feature (non-breaking change which adds functionality)
  • 💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • 📚 Documentation update
  • 🎨 Code refactoring (no functional changes)
  • ⚡ Performance improvement
  • ✅ Test addition/modification
  • 🔧 Configuration/Build changes

🔗 Related Issues

Fixes #(issue number)
Related to #(issue number)

📝 Changes

What changed:

Why these changes:

Key implementation details:

🧪 Testing


BEFORE SUBMITTING, PLEASE READ:

- Implement LoRA-aware model saving in FSDPV2Strategy, supporting HF/PEFT `save_pretrained` for adapters.
- Add LoRA merging/unmerging logic in BroadcastManager to ensure inference engines receive effective weights during synchronization.
- Optimize checkpointing in PPOTerVL to prioritize HF adapter saving for LoRA runs.
- Add `run_grpo_geo3k_lora_qwen2.5_vl_7b.sh` as a reference LoRA training script.
- Improve weight mapping for SGLang to handle PEFT-wrapped module names and base layer stripping.
- Added rotation logic for HF/LoRA adapters in PPO/SPMD trainers to honor the `max_ckpt_num` parameter.
- Synced the cleanup mechanism with the `save_ckpt` implementation in `FSDPV2Strategy`.
@zunian-wan zunian-wan changed the title Add Add Geo3K training demo using LoRA Feb 10, 2026
@zunian-wan zunian-wan changed the title Add Geo3K training demo using LoRA Add Geo3K LoRA training demo Feb 10, 2026
@zunian-wan zunian-wan changed the title Add Geo3K LoRA training demo Add LoRA training demo for Geo3K Feb 10, 2026
@puyuan1996 puyuan1996 added the enhancement New feature or request label Feb 10, 2026
@puyuan1996 puyuan1996 changed the title Add LoRA training demo for Geo3K feature(wzn): add LoRA training demo for Geo3K Feb 10, 2026
…ture and improve LoRA training documentation in runing script
- simplify Geo3K LoRA demo script configs and naming
- adjust save paths for final/critic ckpts in training entry
- unify checkpoint rotation via rotate_ckpt_dirs
- improve FSDP/engine weight broadcast handling and errors
- Add test_geo3k_lora.py to seamlessly evaluate LoRA-finetuned vision-language models (e.g., Qwen2.5-VL) on the Geo3K dataset.
@puyuan1996 puyuan1996 merged commit 015fe39 into opendilab:main Mar 16, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants