-
Notifications
You must be signed in to change notification settings - Fork 200
Pull requests: sgl-project/SpecForge
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Reduce peak GPU memory in Eagle3 online target generation by avoiding an extra logits copy
#528
opened Apr 9, 2026 by
zijiexia
Loading…
1 of 6 tasks
Fix VLM preprocessing and add mRoPE position handling in target head
#527
opened Apr 8, 2026 by
liusy58
Loading…
6 tasks
Fix multimodal hidden-state preparation for Qwen3-VL models
#526
opened Apr 8, 2026 by
liusy58
Loading…
6 tasks
feat: reduce Eagle3 training memory spike via all-to-all sharding
#524
opened Apr 5, 2026 by
laoconeth
Loading…
2 of 6 tasks
fix: Make template override arg work correctly
#522
opened Apr 1, 2026 by
moehanabi
Loading…
1 of 6 tasks
fix: Support different version of PCG args
#517
opened Mar 30, 2026 by
moehanabi
Loading…
1 of 6 tasks
Fix: support preformatted text datasets in train_eagle3.py (avoid forcing conversations generator)
#498
opened Mar 9, 2026 by
Seun-Ajayi
Loading…
6 tasks
[Feature] Support DFlash Speculative Decoding Training for Qwen3.5 Models
#495
opened Mar 7, 2026 by
EanWang211123
Loading…
6 tasks
feat: add GLM-4.7-Flash EAGLE3 training support
#493
opened Mar 6, 2026 by
lujangus
Loading…
3 tasks
Add LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding
#492
opened Mar 6, 2026 by
MrShevan
Loading…
4 tasks done
Fix multi-node DP training crash from FlashInfer CUDA IPC handles
#489
opened Mar 4, 2026 by
yifjiang
Loading…
3 tasks done
fix: correct always-true condition in HarmonyParser causing duplicate system prompts
#484
opened Mar 2, 2026 by
Bias92
Loading…
fix: use processor.tokenizer for apply_chat_template in VLM preprocessing
#480
opened Mar 2, 2026 by
Bias92
Loading…
2 tasks
(feat)Draft model supports Qwen3 MoE
#468
opened Feb 13, 2026 by
sadasdasdasdasasd
Loading…
1 of 6 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.