Skip to content

Pull requests: sgl-project/SpecForge

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Feature] support prepare hidden states use dp
#282 opened Nov 7, 2025 by jiapingW Loading…
5 tasks
add template for deepseek-r1-distill
#281 opened Nov 7, 2025 by justadogistaken Loading…
6 tasks
[Feature] Add accept length simulator for QwenVL
#279 opened Nov 7, 2025 by Lihui-Gu Loading…
6 tasks
Added requirements-rocm.txt for AMD GPU and ROCm
#275 opened Nov 5, 2025 by ChangLiu0709 Loading…
6 tasks
[Feature]Add Parser for Qwen3 think model
#258 opened Oct 21, 2025 by zyksir Loading…
6 tasks
[Feature] Qwen3 VL eagle3 support
#251 opened Oct 10, 2025 by dcw02 Loading…
6 tasks
Fix resume offline train logic. Add loading optimizer state
#243 opened Sep 29, 2025 by hanq-moreh Loading…
6 tasks
Added mistral model support
#208 opened Sep 1, 2025 by ValeGian Loading…
3 of 6 tasks
[Feature] VLM model support tp
#206 opened Sep 1, 2025 by KerwinKai Draft
6 tasks
Support Train Eagle-3 By DeepSpeed
#197 opened Sep 1, 2025 by xq25478 Loading…
Adapt Eagle3 for Deepseek architecture
#186 opened Aug 28, 2025 by xuhaojie-2025 Loading…
6 tasks
Add Draft LoRA scripts
#138 opened Aug 13, 2025 by shuaills Draft
6 tasks
Added Eagle training support for Kimi-K2
#108 opened Aug 3, 2025 by xuhaojie-2025 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.