-
Notifications
You must be signed in to change notification settings - Fork 311
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Allow searcher ckpt dir for per-rank ckpt files
#1091
opened Mar 21, 2026 by
kevalmorabia97
Loading…
add: HF PTQ support and modelopt_recipes mount in launcher
#1089
opened Mar 20, 2026 by
ChenhanYu
Loading…
Exclude small-channel Conv nodes from FP8 quantization
#1083
opened Mar 20, 2026 by
nv-samcheng
Loading…
fix: [modelopt 0.43.0][GB200][llm_ptq / sglang] Llama-3.1-8B-Inst (#5997673)
#1080
opened Mar 20, 2026 by
ChenhanYu
Loading…
Add sparse softmax to the Triton flash attention kernel
#1078
opened Mar 19, 2026 by
kaix-nv
Loading…
fix: [ModelOpt-Windows][modelopt 0.43.0] [genai_llm][README]: Sho (#5997787)
#1077
opened Mar 19, 2026 by
ChenhanYu
Loading…
[5963347] Expose iterator interface for calibration data
#1075
opened Mar 19, 2026 by
dthienan-nv
Loading…
fix: Feature: Add validation for loaded modelopt state files (#1041)
#1074
opened Mar 19, 2026 by
ChenhanYu
Loading…
[minor] Refactor TE fused-norm handling in GPTModelExporter
#1061
opened Mar 17, 2026 by
yueshen2016
Loading…
Add LoRA co-training support for HF EAGLE speculative decoding
#1060
opened Mar 17, 2026 by
yeyu-nvidia
Loading…
security(peft): enforce torch.nn.init prefix validation
#1054
opened Mar 17, 2026 by
RinZ27
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-02-22.