NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 311
Star 2.2k

Code
Issues 69
Pull requests 113
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 28 Milestones 0

New pull request New

113 Open 640 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Allow searcher ckpt dir for per-rank ckpt files

#1091 opened Mar 21, 2026 by kevalmorabia97

Loading…

add: HF PTQ support and modelopt_recipes mount in launcher

#1089 opened Mar 20, 2026 by ChenhanYu

Loading…

Dkorzekwa/anymodel subblock stats

#1085 opened Mar 20, 2026 by danielkorzekwa

Loading…

Exclude small-channel Conv nodes from FP8 quantization

#1083 opened Mar 20, 2026 by nv-samcheng

Loading…

Unified diffusion model trainer

#1082 opened Mar 20, 2026 by mxinO • Draft

Add skip-softmax to Triton flash attention kernel

#1081 opened Mar 20, 2026 by kaix-nv

Loading…

fix: [modelopt 0.43.0][GB200][llm_ptq / sglang] Llama-3.1-8B-Inst (#5997673)

#1080 opened Mar 20, 2026 by ChenhanYu

Loading…

fix: [modelopt 0.43][GH200][llm_ptq - autoquant / trtllm] Llama-3 (#5997832)

#1079 opened Mar 20, 2026 by ChenhanYu • Draft

Add sparse softmax to the Triton flash attention kernel

#1078 opened Mar 19, 2026 by kaix-nv

Loading…

fix: [ModelOpt-Windows][modelopt 0.43.0] [genai_llm][README]: Sho (#5997787)

#1077 opened Mar 19, 2026 by ChenhanYu

Loading…

fix: Speculative Decoding (#1066)

#1076 opened Mar 19, 2026 by ChenhanYu • Draft

[5963347] Expose iterator interface for calibration data

#1075 opened Mar 19, 2026 by dthienan-nv

Loading…

fix: Feature: Add validation for loaded modelopt state files (#1041)

#1074 opened Mar 19, 2026 by ChenhanYu

Loading…

Dkorzekwa/decilm hf code cleanup 2

#1073 opened Mar 19, 2026 by danielkorzekwa

Loading…

LORA first experiment

#1072 opened Mar 19, 2026 by skierat

Loading…

Dkorzekwa/decilm hf code cleanup

#1071 opened Mar 19, 2026 by danielkorzekwa

Loading…

[EAGLE] Add Tensorboard logging support

#1070 opened Mar 19, 2026 by benchislett

Loading…

Fridah/kinjal/vllm modelopt reload

#1068 opened Mar 18, 2026 by Fridah-nv • Draft

Step3.5 MoE support

#1063 opened Mar 17, 2026 by meenchen

Loading…

[minor] Refactor TE fused-norm handling in GPTModelExporter

#1061 opened Mar 17, 2026 by yueshen2016

Loading…

Add LoRA co-training support for HF EAGLE speculative decoding

#1060 opened Mar 17, 2026 by yeyu-nvidia

Loading…

Fused QKV add node issue for GQA graph surgery

#1057 opened Mar 17, 2026 by hthadicherla

Loading…

security(opt): enable weights_only=True by default

#1056 opened Mar 17, 2026 by RinZ27

Loading…

security(peft): enforce torch.nn.init prefix validation

#1054 opened Mar 17, 2026 by RinZ27

Loading…

Add VSA

#1053 opened Mar 17, 2026 by kaix-nv

Loading…

Previous 1 2 3 4 5 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2026-02-22.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!