-
Notifications
You must be signed in to change notification settings - Fork 278
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update Extensions Commit to Support Array Type for Tool Calling Function
#2073
opened Apr 7, 2026 by
sayanshaw24
Loading…
Enable CUDA graph capture for CUDA EP to improve decode throughput
#2070
opened Apr 7, 2026 by
apsonawane
Loading…
docs: fix step numbering in Hugging Face download section
#2058
opened Apr 2, 2026 by
riddles-the-one
Loading…
Fix CUDA build with MSVC by enabling /Zc:preprocessor for nvcc host compilation on VS 16.5 or greater
#2054
opened Apr 1, 2026 by
nsubaru
Loading…
Add HunYuan Dense V1 (hunyuan_v1_dense) model support
#2045
opened Mar 25, 2026 by
amdrajeevp1
Loading…
Rename NemotronCacheConfig to NemotronConfig and add blank penalty to the decoder
#2042
opened Mar 22, 2026 by
nenad1002
Loading…
Add WebGPU EP support and repetitions flag to whisper.py
#2032
opened Mar 17, 2026 by
qjia7
Loading…
GenAI changes to support EPContext compilation and validation
#1993
opened Feb 27, 2026 by
lnigam
Loading…
remove one assert not verified with model microsoft/OptiMind-SFT
#1975
opened Feb 12, 2026 by
xadupre
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.