Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][test] Waive 7 failed cases for main in QA CI
#14504 opened May 24, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 1 failed cases for main in QA CI
#14503 opened May 24, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 9 failed cases for main in QA CI
#14501 opened May 24, 2026 by xinhe-nv Collaborator Draft
[https://nvbugs/6196391][fix] Eliminate O(N^2) memcpy in findBlocksInReuseTreeByBlockKey
#14499 opened May 23, 2026 by brb-nv Collaborator Loading…
1 task done
[None][test] Waive 7 failed cases for main in QA CI
#14498 opened May 23, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 4 failed cases for main in QA CI
#14497 opened May 23, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 8 failed cases for main in QA CI
#14496 opened May 23, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 5 failed cases for main in QA CI
#14482 opened May 23, 2026 by xinhe-nv Collaborator Draft
[TRTLLM-10947][perf] eagle3: use cudaMemcpy2DAsync custom op for hidden-state capture
#14479 opened May 23, 2026 by pcicotti Collaborator Loading…
2 tasks done
[None][fix] Release stale-range holder after _commit_block in KVCache…
#14478 opened May 23, 2026 by Tabrizian Member Loading…
1 task done
[None][fix] [AutoDeploy] Fix OOM of DeepSeek-R1 NVFP4 for tp=4
#14477 opened May 23, 2026 by taylor-yb-lee Collaborator Loading…
1 task done
[None][feat] MNNVL Performance Optimization and FP8/NVFP4 Quant Fusion
#14476 opened May 23, 2026 by timlee0212 Collaborator Loading…
1 task done
ProTip! Exclude everything labeled bug with -label:bug.