-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[https://nvbugs/6196391][fix] Eliminate O(N^2) memcpy in findBlocksInReuseTreeByBlockKey
#14499
opened May 23, 2026 by
brb-nv
Collaborator
Loading…
1 task done
[None][docs] Add deprecation notice to legacy support-matrix.md
#14495
opened May 23, 2026 by
fuergaosi233
Loading…
[None][docs] Add W8A8 SmoothQuant (int8_sq) to quantization feature docs
#14494
opened May 23, 2026 by
fuergaosi233
Loading…
[None][docs] Fix beamWidth default value and valid range in gpt-runtime.md
#14493
opened May 23, 2026 by
fuergaosi233
Loading…
[None][docs] Fix broken _cpp_gen/executor.rst reference in executor.md
#14492
opened May 23, 2026 by
fuergaosi233
Loading…
[None][fix] add nvfp4 to --kv_cache_dtype choices in quantize.py
#14491
opened May 23, 2026 by
fuergaosi233
Loading…
2 tasks
[None][docs] use requirements-dev.txt for dev environment setup in CONTRIBUTING.md
#14490
opened May 23, 2026 by
fuergaosi233
Loading…
2 tasks
[None][docs] pin setuptools<80 in pip installation commands
#14489
opened May 23, 2026 by
fuergaosi233
Loading…
1 task
[None][docs] remove stale prompt from quickstart_example.py output comment
#14488
opened May 23, 2026 by
fuergaosi233
Loading…
1 task
[None][docs] fix incorrect auto sampler behavior description for beam search
#14487
opened May 23, 2026 by
fuergaosi233
Loading…
2 tasks
[https://nvbugs/6164924][fix] Apply minimal fix — lower free_gpu_memory_fraction only modestly (0.9→0.8, ~700
#14486
opened May 23, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[https://nvbugs/6189416][fix] Add a Blackwell-specific reference entry (extra_acc_spec=sm100_fp8, accuracy=46.
#14484
opened May 23, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[https://nvbugs/6198674][fix] Restructure the post-quant unpack to capture recv_x as a single value and split
#14483
opened May 23, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-12901][fix] cap per-rank max_num_active_requests by max_num_tokens under attention DP
#14481
opened May 23, 2026 by
xwang233
Collaborator
Loading…
1 task done
[TRTLLM-10947][perf] eagle3: use cudaMemcpy2DAsync custom op for hidden-state capture
#14479
opened May 23, 2026 by
pcicotti
Collaborator
Loading…
2 tasks done
[None][fix] Release stale-range holder after _commit_block in KVCache…
#14478
opened May 23, 2026 by
Tabrizian
Member
Loading…
1 task done
[None][fix] [AutoDeploy] Fix OOM of DeepSeek-R1 NVFP4 for tp=4
#14477
opened May 23, 2026 by
taylor-yb-lee
Collaborator
Loading…
1 task done
[None][feat] MNNVL Performance Optimization and FP8/NVFP4 Quant Fusion
#14476
opened May 23, 2026 by
timlee0212
Collaborator
Loading…
1 task done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.