-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Draft - DO NOT REVIEW - AD host time improvement for large prefill (and more changes that will be removed
#11565
opened Feb 18, 2026 by
MrGeva
Loading…
1 task
[None][chore] Add WAN2.1 FP8 VBench test
#11564
opened Feb 18, 2026 by
yibinl-nvidia
Loading…
1 task
[None][feat] Add NVFP4 dynamic quantization support for visual_gen models
#11563
opened Feb 18, 2026 by
chang-l
Loading…
1 task done
[None][fix] Update to get central version variable for trtllm-bench.
#11562
opened Feb 18, 2026 by
FrankD412
Loading…
1 task done
[None][feat] Add GLM-4.7 FP8 recipe configs for H100 to config db
#11559
opened Feb 17, 2026 by
venkywonka
Loading…
1 task done
Draft: DON'T REVIEW: improved perf of host time attention metadata list to tensor handling
#11555
opened Feb 17, 2026 by
MrGeva
Loading…
1 task
[None][feat] Visual Gen: add cuda graphs; torch compile; nvtx; warmup
#11554
opened Feb 17, 2026 by
NVShreyas
Loading…
1 task done
[None][infra] Cherry pick plc pipeline for 1.2
#11546
opened Feb 17, 2026 by
yuanjingx87
Loading…
1 task done
[None][chore] Test change for CI workflow
#11545
opened Feb 17, 2026 by
dpitman-nvda
Loading…
1 task done
[None][fix] correct chunked prefill handling in TorchSampler
#11544
opened Feb 17, 2026 by
ixlmar
Loading…
1 task done
[https://nvbugs/839137][fix] Unwaive disagg unexpected ucx error
#11543
opened Feb 17, 2026 by
pcastonguay
Loading…
1 task done
Mf/minimax moe routing kernel
Community want to contribute
PRs initiated from Community
#11539
opened Feb 17, 2026 by
michaelfeil
•
Draft
1 task
[None][chore] DO NOT MERGE: Waiving some tests for 1.3.0rc4
#11536
opened Feb 16, 2026 by
pcastonguay
Loading…
1 task done
[#10693][chore] AutoDeploy: Add L1 tests from coverage dashboard
#11530
opened Feb 15, 2026 by
marinayanov
Loading…
[#11312][chore] Make TRT/NCCL configurable in CMake find modules
Community want to contribute
PRs initiated from Community
#11528
opened Feb 15, 2026 by
xuantengh
Loading…
1 task done
[#11398][feat] AutoDeploy: flashinfer rope for GLM4.7-Flash
AutoDeploy
<NV> AutoDeploy Backend
#11524
opened Feb 14, 2026 by
taylor-yb-lee
Loading…
1 task done
[https://nvbugs/5875514][fix] Fix WideEP gen-only benchmark hang in disaggregated serving
#11521
opened Feb 14, 2026 by
peihu-nv
Loading…
1 task done
Previous Next
ProTip!
no:milestone will show everything without a milestone.