-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][perf] Remove unnecessary ToPIL() from find_mm_token_lengths
Multimodal
Label for issues & PRs regarding Multimodal related objects
#11640
opened Feb 23, 2026 by
yechank-nvidia
Loading…
[None][infra] Waive failures on release 1.2
#11639
opened Feb 23, 2026 by
jieli-matrix
Loading…
1 task done
[TRTLLM-9605][feat] Support streaming tool calls for Responses API
#11638
opened Feb 23, 2026 by
JunyiXu-nv
Loading…
1 task done
[TRTLLM-11070][feat] Account for reusable KV cache blocks in micro batch scheduler capacity scheduling.
#11637
opened Feb 23, 2026 by
SimengLiu-nv
•
Draft
1 task done
[None][chroe] Mass integration of release/1.2 - 5th
#11636
opened Feb 23, 2026 by
dominicshanshan
Loading…
1 task done
[TRTLLM-11568][feat] Fix collective calls
#11632
opened Feb 22, 2026 by
greg-kwasniewski1
Loading…
1 task done
[#10243][chore] switched the default AD attention backend to trtllm
#11627
opened Feb 22, 2026 by
MrGeva
Loading…
1 task done
[#11529][perf] AD host time attention MD optimization for large context
#11624
opened Feb 22, 2026 by
MrGeva
Loading…
1 task done
Refactor fused qk norm rope kernel
Community want to contribute
PRs initiated from Community
#11622
opened Feb 22, 2026 by
IanBoyanZhang
Loading…
1 task
[None][fix] Accept **kwargs in DynamicYamlWithDeepMergeSettingsSource…
#11621
opened Feb 22, 2026 by
tcherckez-nvidia
Loading…
1 task done
[TRTLLM-11535][feat] Fixed NVFP4 sharding
#11618
opened Feb 21, 2026 by
greg-kwasniewski1
Loading…
1 task done
[https://nvbugs/5919025][fix] Disable warmup steps for some WAN unit tests
#11616
opened Feb 21, 2026 by
chang-l
Loading…
1 task done
[TRTLLM-11614][feat] Fixing multigpu tests
#11615
opened Feb 21, 2026 by
greg-kwasniewski1
Loading…
1 task done
[None][feat] Add Qwen3-235B-A22B Pareto configs to recipe selector
#11612
opened Feb 21, 2026 by
venkywonka
•
Draft
add globaltimer-based timing backend for autotuner profiling
#11611
opened Feb 21, 2026 by
dhansen-nvidia
•
Draft
1 task
[None][feat] NIXL support for hybrid model cache transfer
#11608
opened Feb 20, 2026 by
NVShreyas
Loading…
1 task done
[None][perf] Use UE8M0 FP8 quant kernel for DeepGemm blockwise GEMM
#11607
opened Feb 20, 2026 by
chang-l
Loading…
1 task done
[TRTLLM-11087][doc] Update speculative decoding docs
#11604
opened Feb 20, 2026 by
mikeiovine
Loading…
1 task done
[TRTLLM-11567][feat] Added GatedDeltaNet sharding from config
#11599
opened Feb 20, 2026 by
greg-kwasniewski1
Loading…
1 task done
Previous Next
ProTip!
Follow long discussions with comments:>50.