-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix default_max_tokens compute error in responses api when mtp is opened
#18932
opened Feb 17, 2026 by
LuYanFCP
Loading…
2 of 4 tasks
Fix NSA FP8 KV layout mismatch under MHA one-shot
quant
LLM Quantization
run-ci
#18931
opened Feb 17, 2026 by
mmangkad
Loading…
5 tasks
Remove unused fast-hadamard-transform PyTorch extension sources
run-ci
#18927
opened Feb 17, 2026 by
BBuf
Loading…
5 tasks
feat: [Qwen3.5] Support block-wise FP8 quantization and model adaptation
run-ci
#18926
opened Feb 17, 2026 by
zju-stu-lizheng
Loading…
2 tasks done
[Fix] Enable Pipeline Parallelism support for Kimi K2.5
#18925
opened Feb 17, 2026 by
ieBoytsov
Loading…
1 of 5 tasks
[NPU] [Quantization] w4a4 MoE layer support
npu
#18924
opened Feb 17, 2026 by
OrangeRedeng
•
Draft
2 of 7 tasks
fix: adding pin to prevent cleanups for designated nightly docker images
#18923
opened Feb 17, 2026 by
dougyster
Loading…
[bugfix?] update outdated unittest document
documentation
Improvements or additions to documentation
#18919
opened Feb 17, 2026 by
SoluMilken
Loading…
5 tasks done
[Qwen3-Next] Enable fused_qkvzba_split_reshape_cat also for prefill
run-ci
#18917
opened Feb 17, 2026 by
YAMY1234
Loading…
[TorchAO] Enable TorchAO LinearMethod and TorchAOConfig
#18916
opened Feb 17, 2026 by
ZhiweiYan-96
Loading…
5 tasks
Refactor sampler: Use a better hash function for deterministic sampling and clear dispatch for probs/logprobs/logits sampling paths
run-ci
#18915
opened Feb 17, 2026 by
merrymercy
Loading…
Add get_weights_checksum API and refactor update_weights_from_tensor tests with SHA256 verification
#18913
opened Feb 17, 2026 by
aeft
Loading…
3 of 5 tasks
[Test] add unit test for skipping already preempted request
#18912
opened Feb 17, 2026 by
glenliu21
Loading…
3 tasks done
[AMD] Add GLM-5 nightly test
amd
run-ci
#18911
opened Feb 17, 2026 by
michaelzhang-ai
•
Draft
5 tasks
Revert #17613 Qwen3-Next PCG refactor (KL divergence regression test)
#18910
opened Feb 16, 2026 by
alisonshao
Loading…
1 task
Fix FlashInfer autotune deadlock with --enable-symm-mem
#18908
opened Feb 16, 2026 by
alisonshao
Loading…
1 task
[jit kernel] Support per_token_group_quant_8bit jit kernel
quant
LLM Quantization
run-ci
#18905
opened Feb 16, 2026 by
yuan-luo
Loading…
5 tasks
Pass kv scales to paged attention in flashinfer backend
#18904
opened Feb 16, 2026 by
lukealonso
Loading…
[sgl-kernel] rebase FlashMLA 0217
run-ci
sgl-kernel
#18902
opened Feb 16, 2026 by
FlamingoPg
Loading…
[HiCache] feat: L3 prefetch prometheus metrics
documentation
Improvements or additions to documentation
#18898
opened Feb 16, 2026 by
vladnosiv
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.