sgl-project / sglang Public

Notifications You must be signed in to change notification settings
Fork 4.5k
Star 23.6k

Code
Issues 601
Pull requests 1.6k
Discussions
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Pull requests: sgl-project/sglang

Labels 70 Milestones 1

New pull request New

1,589 Open 12,205 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix default_max_tokens compute error in responses api when mtp is opened

#18932 opened Feb 17, 2026 by LuYanFCP

Loading…

2 of 4 tasks

Fix NSA FP8 KV layout mismatch under MHA one-shot quant

LLM Quantization

run-ci

#18931 opened Feb 17, 2026 by mmangkad

Loading…

5 tasks

[AMD] Unit tests for mtp in GLM-4.7

#18930 opened Feb 17, 2026 by almaslof

Loading…

5 tasks

[diffusion] refactor: unify SamplingParams construction and improve DiffGenerator return types diffusion

SGLang Diffusion

npu run-ci

#18928 opened Feb 17, 2026 by mickqian

Loading…

5 tasks

Remove unused fast-hadamard-transform PyTorch extension sources run-ci

#18927 opened Feb 17, 2026 by BBuf

Loading…

5 tasks

feat: [Qwen3.5] Support block-wise FP8 quantization and model adaptation run-ci

#18926 opened Feb 17, 2026 by zju-stu-lizheng

Loading…

2 tasks done

[Fix] Enable Pipeline Parallelism support for Kimi K2.5

#18925 opened Feb 17, 2026 by ieBoytsov

Loading…

1 of 5 tasks

[NPU] [Quantization] w4a4 MoE layer support npu

#18924 opened Feb 17, 2026 by OrangeRedeng • Draft

2 of 7 tasks

fix: adding pin to prevent cleanups for designated nightly docker images

#18923 opened Feb 17, 2026 by dougyster

Loading…

[bugfix?] update outdated unittest document documentation

Improvements or additions to documentation

#18919 opened Feb 17, 2026 by SoluMilken

Loading…

5 tasks done

[Qwen3-Next] Enable fused_qkvzba_split_reshape_cat also for prefill run-ci

#18917 opened Feb 17, 2026 by YAMY1234

Loading…

[TorchAO] Enable TorchAO LinearMethod and TorchAOConfig

#18916 opened Feb 17, 2026 by ZhiweiYan-96

Loading…

5 tasks

Refactor sampler: Use a better hash function for deterministic sampling and clear dispatch for probs/logprobs/logits sampling paths run-ci

#18915 opened Feb 17, 2026 by merrymercy

Loading…

Add get_weights_checksum API and refactor update_weights_from_tensor tests with SHA256 verification

#18913 opened Feb 17, 2026 by aeft

Loading…

3 of 5 tasks

[Test] add unit test for skipping already preempted request

#18912 opened Feb 17, 2026 by glenliu21

Loading…

3 tasks done

[AMD] Add GLM-5 nightly test amd run-ci

#18911 opened Feb 17, 2026 by michaelzhang-ai • Draft

5 tasks

Revert #17613 Qwen3-Next PCG refactor (KL divergence regression test)

#18910 opened Feb 16, 2026 by alisonshao

Loading…

1 task

feat: add cuda core dump CI warpper run-ci

#18909 opened Feb 16, 2026 by hnyls2002

Loading…

Fix FlashInfer autotune deadlock with --enable-symm-mem

#18908 opened Feb 16, 2026 by alisonshao

Loading…

1 task

[diffusion] benchmark: Add SLO metric for SGL-Diffusion diffusion

SGLang Diffusion

#18907 opened Feb 16, 2026 by yyy1000 • Draft

5 tasks

[Temporarily unblock spec v2 qwen3.5]

#18906 opened Feb 16, 2026 by vincentzed • Draft

5 tasks

[jit kernel] Support per_token_group_quant_8bit jit kernel quant

LLM Quantization

run-ci

#18905 opened Feb 16, 2026 by yuan-luo

Loading…

5 tasks

Pass kv scales to paged attention in flashinfer backend

#18904 opened Feb 16, 2026 by lukealonso

Loading…

[sgl-kernel] rebase FlashMLA 0217 run-ci sgl-kernel

#18902 opened Feb 16, 2026 by FlamingoPg

Loading…

[HiCache] feat: L3 prefetch prometheus metrics documentation

Improvements or additions to documentation

#18898 opened Feb 16, 2026 by vladnosiv

Loading…

Previous 1 2 3 4 5 … 63 64 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!