Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[XPU] Enable Expert parallel for MoE models
#28263 opened Nov 7, 2025 by jikunshang Loading…
5 tasks
[CPU]Avoid repeated random sample compile v1
#28260 opened Nov 7, 2025 by xiangze-arm Loading…
[Misc][Model][Refactor] Pass the prefix into Linear layers deepseek Related to DeepSeek models qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#28259 opened Nov 7, 2025 by MengqingCao Loading…
5 tasks
Fix issues from #28242 qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#28257 opened Nov 7, 2025 by hmellor Loading…
[Log] update shm wait time msg ready ONLY add when PR is ready to merge/full CI is needed
#28255 opened Nov 6, 2025 by BoyuanFeng Loading…
[Core] Rework handling of async scheduling config ready ONLY add when PR is ready to merge/full CI is needed v1
#28250 opened Nov 6, 2025 by njhill Loading…
parse reasoning item input in responses api frontend gpt-oss Related to GPT-OSS models
#28248 opened Nov 6, 2025 by qandrew Draft
[Perf] Use np.ndarray instead of list[list[int]] to reduce GC overhead ready ONLY add when PR is ready to merge/full CI is needed v1
#28245 opened Nov 6, 2025 by Jialin Loading…
3 of 5 tasks
Add truncate arg to yarn to match openai implementation of gpt-oss gpt-oss Related to GPT-OSS models
#28244 opened Nov 6, 2025 by ashors1 Loading…
5 tasks
[BugFix][27485] Fix ITL algorithm for chunked OpenAI chat completions performance Performance-related issues
#28240 opened Nov 6, 2025 by manamalani10 Loading…
[NVIDIA] [feat] Integrate flashinfer Trtllmgen bf16 moe ci/build documentation Improvements or additions to documentation qwen Related to Qwen models
#28238 opened Nov 6, 2025 by jiahanc Draft
5 tasks
[Feature] Default ignore_eos True for random dataset performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#28227 opened Nov 6, 2025 by yewentao256 Loading…
[BugFix] [FEAT] Enable fastsafetensors for ROCm platform ci/build rocm Related to AMD ROCm
#28225 opened Nov 6, 2025 by tjtanaa Loading…
5 tasks
Update xgrammar version from 0.1.25 to 0.1.27 ci/build
#28221 opened Nov 6, 2025 by cjackal Loading…
1 task done
Adds Dockerfile arg for VLLM_PRECOMPILED_WHEEL_LOCATION ci/build
#28217 opened Nov 6, 2025 by dougbtv Loading…
3 tasks done
Enhance Helm chart installation instructions documentation Improvements or additions to documentation
#28211 opened Nov 6, 2025 by ccnmxns Loading…
ProTip! Exclude everything labeled bug with -label:bug.