Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix] Strip Gemma4 string delimiters from dict keys bug Something isn't working tool-calling
#44756 opened Jun 7, 2026 by he-yufeng Contributor Loading…
4 tasks done
[Bugfix] Shut down engine cores on startup handshake failure bug Something isn't working v1
#44751 opened Jun 6, 2026 by fiddleboy Loading…
[Bugfix] Propagate ImportError from load_audio_pyav when vllm[audio] … bug Something isn't working multi-modality Related to multi-modality (#4194)
#44750 opened Jun 6, 2026 by littlecircle0730 Loading…
3 of 4 tasks
[Misc] Remove orphaned env vars and stale env-var references documentation Improvements or additions to documentation
#44749 opened Jun 6, 2026 by DaoyuanLi2816 Contributor Loading…
[Cohere] Fix Cohere2MoE weight loading when using Transformers ≥5.10 ready ONLY add when PR is ready to merge/full CI is needed
#44747 opened Jun 6, 2026 by Terrencezzj Contributor Loading…
4 tasks
[Bugfix] Harden allowed_token_ids metadata for spec-decode bug Something isn't working v1
#44742 opened Jun 6, 2026 by jperezdealgaba Contributor Loading…
[Bugfix] Gemma4 streaming parser for multi-boundary tool deltas bug Something isn't working tool-calling
#44741 opened Jun 6, 2026 by yasu-oh Loading…
4 tasks done
[Bugfix][Model] GraniteMoE: load FP8_DYNAMIC expert weight_scale tensors bug Something isn't working ci/build
#44739 opened Jun 6, 2026 by javierdejesusda Contributor Loading…
[Opt] Optimize rotary embedding cache length
#44738 opened Jun 6, 2026 by labAxiaoming Contributor Loading…
4 tasks
[Bugfix] Canonicalize FP8 weight layout to (K, N) at the source bug Something isn't working quantization ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#44735 opened Jun 6, 2026 by mgoin Member Loading…
3 of 4 tasks
[Bugfix][Rust Frontend] Set a structured-output backend so requests do not 500 bug Something isn't working rust
#44729 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
[Bugfix] Fix shape mismatch crash and add logprob_token_ids support in RejectionSampler bug Something isn't working v1
#44727 opened Jun 6, 2026 by skajre Loading…
4 tasks done
[Bugfix][Core] Close underlying iterator in merge_async_iterators single-iterator fast path bug Something isn't working
#44726 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
[Bugfix][Frontend] Fix Anthropic count_tokens decorator order driving server load negative bug Something isn't working frontend
#44725 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.
Morty Proxy This is a proxified and sanitized view of the page, visit original site.