Skip to content

chore(deps): update vllm requirement from >=0.19.1 to >=0.20.2#801

Merged
github-actions[bot] merged 1 commit into
mainfrom
dependabot/pip/vllm-gte-0.20.2
May 10, 2026
Merged

chore(deps): update vllm requirement from >=0.19.1 to >=0.20.2#801
github-actions[bot] merged 1 commit into
mainfrom
dependabot/pip/vllm-gte-0.20.2

Conversation

@dependabot
Copy link
Copy Markdown
Contributor

@dependabot dependabot Bot commented on behalf of github May 10, 2026

Updates the requirements on vllm to permit the latest version.

Release notes

Sourced from vllm's releases.

v0.20.2

vLLM v0.20.2

Highlights

This release features 6 commits from 6 contributors (0 new)!

This is a small patch release with bug fixes for DeepSeek V4, gpt-oss, and Qwen3-VL

Bug Fixes

  • DeepSeek V4 sparse attention: Re-enable the persistent topk path on Hopper and ensure the memset kernel runs at CUDA graph capture time regardless of max_seq_len, fixing the MTP=1 hang on DeepSeek V4 (#41665, revert of #41605).
  • DeepSeek V4 KV cache: Fixed a "failure to allocate KV blocks" error in the V1 engine KV cache manager (#41282).
  • gpt-oss MXFP4 + torch.compile: Plumbed hidden_dim_unpadded through the moe_forward fake op so MXFP4 works under torch.compile on v0.20.x (#42002, backport of #41646).
  • Qwen3-VL: Removed an invalid deepstack boundary check that could fail under heavy load (#40932).

Contributors

@​ywang96, @​zyongye, @​stecasta, @​wzhao18, @​Isotr0py, @​khluu

Commits
  • bc150f5 [CI] Automate Docker Hub release image publishing (#40415)
  • 9bc5a0d [Bugfix] Remove invalid deepstack boundary check for Qwen3-VL (#40932)
  • fa8acca [Bugfix] Fix failure to allocate KV blocks error (#41282)
  • 637495c [Bugfix] Plumb hidden_dim_unpadded through moe_forward fake to fix gpt-oss MX...
  • fbd51e3 [Bugfix] Fix condition to clear persistent topk so that it can be captured re...
  • 75b3867 Revert "Temporary disable persistent topk for Hopper (#41605)"
  • 132765e Revert "[DSv4] Use cvt PTX for FP32->FP4 conversion (#41015)"
  • 43a21e6 Temporary disable persistent topk for Hopper (#41605)
  • f98b274 [DSv4] Tune default value of VLLM_MULTI_STREAM_GEMM_TOKEN_THRESHOLD (#41526)
  • 228d225 [DSV4] Guard megamoe flag with Pure TP (#41522)
  • Additional commits viewable in compare view

@dependabot dependabot Bot added dependencies Pull requests that update a dependency file python Pull requests that update python code labels May 10, 2026
@dependabot dependabot Bot requested a review from umyunsang as a code owner May 10, 2026 17:35
@dependabot dependabot Bot added dependencies Pull requests that update a dependency file python Pull requests that update python code labels May 10, 2026
Updates the requirements on [vllm](https://github.com/vllm-project/vllm) to permit the latest version.
- [Release notes](https://github.com/vllm-project/vllm/releases)
- [Changelog](https://github.com/vllm-project/vllm/blob/main/RELEASE.md)
- [Commits](vllm-project/vllm@v0.19.1...v0.20.2)

---
updated-dependencies:
- dependency-name: vllm
  dependency-version: 0.20.2
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot Bot force-pushed the dependabot/pip/vllm-gte-0.20.2 branch from 732c855 to f244cdb Compare May 10, 2026 17:36
@github-actions github-actions Bot added the infra 인프라/배포 관련 label May 10, 2026
Copy link
Copy Markdown
Contributor

@github-actions github-actions Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dependabot auto-approval (CODEOWNER)

@github-actions github-actions Bot merged commit d64f822 into main May 10, 2026
4 checks passed
@dependabot dependabot Bot deleted the dependabot/pip/vllm-gte-0.20.2 branch May 10, 2026 17:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file infra 인프라/배포 관련 python Pull requests that update python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants