[vLLM] Multimodal caches are not reset after weight updates #6523

## Bug: vLLM multimodal caches are not reset after weight updates

### Description

Currently, verl resets the vLLM prefix/KV cache after rollout weight updates, but it does not reset the multimodal cache or encoder cache.

For multimodal rollouts, vLLM may cache processed multimodal inputs and encoder outputs. These cached values can depend on the model weights that were loaded when they were computed. After `update_weights`, reusing multimodal or encoder outputs cached under the previous weights can cause stale features to be used during rollout.
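To illustrate the failure mode, here is a toy sketch. It is not vLLM code: `ToyEncoder`, its `encode` method, and the `reset_cache` flag are hypothetical names made up for this example. It only assumes that the cache is keyed by the input and not by the weights, which is what makes a stale hit possible.

```python
class ToyEncoder:
    """Hypothetical stand-in for an encoder with an output cache."""

    def __init__(self, weight: float):
        self.weight = weight
        self._cache: dict[str, float] = {}

    def encode(self, image_id: str, pixel: float) -> float:
        # The cache key is the input identity only; the weights are not part of it.
        if image_id not in self._cache:
            self._cache[image_id] = self.weight * pixel
        return self._cache[image_id]

    def update_weights(self, weight: float, reset_cache: bool) -> None:
        self.weight = weight
        if reset_cache:
            self._cache.clear()


enc = ToyEncoder(weight=2.0)
first = enc.encode("img-0", 3.0)             # computed with weight 2.0

enc.update_weights(5.0, reset_cache=False)
stale = enc.encode("img-0", 3.0)             # cache hit: still the old feature

enc.update_weights(5.0, reset_cache=True)
fresh = enc.encode("img-0", 3.0)             # recomputed with weight 5.0
```

Here `stale` equals `first` even though the weights changed, while `fresh` reflects the new weights.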

### Expected behavior

After updating rollout model weights, all relevant vLLM caches should be invalidated:

* prefix/KV cache
* multimodal cache
* encoder cache, when supported by the installed vLLM version

### Proposed solution

Reset all available vLLM rollout caches after weight updates.

I opened a PR with the proposed fix here:

https://github.com/verl-project/verl/pull/6522

The PR adds a `clear_all_caches` helper and calls it after `update_weights`, replacing the current prefix-only cache reset.
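For discussion, a minimal sketch of what such a helper could look like. This is not the code from the PR. The method names `reset_prefix_cache`, `reset_mm_cache`, and `reset_encoder_cache` are assumptions about what the engine object exposes and should be checked against the installed vLLM version. `FakeEngine` is a hypothetical stand-in so the sketch runs without vLLM. The sketch is synchronous; an async engine would need the calls awaited.

```python
from typing import Any, Iterable

# Assumed method names; verify against the installed vLLM version.
CACHE_RESET_METHODS = (
    "reset_prefix_cache",
    "reset_mm_cache",
    "reset_encoder_cache",
)


def clear_all_caches(
    engine: Any, method_names: Iterable[str] = CACHE_RESET_METHODS
) -> list[str]:
    """Call every cache-reset method the engine exposes.

    Methods missing on older versions are skipped.
    Returns the names of the methods that were called.
    """
    called = []
    for name in method_names:
        reset = getattr(engine, name, None)
        if callable(reset):
            reset()
            called.append(name)
    return called


class FakeEngine:
    """Hypothetical engine without an encoder-cache reset method."""

    def __init__(self) -> None:
        self.calls: list[str] = []

    def reset_prefix_cache(self) -> None:
        self.calls.append("prefix")

    def reset_mm_cache(self) -> None:
        self.calls.append("mm")


engine = FakeEngine()
called = clear_all_caches(engine)  # would run right after update_weights
```

Looking up each method with `getattr` keeps the helper usable on vLLM versions that lack one of the resets, which matches the "when supported by the installed vLLM version" condition above.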
