Skip to content

[RFC] Pod lifecycle subscription through data layer #2017

@vMaroon

Description

@vMaroon

Motivation

For Active-Active Multi-Replica HA enablement in llm-d's precise prefix-cache indexing, a new sub-system is introduced to handle discovery and subscription to vLLM KVEvent publishers.

For the discovery part, informer-like functionality is required. For standalone deployments, I appended a controller example, but for proper integration with the llm-d inference-scheduler, the logic should be provided through the data-layer.

Proposal

After discussions with @elevran, we propose adding a Pod lifecycle source into the data-layer, for the precise-prefix-cache scorer to subscribe to.

cc @elevran @kfswain

Metadata

Metadata

Assignees

No one assigned

    Labels

    triage/acceptedIndicates an issue or PR is ready to be actively worked on.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions