
fix: account for partially-allocated gpus#644

Closed
pintohutch wants to merge 1 commit into NVIDIA:main from pintohutch:partial-alloc

Conversation

@pintohutch
Contributor

Fixes the issue mentioned in
#549 (comment).

Given KUBERNETES_VIRTUAL_GPUS is enabled:
When the Kubernetes pod mapping builds the set of new metrics, it previously ignored the case where some devices were allocated while others were not. So we add a check to each device lookup: if a device has no associated pods, we copy its "pod-less" metrics over to the final map.
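
The check described above could look roughly like the following. This is a minimal sketch, not the code in this PR: the `Metric` and `PodInfo` types, the `mapPods` function, and the use of a slice instead of the exporter's metric map are all hypothetical simplifications made for illustration.

```go
package main

import "fmt"

// Metric is a hypothetical, simplified stand-in for one exporter sample.
type Metric struct {
	GPUUUID string
	Value   string
	Pod     string // empty when no pod is attached to the device
}

// PodInfo is a hypothetical record of a pod that holds a device.
type PodInfo struct {
	Name string
}

// mapPods emits one labeled metric per pod for allocated devices.
// Devices with no associated pods keep their original "pod-less" metric
// instead of being dropped from the result.
func mapPods(metrics []Metric, deviceToPods map[string][]PodInfo) []Metric {
	out := make([]Metric, 0, len(metrics))
	for _, m := range metrics {
		pods := deviceToPods[m.GPUUUID] // nil slice when the device is unallocated
		if len(pods) == 0 {
			// Partially-allocated node: copy the pod-less metric through unchanged.
			out = append(out, m)
			continue
		}
		for _, p := range pods {
			labeled := m // copy the struct, then attach the pod label
			labeled.Pod = p.Name
			out = append(out, labeled)
		}
	}
	return out
}

func main() {
	metrics := []Metric{
		{GPUUUID: "GPU-0", Value: "10"},
		{GPUUUID: "GPU-1", Value: "20"},
	}
	// Only GPU-0 is allocated; GPU-1 has no pods.
	deviceToPods := map[string][]PodInfo{
		"GPU-0": {{Name: "pod-a"}, {Name: "pod-b"}},
	}
	for _, m := range mapPods(metrics, deviceToPods) {
		fmt.Printf("gpu=%s pod=%q value=%s\n", m.GPUUUID, m.Pod, m.Value)
	}
}
```

In this sketch the allocated device yields one metric per pod, while the unallocated device still appears once with an empty pod label.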

Signed-off-by: Daniel Clark <danielclark@google.com>
@rvatkar
Collaborator

rvatkar commented Apr 8, 2026

Can you rebase this on the latest in main?

@rvatkar
Collaborator

rvatkar commented Apr 8, 2026

@pintohutch thanks for this contribution.
After reviewing, we found that PR #594, which is already merged to main, addresses this use case. We're planning to include it in the next release.

We're closing this PR since the functionality is covered.

rvatkar closed this Apr 8, 2026

2 participants