feat(device-plugin): supports dynamic detection of hot plugged in GPUs #13

Merged
dkeven merged 1 commit into feat/nvshare from feat/hot_plug_in
Jan 22, 2026
@dkeven dkeven commented Jan 22, 2026

What type of PR is this?

/kind feature

What this PR does / why we need it:
Resolves several issues that prevented HAMi from dynamically detecting hot-plugged GPUs:
- Rebuild the device map, previously built only once at initialization, on every loop iteration of CheckHealth and WatchAndRegister.
- Pair every nvml.Init call with an nvml.Shutdown within the same function call, instead of keeping NVML initialized for the whole process lifecycle.
- Delay garbage collection to give cold-plugged eGPUs time to be initialized by Thunderbolt and the NVIDIA driver.
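
The first two points can be illustrated with a minimal sketch. It is not the code from this PR: the `NVML` interface, `SnapshotDevices`, and `fakeNVML` are hypothetical names introduced here so the example runs without a GPU or the real NVML bindings. The idea is that each call initializes NVML, rebuilds the device map from scratch, and shuts NVML down before returning, so a GPU attached between two calls shows up in the second snapshot.

```go
package main

import (
	"errors"
	"fmt"
)

// NVML is a hypothetical, minimal abstraction over the NVML calls needed
// here. It is an assumption for illustration, not HAMi's or go-nvml's API.
type NVML interface {
	Init() error
	Shutdown() error
	DeviceUUIDs() ([]string, error)
}

// SnapshotDevices rebuilds the UUID -> index device map on every call.
// Each Init is matched by a Shutdown before the function returns, so no
// NVML state is held between calls.
func SnapshotDevices(lib NVML) (devices map[string]int, err error) {
	if err = lib.Init(); err != nil {
		return nil, fmt.Errorf("nvml init: %w", err)
	}
	defer func() {
		// Report a shutdown failure only if nothing else failed first.
		if sErr := lib.Shutdown(); sErr != nil && err == nil {
			devices, err = nil, fmt.Errorf("nvml shutdown: %w", sErr)
		}
	}()

	var uuids []string
	if uuids, err = lib.DeviceUUIDs(); err != nil {
		return nil, fmt.Errorf("enumerate devices: %w", err)
	}
	devices = make(map[string]int, len(uuids))
	for i, uuid := range uuids {
		devices[uuid] = i
	}
	return devices, nil
}

// fakeNVML is a hypothetical in-memory stand-in for the driver library.
type fakeNVML struct {
	attached []string // UUIDs of GPUs currently plugged in
	open     int      // Init calls not yet matched by Shutdown
}

func (f *fakeNVML) Init() error { f.open++; return nil }

func (f *fakeNVML) Shutdown() error {
	if f.open == 0 {
		return errors.New("shutdown without matching init")
	}
	f.open--
	return nil
}

func (f *fakeNVML) DeviceUUIDs() ([]string, error) {
	if f.open == 0 {
		return nil, errors.New("nvml not initialized")
	}
	return append([]string(nil), f.attached...), nil
}
```

In a loop such as CheckHealth or WatchAndRegister, a function like this would be called once per iteration and its result compared with the previous snapshot to spot newly attached or removed devices.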

@dkeven dkeven merged commit 2c572ae into feat/nvshare Jan 22, 2026
1 check passed
@dkeven dkeven deleted the feat/hot_plug_in branch January 22, 2026 08:33
