Skip to content

Conversation

@biluriuday
Copy link
Contributor

Motivation

  1. added more customization options for auto node remediation functionality
  2. updated documentation for auto node remediation section

Technical Details

Test Plan

Test Result

Submission Checklist

(cherry picked from commit 4c737d702356d13e978c04bd80a3375c49c241e6)
* GPUOP-525 update auto node remediation documentation

* address review comments

(cherry picked from commit 8e3f3e0865d4c741e252676b797e6887d7274447)
* customize auto node remediation options

* address review comments

* commit generated files

* support custom labels and taints in workflow

* handle custom drain policy

* update documentation

* fix e2e test

(cherry picked from commit 8dd51968b8f00fa7e9455bd58777bb5bc5f82649)
Copy link
Contributor

@spraveenio spraveenio left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lgtm

@sajmera-pensando sajmera-pensando merged commit 9cd3392 into ROCm:main Jan 30, 2026
1 of 3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants