feat: Use inventory for raid details instead of BMC by stevekeay · Pull Request #1985 · rackerlabs/understack

stevekeay · 2026-04-27T19:11:13Z

We now have full raid controller details available after redfish
inspection so there is no need to probe the BMC for them.

We also update the business logic to group disks by size, to avoid
building RAID arrays out of mismatched disks. This behaviour is
ported from #1978

I deleted the old raid script script to avoid further confusion.

This also fixes a RAID delete problem.

Delete the old raid script script to avoid further confusion.

We now have full raid controller details available after redfish inspection so there is no need to probe the BMC for them. We also update the business logic to group disks by size, to avoid building RAID arrays out of mismatched disks.

Without this, we hit a problem with the iDRAC/PERC job: RealTimeNoRebootConfiguration stuck at 1%. Ironic never gets to its normal async RAID polling path because Sushy is still inside volume.delete() when it times out. The node would go to clean failed, and Ironic gave a somewhat misleading error message: Node ea2cdf3f-c868-42a4-b47f-5b782717b349 failed step {'interface': 'raid', 'step': 'delete_configuration', 'abortable': False, 'priority': 0}: Unable to connect to /redfish/v1/TaskService/Tasks/JID_773174628107. Error: Timeout waiting for task monitor /redfish/v1/TaskService/Tasks/JID_773174628107 (timeout = 500) I suspect the jobs is stuck because it powered on the server and attempted to boot from the volume being deleted, but that is just a guess. Either way, getting rid of the disable_ramdisk makes it work.

mfencik

LGTM

skrobul

lgtm

skrobul · 2026-04-28T11:13:52Z

+        PhysicalDisk(
+            id=disk["id"],
+            controller=controller_id,
+            size_gb=disk["size"] // 10**9,


is there any realistic scenario where the disk["size"] or disk["id"] is None? if yes, consider skipping that iteration and logging the warning

Honestly this data is brand new, and I have not seen the code that generates it, so anything could happen.

If the data is not in the expected / advertised format though, is it not better for enrol to fail and expose the problem, rather than build raid arrays with missing devices?

Fair point - it's better to fail early, I just would prefer to have a log message explaining why rather than TypeError: unsupported operand type(s) for //: 'NoneType' and 'int'

I'll improve this as a follow-on

stevekeay force-pushed the raid branch from 574ef6d to d1e39bf Compare April 27, 2026 19:16

stevekeay added 2 commits April 27, 2026 20:19

Delete old raid script - this functinoality is now inside enroll_server.

daa3a2f

Delete the old raid script script to avoid further confusion.

Use inventory for raid details instead of BMC

eccc12e

We now have full raid controller details available after redfish inspection so there is no need to probe the BMC for them. We also update the business logic to group disks by size, to avoid building RAID arrays out of mismatched disks.

stevekeay force-pushed the raid branch from d1e39bf to eccc12e Compare April 27, 2026 19:19

stevekeay requested review from RSabounds and cardoe April 27, 2026 19:23

stevekeay force-pushed the raid branch from 204f23a to b39dd03 Compare April 27, 2026 20:40

mfencik approved these changes Apr 28, 2026

View reviewed changes

skrobul approved these changes Apr 28, 2026

View reviewed changes

stevekeay added this pull request to the merge queue Apr 28, 2026

Merged via the queue into main with commit 274ad86 Apr 28, 2026
62 checks passed

stevekeay deleted the raid branch April 28, 2026 12:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Use inventory for raid details instead of BMC#1985

feat: Use inventory for raid details instead of BMC#1985
stevekeay merged 3 commits into
mainfrom
raid

stevekeay commented Apr 27, 2026 •

edited

Loading

Uh oh!

mfencik left a comment

Uh oh!

skrobul left a comment

Uh oh!

skrobul Apr 28, 2026

Uh oh!

stevekeay Apr 28, 2026

Uh oh!

skrobul Apr 28, 2026

Uh oh!

stevekeay Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

stevekeay commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mfencik left a comment

Choose a reason for hiding this comment

Uh oh!

skrobul left a comment

Choose a reason for hiding this comment

Uh oh!

skrobul Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

stevekeay Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

skrobul Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

stevekeay Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

stevekeay commented Apr 27, 2026 •

edited

Loading