recover ActionSet stuck in running phase after controller restart #3971

Open
abdul-saqib wants to merge 2 commits into master from bugfix/ActionSet-stuck-running

@abdul-saqib
Contributor

Change Overview

Problem

When the Kanister controller restarted (e.g. due to an OOMKill) while an ActionSet was executing, the ActionSet remained stuck in the running phase indefinitely.

Solution

Introduce handleRunningActionSet, which is called on startup for any ActionSet found in the running state. If all phases are already complete, it transitions the ActionSet to complete; otherwise, it transitions the ActionSet to failed with a clear error message prompting the user to retry.
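
For reviewers, the decision described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the `State`, `Phase`, `ActionStatus`, and `ActionSetStatus` types are simplified local stand-ins rather than Kanister's real API types, the function signature and error text are hypothetical, and treating an ActionSet with no recorded phases as incomplete is an assumption of this sketch.

```go
package main

import (
	"errors"
	"fmt"
)

// State is a simplified stand-in for the ActionSet and phase state values.
type State string

const (
	StateRunning  State = "running"
	StateFailed   State = "failed"
	StateComplete State = "complete"
)

// Phase, ActionStatus, and ActionSetStatus are hypothetical, trimmed-down
// types used only for this sketch.
type Phase struct {
	Name  string
	State State
}

type ActionStatus struct {
	Phases []Phase
}

type ActionSetStatus struct {
	State   State
	Error   string
	Actions []ActionStatus
}

var errNotRunning = errors.New("ActionSet is not in the running state")

// allPhasesComplete reports whether every recorded phase is complete.
// Assumption: a status with no recorded phases counts as incomplete.
func allPhasesComplete(s *ActionSetStatus) bool {
	seen := false
	for _, a := range s.Actions {
		for _, p := range a.Phases {
			seen = true
			if p.State != StateComplete {
				return false
			}
		}
	}
	return seen
}

// handleRunningActionSet resolves a status left in "running" by a restart:
// complete if all phases finished, otherwise failed with a retry hint.
func handleRunningActionSet(s *ActionSetStatus) error {
	if s.State != StateRunning {
		return errNotRunning
	}
	if allPhasesComplete(s) {
		s.State = StateComplete
		return nil
	}
	s.State = StateFailed
	// Hypothetical message; the wording in the PR may differ.
	s.Error = "controller restarted while the ActionSet was running; please retry"
	return nil
}

func main() {
	interrupted := &ActionSetStatus{
		State: StateRunning,
		Actions: []ActionStatus{{Phases: []Phase{
			{Name: "backup", State: StateComplete},
			{Name: "upload", State: StateRunning},
		}}},
	}
	if err := handleRunningActionSet(interrupted); err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(interrupted.State, interrupted.Error)
}
```

In this sketch the interrupted ActionSet is marked failed rather than resumed, which matches the description above: the user is told to retry instead of the controller trying to pick up a partially executed phase.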

Pull request type

Please check the type of change your PR introduces:

  • 🚧 Work in Progress
  • 🌈 Refactoring (no functional changes, no api changes)
  • 🐹 Trivial/Minor
  • 🐛 Bugfix
  • 🌻 Feature
  • 🗺️ Documentation
  • 🤖 Test
  • 🏗️ Build

Issues

Test Plan

  • 💪 Manual
  • ⚡ Unit test
  • 💚 E2E

@abdul-saqib abdul-saqib self-assigned this Mar 18, 2026
Signed-off-by: Abdul Saqib <s.abdul@veeam.com>
Signed-off-by: Abdul Saqib <s.abdul@veeam.com>
@abdul-saqib abdul-saqib force-pushed the bugfix/ActionSet-stuck-running branch from 7eac17c to 96e306a Compare March 19, 2026 05:27

Development

Successfully merging this pull request may close these issues.

[BUG] Kanister ActionSet stays in the running phase despite successful job completion