Fix Orchestrator task persistence and file lock handling #10957

roomote · 2026-01-25T18:46:36Z

Fixes the issue where Orchestrator mode tasks would stop prematurely and disappear from the task list after creating multiple subtasks in rapid succession.

Root Cause

Lock contention during rapid file writes when Orchestrator creates multiple subtasks. Each delegation involves concurrent writes to:

- Parent task API conversation history
- Child task API conversation history
- Task history metadata

When lock acquisition failed or metadata persistence failed silently, parent tasks would appear to "disappear" from the UI.

Changes

Increased lock resilience in safeWriteJson:
- Lock staleness timeout: 31s → 60s (handles complex orchestrator scenarios)
- Lock retries: 5 → 10 attempts
- Backoff timeout range: 100-1000ms → 200-2000ms
- Enhanced error messages with contention context
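
To put the new numbers in perspective, here is a minimal sketch of how long a writer might keep waiting for a lock before giving up under the old and new settings. The `LockRetryConfig` shape, the field names, and the growth factor of 2 are assumptions made for illustration; the PR only states the staleness timeout, retry count, and backoff range, and a real implementation may also add jitter.

```typescript
// Hypothetical config shape; field names are illustrative and not taken from safeWriteJson.
interface LockRetryConfig {
  staleMs: number;      // a lock older than this is treated as stale
  retries: number;      // retry attempts after the first failed acquisition
  minTimeoutMs: number; // delay before the first retry
  maxTimeoutMs: number; // cap on any single delay
  factor: number;       // ASSUMPTION: exponential growth factor (not stated in the PR)
}

const before: LockRetryConfig = { staleMs: 31000, retries: 5, minTimeoutMs: 100, maxTimeoutMs: 1000, factor: 2 };
const after: LockRetryConfig = { staleMs: 60000, retries: 10, minTimeoutMs: 200, maxTimeoutMs: 2000, factor: 2 };

// Delay before each retry: min(minTimeout * factor^attempt, maxTimeout).
function backoffSchedule(cfg: LockRetryConfig): number[] {
  const delays: number[] = [];
  for (let attempt = 0; attempt < cfg.retries; attempt++) {
    delays.push(Math.min(cfg.minTimeoutMs * Math.pow(cfg.factor, attempt), cfg.maxTimeoutMs));
  }
  return delays;
}

// Total time spent backing off if every retry fails (ignores time spent in the attempts themselves).
function totalWaitMs(cfg: LockRetryConfig): number {
  return backoffSchedule(cfg).reduce((sum, delay) => sum + delay, 0);
}

console.log(totalWaitMs(before)); // 2500
console.log(totalWaitMs(after));  // 15000
```

Under these assumptions the worst-case backoff budget grows from 2.5s to 15s, which gives a burst of concurrent subtask writes more room to drain before a lock acquisition is reported as failed.
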
Added retry logic for critical delegation metadata:
- 3 retry attempts with exponential backoff for parent task status persistence
- User-visible warnings when all retries fail (prevents silent data loss)
- Detailed logging for debugging lock contention issues
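
The retry behaviour described above can be sketched roughly as follows. This is not the code from ClineProvider.ts: `persistWithRetry`, `RetryOptions`, `baseDelayMs`, and the `onGiveUp` callback are hypothetical names, and the base delay is an assumption since the PR only specifies 3 attempts with exponential backoff and a user-visible warning on final failure.

```typescript
type Sleep = (ms: number) => Promise<void>;

const realSleep: Sleep = (ms) => new Promise<void>((resolve) => setTimeout(resolve, ms));

// Hypothetical options; only the attempt count (3) comes from the PR description.
interface RetryOptions {
  attempts: number;                   // total tries, including the first one
  baseDelayMs: number;                // ASSUMPTION: delay before the second try
  onGiveUp: (error: unknown) => void; // e.g. surface a warning to the user
  sleep?: Sleep;                      // injectable so tests do not have to wait
}

// Runs `write`, retrying with exponentially growing delays.
// Returns true on success and false once every attempt has failed.
async function persistWithRetry(write: () => Promise<void>, opts: RetryOptions): Promise<boolean> {
  const sleep = opts.sleep ?? realSleep;
  let lastError: unknown;
  for (let attempt = 1; attempt <= opts.attempts; attempt++) {
    try {
      await write();
      return true;
    } catch (error) {
      lastError = error;
      // Log each failure so lock contention can be diagnosed afterwards.
      console.warn(`metadata persistence attempt ${attempt}/${opts.attempts} failed`, error);
      if (attempt < opts.attempts) {
        await sleep(opts.baseDelayMs * Math.pow(2, attempt - 1));
      }
    }
  }
  // Every attempt failed: report it instead of failing silently.
  opts.onGiveUp(lastError);
  return false;
}
```

A caller would pass the function that writes the parent task status as `write` and a function that shows the warning as `onGiveUp`. Returning a boolean rather than rethrowing keeps the delegation flow running while still making the failure visible.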

Testing

- All existing safeWriteJson tests pass (16/16)
- All delegation tests pass (13/13)
- No lint errors or type check failures

Important

Enhances task persistence and lock handling in Orchestrator mode by adding retry logic and increasing lock resilience in ClineProvider.ts and safeWriteJson.ts.

Behavior:
- Fixes premature stopping and disappearance of Orchestrator tasks after creating multiple subtasks.
- Adds retry logic for parent task metadata persistence in ClineProvider.ts.
- Increases lock resilience in safeWriteJson.ts with longer timeouts and more retries.
Lock Handling:
- safeWriteJson.ts: Lock staleness timeout increased from 31s to 60s, retries from 5 to 10, and backoff timeout range from 100-1000ms to 200-2000ms.
- Enhanced error messages for lock acquisition failures.
Retry Logic:
- ClineProvider.ts: 3 retry attempts with exponential backoff for parent task status persistence.
- Logs user-visible warnings and detailed debugging information when retries fail.
Testing:
- All existing safeWriteJson tests pass (16/16).
- All delegation tests pass (13/13).

This description was created by ellipsis-dev for 8a5eccb. It will automatically update as commits are pushed.

- Increase lock staleness timeout from 31s to 60s for complex orchestrator scenarios
- Increase lock retries from 5 to 10 with higher backoff timeouts (200ms-2s)
- Add retry logic (3 attempts) for critical parent task metadata persistence
- Add user-visible warnings when delegation metadata persistence fails
- Improve error messages with context about lock contention causes

This addresses issues where Orchestrator mode tasks would disappear from the task list after creating multiple subtasks in rapid succession, caused by lock contention during concurrent file writes.

roomote · 2026-01-25T18:47:03Z

Review complete. No issues found.

The changes appropriately address lock contention during rapid orchestrator delegation by:

- Increasing lock resilience parameters in safeWriteJson (staleness timeout, retries, backoff ranges)
- Adding retry logic with exponential backoff for critical parent task metadata persistence
- Providing user-visible warnings when all retries fail instead of silent failures

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jan 25, 2026

github-project-automation bot moved this to Triage in Roo Code Roadmap Jan 25, 2026

github-project-automation bot moved this to New in Roo Code Roadmap Jan 25, 2026
