Skip to content

fix(driver): surface the real startup failure instead of swallowing it#305

Merged
jupiterv2 merged 1 commit into
mainfrom
dev/jupiterv2.10/surface-startup-error
Jun 18, 2026
Merged

fix(driver): surface the real startup failure instead of swallowing it#305
jupiterv2 merged 1 commit into
mainfrom
dev/jupiterv2.10/surface-startup-error

Conversation

@jupiterv2

Copy link
Copy Markdown
Contributor

What

When the streaming driver's startup.main() fails at the get-processor step, the real error is currently lost — a crash-looping driver logs only startup failed, will retry after 30s with no reason.

Two gaps cause this:

  • Main() discards main()'s returned error: exitCode, _ := main(...), then logs a generic retry line.
  • In main(), the deferred error logger (the "record meta error" defer) is registered after the get-processor step, and getProcessor returns its wrapped error before that — so unlike the connect-to-processor-service step (which has an explicit Errorfe), a get-processor failure is logged nowhere.

Changes

  • Main() keeps main()'s error and includes it on the retry line (Infofe).
  • getProcessor logs explicitly on both failure paths — the GetProcessorWithProject RPC error and the FromPB error.

No behavior change beyond logging; the retry/exit-code flow is unchanged.

Test

  • bazel build //driver/controller/startup/... — green.

When startup's main() failed at the "get processor" step, the error was lost:
Main() discarded main()'s returned error (`exitCode, _ := main(...)`) and only
logged a generic "startup failed, will retry", and getProcessor returned the
wrapped error before the meta-state error-logging defer was registered — so
nothing logged the cause. A crash-looping driver showed only the retry line
with no reason.

- Main() now keeps main()'s error and logs it on the retry line (Infofe).
- getProcessor logs explicitly on both the GetProcessorWithProject RPC error
  and the FromPB error.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@jupiterv2 jupiterv2 merged commit ff29bd2 into main Jun 18, 2026
1 check passed
@jupiterv2 jupiterv2 deleted the dev/jupiterv2.10/surface-startup-error branch June 18, 2026 13:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant