Skip to content

CI: investigate conda-cpp-tests jobs hitting the 40m timeout #1131

@rgsl888prabhu

Description

@rgsl888prabhu

Context

The `conda-cpp-tests` job in the `pr` workflow has been hitting the outer 40-minute `timeout` wrapper around `./ci/run_ctests.sh` (in `ci/test_cpp.sh`) and failing CI as a result.

Example failing run (PR #1035):

As a short-term unblock, PR #1130 bumps the outer timeout from 40m to 50m. This issue tracks the underlying investigation — we should understand why the job is now running longer, not just raise the ceiling indefinitely.

Investigation items

  • Break down the ctest run-time per test binary (e.g. `ctest --output-on-failure -V --timeout N` or parse the `Test.xml`) to identify which suites have regressed.
  • Compare wall-clock of `conda-cpp-tests` on `main` over the last ~30 days — is this a recent regression or a gradual drift?
  • Check whether slow tests are concentrated in one area (MILP, LP, routing GES, grpc integration) so we can target the fix.
  • Confirm the regression is not specific to one runner pool or GPU (H100 vs. others in the matrix).
  • Decide: (a) reduce test runtime at the source, (b) split into parallel jobs, (c) gate slow tests behind a nightly job, or (d) accept a permanently higher ceiling.

Acceptance criteria

  • Root cause is documented on this issue.
  • Either the offending tests are sped up / skipped on PRs, or a CI topology change is proposed and merged, such that the PR `conda-cpp-tests` job consistently finishes well under the outer timeout.

Related

Metadata

Metadata

Assignees

No one assigned

    Labels

    awaiting responseThis expects a response from maintainer or contributor depending on who requested in last comment.

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions