Fix failing test suite: restore sorry placeholders in example problems by Cuuper22 · Pull Request #28 · Cuuper22/Erdos

Cuuper22 · 2026-06-10T00:02:01Z

What

Restores the unsolved (sorry-placeholder) versions of the four example problem files in examples/. The test suite on main currently fails with 14 failed / 300 passed; with this change it's 314 passed, 9 skipped.

Why

PR #27 added tests/test_operational.py, which asserts that example problem files contain sorry placeholders (test_original_content_is_preloaded, test_intermediate_has_sorry_placeholders). But the same PR committed the example files with completed proofs — solver output evidently leaked into the working tree before commit. The tests it added have been failing ever since.

Solved example files also defeat the project's premise: problems must ship unsolved for the Prover/Critic loop (and the README's mock-mode demo) to have anything to prove.

Changes

examples/test_theorem.lean, examples/basic_algebra.lean, examples/number_theory.lean: restored verbatim to their pre-Fix critical bugs and add comprehensive test suite for Erdos proof system #27 versions (from commit 970c3a8).
examples/intermediate.lean: this file was introduced by Fix critical bugs and add comprehensive test suite for Erdos proof system #27 already solved, so there's no prior version — its six theorem statements are kept and each proof body replaced with sorry (6 ≥ the 5 the test requires).

Verification

python -m pytest tests/ -q → 314 passed, 9 skipped (the 9 skips are environment-gated and unchanged from before).
Confirmed the solver writes solutions to a separate work dir (solution_<id>.json + ZIP), never back into examples/, so the current code won't re-introduce this.

Notes for the maintainer

The other 12 failures I saw locally (TestGeminiProvider, provider factory) were a broken cffi wheel in my container, not a code issue — google-generativeai imports fine after reinstalling cffi, and CI installs it cleanly via pyproject deps.
GitHub Actions has zero workflow runs for this repo, despite .github/workflows/ci.yml being configured for pushes/PRs to main and the README sporting a CI badge. That's why this breakage went unnoticed. Worth checking Settings → Actions to confirm Actions is enabled — once it is, this PR should get a CI run.

https://claude.ai/code/session_01KAKnLjpFZ9oF74GNW9QNtN

Generated by Claude Code

PR #27 added tests/test_operational.py, which requires the example problem files to contain unsolved 'sorry' placeholders, but the same PR committed those files with completed proofs (solver output leaked into the working tree). The suite has failed on main ever since: - test_original_content_is_preloaded: expects 'sorry' in every manifest problem - test_intermediate_has_sorry_placeholders: expects >=5 sorries in examples/intermediate.lean Restore test_theorem.lean, basic_algebra.lean, and number_theory.lean to their pre-#27 unsolved versions, and replace the proof bodies in intermediate.lean with sorry while keeping its theorem statements. Solved files also defeat the project's purpose: problems must ship unsolved for the Prover/Critic loop to have anything to prove. Result: 314 passed, 9 skipped (was 14 failed, 300 passed, 9 skipped). https://claude.ai/code/session_01KAKnLjpFZ9oF74GNW9QNtN

Cuuper22 merged commit d94af0f into main Jun 10, 2026
0 of 5 checks passed

Cuuper22 mentioned this pull request Jun 10, 2026

Final integration: exact counts, OpenRouter in spec, black sweep #35

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix failing test suite: restore sorry placeholders in example problems#28

Fix failing test suite: restore sorry placeholders in example problems#28
Cuuper22 merged 1 commit into
mainfrom
claude/tender-lamport-s4wejv

Cuuper22 commented Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Cuuper22 commented Jun 10, 2026

What

Why

Changes

Verification

Notes for the maintainer

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants