Skip to content

Add HQIV Arena (CLI, CI, leaderboard)#1

Merged
disregardfiat merged 5 commits into
mainfrom
feat/hqiv-arena
Jun 2, 2026
Merged

Add HQIV Arena (CLI, CI, leaderboard)#1
disregardfiat merged 5 commits into
mainfrom
feat/hqiv-arena

Conversation

@disregardfiat
Copy link
Copy Markdown
Collaborator

Summary

  • Adds the full HQIV Arena platform: hqiv-arena CLI, five-stage hqiv-arena.yml workflow, modular scoring/badges, and arena/leaderboard.json for the public site at https://disregardfiat.tech/#arena.
  • Ships Lean-aligned support modules (lightcone, metric, lean_witnesses, witnesses.json, packaged so8_generators.json) plus scripts/validate_hqiv_alignment.py.
  • Documents workflow in CONTRIBUTING.md and README.

Test plan

  • pytest tests/test_hqiv_arena.py tests/test_paper_numbers.py passes locally
  • python scripts/validate_hqiv_alignment.py exits 0
  • python scripts/arena/compute_score.py --out /tmp/arena.json --print-badges
  • HQIV Arena CI green on this PR (alignment + arena pytest gate)
  • After merge, disregardfiat.tech/#arena loads live arena/leaderboard.json

Made with Cursor

disregardfiat and others added 5 commits June 2, 2026 20:28
Ships hqiv-arena, Lean-aligned lightcone/metric/witness modules, alignment gate,
and arena/leaderboard.json for disregardfiat.tech/#arena. Arena pytest gate uses
test_hqiv_arena + test_paper_numbers while the full suite runs informational.

Co-authored-by: Cursor <cursoragent@cursor.com>
Unblocks alignment/scoring on pull requests while main merges still require lake build.

Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Jun 2, 2026

HQIV Arena — Score Report (branch)

Overall Score: 1000.0
Weighted σ (lower better): 0.0
Protected regressions: 0 (must be 0 to be eligible for merge)
Improved metrics vs baseline: 0

Full results artifact: arena-score-26854920195arena_results.json

Sigma everywhere: broad error reduction across observables is rewarded. Single-metric gaming is discouraged.
See CONTRIBUTING.md for details.

@disregardfiat disregardfiat merged commit 1c4a07b into main Jun 2, 2026
9 of 14 checks passed
@disregardfiat disregardfiat deleted the feat/hqiv-arena branch June 2, 2026 23:54
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Jun 3, 2026

HQIV Arena — Score Report (branch)

Overall Score: 1000.0
Weighted σ (lower better): 0.0
Protected regressions: 0 (must be 0 to be eligible for merge)
Improved metrics vs baseline: 0

Full results artifact: arena-score-26854569883arena_results.json

Sigma everywhere: broad error reduction across observables is rewarded. Single-metric gaming is discouraged.
See CONTRIBUTING.md for details.

disregardfiat added a commit that referenced this pull request Jun 3, 2026
- Integrated latest remote Arena CI changes (PR #1 etc.: dropped slow pytest from jobs, timeout plugin removal).
- Kept/resolved our feature work: clean src rebuild (thermo/orbital with A/Z, error-bar paper tests for flyby/SPARC/thermo/allotrope/etc., updated README, new arena metrics, etc.).
- Resolved conflicts preferring our updated README, metrics (thermo/orbital), witnesses, pyproject (with local_conditions), while taking remote's CI configs for .github, CONTRIBUTING, arena templates, validate script.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant