You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
score.sh: add 10-year position dimension + CI guard
New 6th scoring dimension explicitly measures the structural moats that
define our 10-year position. Previously the harness measured what we
built (structure/correctness/quality/perf/integration) but had no metric
for what makes us defensible vs llama.cpp.
Six new sub-dimensions (weights total 9):
- single_header_loc quant.h <= 16,000 lines (currently 15,785)
- single_header_size quant.h <= 700 KB (currently 646 KB)
- core_zero_deps src/core/*.c only includes libc / SIMD intrinsics
/ OS threading / project headers
- papers_implemented polar / qjl / turbo / uniform / turbo_kv = 5
- honest_corrections >=4 self-corrections logged in CHANGELOG
- pypi_distribution pyproject.toml + publish.yml present
Whitelisted in core_zero_deps: arm_neon.h, immintrin.h, wasm_simd128.h,
pthread.h, windows.h, sched.h, sys/* etc - these are all libc/OS, not
third-party deps.
Run history skip rule: --bench, --quality, --position no longer pollute
.score_history. Only --quick and --full write to the trend.
CI guard: .github/workflows/ci.yml runs `score.sh --position` on Linux
and fails the build if the position dimension drops below 75%. This
means any future change that bloats quant.h past 16K LOC, adds a
third-party dep to src/core, removes a paper implementation, etc.,
gets caught at PR time, not after PyPI ships.
Baseline (v0.8.1, Apple Silicon, --quick mode):
total 96.1% (was 94.5% under 5-dim scoring)
position 100.0%
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
0 commit comments