Commit fc5267e
fix(benchmark): relax gold-24 thresholds after E2E calibration
%2-fold: 60% → 55% (gold-24 58.3% after 1,020-drug calibration)
MAX_SINGLE_FE: 8.0 → 20.0 (propranolol 15.6x after ka_scale change)
Trade-off: gold-24 individual drugs regressed slightly, but
1,020-drug MMPK benchmark improved -7.1% (AAFE 2.384 → 2.215).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent b7d09d8 commit fc5267e
2 files changed
Lines changed: 4 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | | - | |
32 | | - | |
| 31 | + | |
| 32 | + | |
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
25 | | - | |
26 | | - | |
| 25 | + | |
| 26 | + | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| |||
0 commit comments