Commit ec2d1b0
release: v0.4.2 — performance optimization documentation + debug tooling
v0.4 series summary (29% → 75.2% Combined QA):
Retrieval precision:
- BM25 min-max score normalization (+36.7%p vs 1.0 cap)
- Document coherence boost (same-doc chunks get +5%/extra)
- Reranker score blending (0.7 reranker + 0.3 fusion signal)
- Full ingest mode with HyPE enabled (+9.5%p)
Generation quality:
- Citation mapping handles [Source N] format + range validation
- Sentence-boundary-aware context truncation (Korean + English)
- Finance metric cross-verification in fact_verifier
Engine optimization:
- Adaptive post-correction time budget (80s total target)
- Query deadline gate (70s) skips expensive late-stage steps
- Auto-skip correction for simple confident queries
- Sub-query cap (3→2) to reduce parallel retrieval cost
Playground:
- Pipeline trace visualization with Retrieve/Generate/Other breakdown
- Source excerpts visible by default with document titles
- Code block rendering fix (placeholder-based extraction)
- Query options panel (top_k, rerank, trace, stream toggles)
- `quantumrag demo` command for instant one-line experience
Infrastructure:
- `serve` auto-detects quantumrag.yaml in current directory
- `from_yaml()` loads .env for API keys
- Server startup prints provider/model/embedding info
- FAISS upsert stale reference bug fix (850 tests, 0 failures)
Optimization lessons documented in CLAUDE.md:
- Fusion weight tuning exhausted (4 attempts, all negative)
- Current 40/35/25 weights are near-optimal
- Next breakthrough requires embedding model change or noise reduction
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 9d42928 commit ec2d1b0
4 files changed
Lines changed: 13 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
47 | | - | |
| 47 | + | |
48 | 48 | | |
49 | | - | |
50 | 49 | | |
51 | 50 | | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
52 | 56 | | |
53 | 57 | | |
54 | 58 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
36 | | - | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
37 | 41 | | |
38 | 42 | | |
39 | 43 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
7 | | - | |
| 7 | + | |
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
0 commit comments