Skip to content

vector-caliper: first real-data baseline (Qwen LoRA run) + dogfood notes#8

Closed
mcp-tool-shop wants to merge 1 commit into
mainfrom
baselines/tallow-fen-qwen-lora-v1
Closed

vector-caliper: first real-data baseline (Qwen LoRA run) + dogfood notes#8
mcp-tool-shop wants to merge 1 commit into
mainfrom
baselines/tallow-fen-qwen-lora-v1

Conversation

@mcp-tool-shop

Copy link
Copy Markdown
Member

First production training run fed through VectorCaliper end-to-end, per the "establish baselines, then predict/hypothesize" direction.

What this adds

  • baselines/qwen-lora-tallow-fen-v1.json — 8 checkpoint states from a real Qwen-Image style-LoRA run (rank 16, RTX 5090, 2000 steps): CLIP embedding-cloud geometry + performance per checkpoint, uncertainty via documented proxies.
  • baselines/qwen-lora-tallow-fen-v1.svg — rendered through the actual ProjectionEngine → SemanticMapper → SceneBuilder → SVGRenderer pipeline (apparently its first real-data artifact; the canonical demo bypasses it).
  • baselines/README.md + DOGFOOD-NOTES-2026-06-09.md.

The headline

Between steps 1750→2000, performance.accuracy ROSE (CLIP-sim 0.7796→0.7937) while geometry collapsed (anisotropy 8.2→12.5, effective dimension 7.0→6.76). Performance-only selection picks the overfit checkpoint; geometry+performance picks step 1250 — which independently matched the CMMD minimum AND the human eyeball call (monochrome drift on neutral subjects). The combined view caught what the single metric missed. That is the tool thesis, validated on its first real run.

Dogfood findings (full notes in the file)

  1. PCA power-iteration seeds with raw Math.random() — the seeded Mulberry32 in the same file is unused → determinism guarantee is not byte-level.
  2. tsc emits extensionless/directory relative imports → raw Node ESM cannot load dist (ERR_UNSUPPORTED_DIR_IMPORT); 32 files patched locally to render. Fix: moduleResolution: NodeNext + .js extensions in source.
  3. README scope @mcp-tool-shop/ vs package @mcptoolshop/ (+ private: true); repository.url points at a different repo.
  4. Required uncertainty.* fields have no native analogue for diffusion-LoRA runs — suggest an optional group or a domain preset with blessed proxies.
  5. The canonical demo hand-rolls its SVG and its JSON is not the ModelState schema.

🤖 Generated with Claude Code

…dogfood notes

8-checkpoint trajectory from a production Qwen-Image style-LoRA run (RTX 5090),
captured as ModelState series + rendered through the real pipeline. Headline:
geometry (anisotropy 8.2->12.5, effdim collapse) caught checkpoint-2000 overfit
that performance (CLIP-sim) alone rewarded — first live validation of the
combined-view thesis. Dogfood notes: unseeded PCA breaks the determinism
guarantee, dist is not Node-ESM-consumable (extensionless imports), README/package
scope mismatch, required uncertainty fields need a diffusion-LoRA preset, and the
canonical demo bypasses the public pipeline.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@mcp-tool-shop

Copy link
Copy Markdown
Member Author

Withdrawn — prototypes is a seed vault; vector-caliper is being resurrected as a local working repo instead (baselines + dogfood notes land there).

@mcp-tool-shop mcp-tool-shop deleted the baselines/tallow-fen-qwen-lora-v1 branch June 10, 2026 02:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant