Code and lightweight audit artifacts for auditable repair of scientific reasoning graph extraction on a 350-row benchmark.
knowledge-graph reproducibility graph-extraction llm-evaluation scientific-reasoning auditable-ai factscore reasoning-graphs
-
Updated
Jun 22, 2026 - Python