FreeCite: A Judge-Free Benchmark for Granular Citation Evaluation in Large Language Models
Updated Feb 22, 2026 - Python
HCAC is a completion contract for agents and tools. It defines when something is truly “done” — not just structurally, but behaviorally.