A comprehensive Python framework for evaluating LLM-extracted structured data against ground truth labels. Supports binary classification, scalar values, and list fields with detailed performance metrics, confidence-based evaluation, and statistical uncertainty quantification via non-parametric bootstrap confidence intervals.
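The bootstrap confidence intervals mentioned above can be sketched roughly as follows. This is a minimal illustration, not the framework's actual API: the function name `bootstrap_ci` and its parameters are hypothetical, and it computes a percentile interval for accuracy over per-example correctness flags.

```python
import random

def bootstrap_ci(correct_flags, n_resamples=2000, alpha=0.05, seed=0):
    """Non-parametric bootstrap percentile CI for accuracy.

    correct_flags: list of 0/1 values, one per evaluated example.
    Returns (point_estimate, (ci_low, ci_high)).
    """
    rng = random.Random(seed)
    n = len(correct_flags)
    stats = []
    for _ in range(n_resamples):
        # Resample the examples with replacement and recompute accuracy.
        sample = [correct_flags[rng.randrange(n)] for _ in range(n)]
        stats.append(sum(sample) / n)
    stats.sort()
    lo = stats[int((alpha / 2) * n_resamples)]
    hi = stats[int((1 - alpha / 2) * n_resamples) - 1]
    return sum(correct_flags) / n, (lo, hi)
```

The same resampling loop applies to any per-example metric (precision over list fields, absolute error on scalars); only the statistic computed inside the loop changes.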
A new package that helps developers integration-test AI and LLM applications by validating structured outputs. It takes a user's test scenario or prompt as input, sends it to an LLM, and uses pattern matching to validate the response against the expected structure.
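Pattern-based validation of a structured reply might look like the sketch below. This is an assumption about the approach, not the package's real interface: `validate_output` and the per-field regex patterns are illustrative.

```python
import json
import re

def validate_output(raw_reply, field_patterns):
    """Check an LLM's JSON reply against per-field regex patterns.

    raw_reply: the raw string returned by the model.
    field_patterns: dict mapping field name -> regex the value must match.
    Returns (ok, list_of_errors).
    """
    try:
        data = json.loads(raw_reply)
    except json.JSONDecodeError:
        return False, ["reply is not valid JSON"]
    errors = []
    for field, pattern in field_patterns.items():
        value = data.get(field)
        if value is None:
            errors.append(f"missing field: {field}")
        elif not re.fullmatch(pattern, str(value)):
            errors.append(f"field {field!r} does not match {pattern!r}")
    return not errors, errors
```

A test then asserts `ok` is true for conforming replies and inspects `errors` for diagnostics otherwise.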
A systematic AI evaluation framework that turns subjective assessment into objective measurement, reducing research time by 85% while maintaining 95%+ accuracy through multi-LLM validation.
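One common form of multi-LLM validation is a majority vote over the answers returned by several models. The helper below is a hypothetical sketch of that idea (the name `consensus` and the `quorum` parameter are assumptions, not the framework's API):

```python
from collections import Counter

def consensus(answers, quorum=0.5):
    """Majority vote across answers from several LLMs.

    answers: list of (hashable) answers, one per model.
    Returns (winning_answer, agreement) where agreement is the fraction of
    models that gave the winning answer; winning_answer is None when no
    answer clears the quorum threshold.
    """
    counts = Counter(answers)
    top, n = counts.most_common(1)[0]
    agreement = n / len(answers)
    return (top if agreement > quorum else None, agreement)
```

The agreement fraction doubles as a confidence signal: low agreement flags examples that need human review rather than automatic scoring.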