inspect-ai
Here are 15 public repositories matching this topic...
Consolidated model evaluation framework for LLM benchmarking with Ollama
-
Updated
Apr 2, 2026 - Python
Factor(UT): Controlling Untrusted AI Decomposers — AAAI 2026 workshop paper on monitoring untrusted decomposition in code generation workflows.
-
Updated
May 1, 2026 - Jupyter Notebook
Benchmark for measuring instrumental-convergence behaviour in tool-using LLM agents
-
Updated
May 9, 2026 - Python
LLM agent that plays the Wikipedia game, built on AISI Inspect
-
Updated
May 17, 2026 - Python
Inspect AI task pack for financial agent safety and first public LangGraph adapter for Inspect AI.
-
Updated
May 14, 2026 - Python
Export Inspect Petri alignment audits to Braintrust experiments, with first-class Ollama support.
-
Updated
May 14, 2026 - Python
Automated prompt optimization for Inspect AI via structured failure analysis
-
Updated
Feb 4, 2026 - Python
Reusable audit scaffold for detecting prefill awareness confounds in transcript-based AI evals
-
Updated
Apr 23, 2026 - Python
EuroSafeAI's AI safety certificiation pipeline.
-
Updated
May 4, 2026 - Python
Run inspect_ai evals via Claude Code CLI — use your Claude subscription instead of per-token API billing
-
Updated
Apr 13, 2026 - Python
Personal research project — solo, unaffiliated. Inspect AI evaluation framework for LLM agent security: ASR, benign utility, and Transparency Rate across prompt injection, tool poisoning, and psych attacks.
-
Updated
May 6, 2026 - Python
PaperBench-style Inspect AI benchmark for computational reproducibility in computational biology.
-
Updated
May 10, 2026 - Python
BioProtocolBench: an Inspect AI evaluation environment for stochastic agent execution of benign molecular-microbiology protocols.
-
Updated
May 14, 2026 - Python
PRML pre-registration adapter for Inspect AI eval logs. MIT.
-
Updated
May 16, 2026 - Python
Improve this page
Add a description, image, and links to the inspect-ai topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the inspect-ai topic, visit your repo's landing page and select "manage topics."