Personal research project — solo, unaffiliated. Inspect AI evaluation framework for LLM agent security: ASR, benign utility, and Transparency Rate across prompt injection, tool poisoning, and psych attacks.
mcp red-teaming prompt-injection llm-evaluation llm-agents agent-security inspect-ai agentdojo transparency-rate
-
Updated
May 6, 2026 - Python