Trying to bring good engineering practices to AI Safety Evaluations :)
- Measuring Prefill Awareness in transcript-based evals. Building an Inspect-based eval to audit existing benchmarks for Prefill Awareness as a confounding factor: prefill-awareness-audit.
- Eval tooling and methodology contributions to UK AISI's Inspect AI and Inspect Evals.
Inspect AI Ecosystem:
inspect_evals#1503— Fixmean_ofon_missing="skip"to also skip None-valued samples.inspect_evals#1501—cyberseceval_4: tolerate fenced and prose-wrapped judge JSON.inspect_ai#3709— Add vLLM chat template controls for base-model evals.inspect_evals#1429— Fix CodeIPI exfiltration scorer to check tool result messages.
- Email: joesposito8@gmail.com
- LinkedIn: joseph-esposito8



