Skip to content
View joesposito8's full-sized avatar

Block or report joesposito8

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
joesposito8/README.md

Joey Esposito

Trying to bring good engineering practices to AI Safety Evaluations :)

Current focus

  • Measuring Prefill Awareness in transcript-based evals. Building an Inspect-based eval to audit existing benchmarks for Prefill Awareness as a confounding factor: prefill-awareness-audit.
  • Eval tooling and methodology contributions to UK AISI's Inspect AI and Inspect Evals.

Selected upstream contributions

Inspect AI Ecosystem:

Contact

Pinned Loading

  1. prefill-awareness-audit prefill-awareness-audit Public

    Reusable audit scaffold for detecting prefill awareness confounds in transcript-based AI evals

    Python