-
Notifications
You must be signed in to change notification settings - Fork 525
Pull requests: UKGovernmentBEIS/inspect_ai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(scorer): return NOANSWER (not INCORRECT) when model_graded grade parser fails
#4048
opened May 26, 2026 by
vladmesh
Loading…
1 of 5 tasks
fix(scorer): set Score.answer on model_graded grade-parse failure
#4039
opened May 25, 2026 by
ernestprovo23
Loading…
Bound transcript memory for long-running samples
#4038
opened May 25, 2026 by
rasmusfaber
Contributor
Loading…
3 of 5 tasks
Complete samples from buffer history
#4037
opened May 25, 2026 by
rasmusfaber
Contributor
Loading…
3 of 5 tasks
Fix transcript subscriber delivery
#4036
opened May 25, 2026 by
rasmusfaber
Contributor
Loading…
2 of 5 tasks
Add Krippendorff's α metric for multi-judge agreement
#4035
opened May 25, 2026 by
joesposito8
Contributor
Loading…
2 of 5 tasks
Arena: Add pairwise comparison with win rate and Elo metrics
#4034
opened May 25, 2026 by
showpiecep
•
Draft
align CLI --display and --effort type annotations with their choices
#4032
opened May 25, 2026 by
RecreationalMath
Contributor
Loading…
1 of 5 tasks
Agent Bridge: Add
span_id_resolver callback for providing parent span ids for model generations
#4024
opened May 24, 2026 by
jjallaire
Collaborator
Loading…
Fix Bedrock provider to support adaptive thinking and output_config (closes #3765)
#4020
opened May 22, 2026 by
ernestprovo23
Loading…
2 tasks done
test: add parse_cli_args coverage for vLLM nested arg preservation (fixes #3348)
#4019
opened May 22, 2026 by
finaspirant
Loading…
Inspect View: edit log tags and metadata in the viewer
#4014
opened May 22, 2026 by
ransomr
Collaborator
Loading…
1 of 5 tasks
text_editor: fix corrupted-history failure mode and cap history size
#4010
opened May 22, 2026 by
tadamcz
Contributor
Loading…
feat: add host-mode backend for computer() tool
#4009
opened May 22, 2026 by
marov
Loading…
1 of 5 tasks
fix(eval): record default epochs reducer consistently
#4001
opened May 21, 2026 by
herbert-apollo
Contributor
•
Draft
1 of 5 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-04-26.