-
Notifications
You must be signed in to change notification settings - Fork 20
Pull requests: Exgentic/exgentic
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add PinchBench benchmark adapter for coding agent evaluation
#240
opened Jun 21, 2026 by
zeroasterisk
Contributor
Loading…
feat: add GAIA benchmark for multi-step reasoning
#239
opened Jun 21, 2026 by
zeroasterisk
Contributor
Loading…
4 of 5 tasks
feat: cloud-native runner + pod-native SWE-bench sandbox (run evals in Kubernetes)
#238
opened Jun 16, 2026 by
almogtavor
Contributor
Loading…
feat: A2A agent adapter for consuming external A2A agents
#232
opened Jun 8, 2026 by
zeroasterisk
Contributor
Loading…
fix: RITS/hosted_vllm pricing tolerance, AppWorld init lock, env-conf
#231
opened Jun 2, 2026 by
korenLazar
Collaborator
Loading…
litellm/trace_logger: normalize tool definitions + input messages to OTel spec
#230
opened Jun 1, 2026 by
elronbandel
Contributor
Loading…
3 tasks
Allow custom model pricing without modifying source code
#227
opened May 18, 2026 by
elronbandel
Contributor
Loading…
fix(tau2): zero-fill missing/empty sessions in aggregate score
#224
opened May 4, 2026 by
elronbandel
Contributor
Loading…
2 of 3 tasks
feat(extract): add --scope session and --slim to batch extract
#222
opened May 4, 2026 by
elronbandel
Contributor
Loading…
fix(health): treat proxy-wrapped permanent errors as PERMANENT (auth + model-availability)
#221
opened May 4, 2026 by
elronbandel
Contributor
Loading…
fix(session): write results.json when scorer crashes
#216
opened May 3, 2026 by
elronbandel
Contributor
Loading…
feat: add Every Eval Ever format to batch publish
#39
opened Mar 19, 2026 by
elronbandel
Contributor
Loading…
3 of 4 tasks
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.