Are there reliable benchmarks showing Graphify improves coding agent performance on large repos? #1328
real-worlds
started this conversation in
General
Replies: 1 comment
-
|
I've created a reproducible benchmark framework to measure whether Graphify improves agent performance: https://github.com/FolatheDuckofDuckingburg/graphify/tree/v8/benchmarks This includes: 16 concrete benchmark tasks (bug fixes, features, refactoring, architecture Q&A) Ready to run the first benchmarks to answer your question! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Without task-success benchmarks, it is hard to distinguish Graphify from a useful visualization/context-compression tool versus something that actually improves coding agent capability.
Beta Was this translation helpful? Give feedback.
All reactions