fix(#1835): require file reads before asserting contents in findings#2443
fix(#1835): require file reads before asserting contents in findings#2443fullsend-ai-coder[bot] wants to merge 1 commit into
Conversation
The review agent hallucinated file contents on PR konflux-ci/konflux-test#833, claiming a Dockerfile contained --nogpgcheck when it never did. The root cause was that the code-review skill and correctness sub-agent had no explicit requirement to read files outside the PR diff before asserting what they contain. Changes: - code-review SKILL.md step 2: added cross-file verification bullet requiring the agent to read any file it references in a finding, even if not in the diff. - code-review SKILL.md step 4: added cross-file finding self-check requiring verification that referenced files were read before finalizing findings. - correctness sub-agent: added cross-file verification section with the same read-before-assert requirement. Note: make lint could not run due to sandbox network restrictions preventing shellcheck installation. Closes #1835
E2E tests did not runE2E tests run automatically for org/repo members and collaborators on pull requests. For other contributors, a maintainer must add the See E2E testing guide for details. |
Site previewPreview: https://01f829bf-site.fullsend-ai.workers.dev Commit: |
|
🤖 Finished Review · ✅ Success · Started 7:56 PM UTC · Completed 8:05 PM UTC |
Codecov Report✅ All modified and coverable lines are covered by tests. 📢 Thoughts on this report? Let us know! |
ReviewFindingsMedium
Low
Labels: PR modifies review agent skill and sub-agent definitions under the scaffold harness. |
|
/fs-fix |
The review agent hallucinated file contents on PR konflux-ci/konflux-test#833, claiming a Dockerfile contained
--nogpgcheck when it never did. The root cause was that the
code-review skill and correctness sub-agent had no explicit requirement to read files outside the PR diff before asserting what they contain.
Changes:
bullet requiring the agent to read any file it references
in a finding, even if not in the diff.
self-check requiring verification that referenced files
were read before finalizing findings.
section with the same read-before-assert requirement.
Note: make lint could not run due to sandbox network restrictions preventing shellcheck installation.
Closes #1835
Post-script verification
agent/1835-verify-file-contents-before-asserting)59159d05e2cb01f868054a4c7143303d6049214b..HEAD)