Skill Being Reviewed
Skill name: agentic-top-10
Skill path: skills/ai-security/agentic-top-10/
False Positive Analysis
Benign-looking memory design that can be over-credited:
memory:
store: pgvector
write_policy: "agent may save useful facts"
retrieval: "top_k: 8"
delete_endpoint: "/memory/{id}"
Why this is a false positive:
The design acknowledges persistent memory but does not prove that memory writes are approved, provenance-tagged, integrity-protected, retrievable by trust tier, quarantinable, or removable after poisoning. A review can mark AG04 Memory Poisoning as mitigated because a vector store exists or memory is "reviewed periodically" while poisoned content can still be written, retrieved into future prompts, and retained indefinitely.
Coverage Gaps
Missed variant 1: untrusted write path becomes long-term context
User-submitted documents or tool outputs are summarized and saved to persistent memory without trust labels, source attribution, or human approval.
Missed variant 2: retrieval ignores trust boundaries
System-provided memory, user-submitted context, and agent-generated notes are mixed in the same vector index and ranked only by similarity.
Missed variant 3: poisoned memory cannot be contained
The system has no quarantine, tombstone, re-embedding, audit replay, or downstream cache invalidation process after a poisoned memory entry is discovered.
Edge Cases
- Ephemeral session memory can be lower risk than cross-session memory but still needs prompt-injection handling before reuse within the same task.
- User-authored personal memory may be acceptable when it is isolated per user and never promoted to system/developer trust.
- Vector deletes may not remove derived embeddings, summaries, caches, or replicated indexes.
Remediation Quality
Comparison to Other Tools
| Tool |
Catches this? |
Notes |
| Vector DB access controls |
Partial |
Can restrict direct writes, but cannot prove agent-level memory promotion policy. |
| RAG evaluation suites |
Partial |
May detect bad retrieval quality but often miss trust-tier and containment evidence. |
| Prompt-injection tests |
Partial |
Exercise injection payloads but usually do not validate persistent memory cleanup and replay. |
Overall Assessment
Strengths: Strong AG04 explanation and high-level mitigations around provenance, append-only integrity, trust separation, and memory decay.
Needs improvement: Add operational evidence requirements so reviewers can distinguish a documented memory concept from an enforceable memory integrity lifecycle.
Priority recommendations:
- Add an AG04 memory integrity evidence checklist for write, retrieval, and deletion paths.
- Require trust labels, source attribution, approval state, TTL, and integrity metadata for saved memories.
- Add output fields for memory stores, write sources, retrieval filters, quarantine/removal controls, and residual cache risk.
Sources Checked
Bounty Info
Skill Being Reviewed
Skill name:
agentic-top-10Skill path:
skills/ai-security/agentic-top-10/False Positive Analysis
Benign-looking memory design that can be over-credited:
Why this is a false positive:
The design acknowledges persistent memory but does not prove that memory writes are approved, provenance-tagged, integrity-protected, retrievable by trust tier, quarantinable, or removable after poisoning. A review can mark AG04 Memory Poisoning as mitigated because a vector store exists or memory is "reviewed periodically" while poisoned content can still be written, retrieved into future prompts, and retained indefinitely.
Coverage Gaps
Missed variant 1: untrusted write path becomes long-term context
User-submitted documents or tool outputs are summarized and saved to persistent memory without trust labels, source attribution, or human approval.
Missed variant 2: retrieval ignores trust boundaries
System-provided memory, user-submitted context, and agent-generated notes are mixed in the same vector index and ranked only by similarity.
Missed variant 3: poisoned memory cannot be contained
The system has no quarantine, tombstone, re-embedding, audit replay, or downstream cache invalidation process after a poisoned memory entry is discovered.
Edge Cases
Remediation Quality
Comparison to Other Tools
Overall Assessment
Strengths: Strong AG04 explanation and high-level mitigations around provenance, append-only integrity, trust separation, and memory decay.
Needs improvement: Add operational evidence requirements so reviewers can distinguish a documented memory concept from an enforceable memory integrity lifecycle.
Priority recommendations:
Sources Checked
Bounty Info