Skip to content

[REVIEW] agentic-top-10: add persistent memory integrity evidence gates #1374

@catcherintheroad-hub

Description

@catcherintheroad-hub

Skill Being Reviewed

Skill name: agentic-top-10
Skill path: skills/ai-security/agentic-top-10/

False Positive Analysis

Benign-looking memory design that can be over-credited:

memory:
  store: pgvector
  write_policy: "agent may save useful facts"
  retrieval: "top_k: 8"
  delete_endpoint: "/memory/{id}"

Why this is a false positive:

The design acknowledges persistent memory but does not prove that memory writes are approved, provenance-tagged, integrity-protected, retrievable by trust tier, quarantinable, or removable after poisoning. A review can mark AG04 Memory Poisoning as mitigated because a vector store exists or memory is "reviewed periodically" while poisoned content can still be written, retrieved into future prompts, and retained indefinitely.

Coverage Gaps

Missed variant 1: untrusted write path becomes long-term context

User-submitted documents or tool outputs are summarized and saved to persistent memory without trust labels, source attribution, or human approval.

Missed variant 2: retrieval ignores trust boundaries

System-provided memory, user-submitted context, and agent-generated notes are mixed in the same vector index and ranked only by similarity.

Missed variant 3: poisoned memory cannot be contained

The system has no quarantine, tombstone, re-embedding, audit replay, or downstream cache invalidation process after a poisoned memory entry is discovered.

Edge Cases

  • Ephemeral session memory can be lower risk than cross-session memory but still needs prompt-injection handling before reuse within the same task.
  • User-authored personal memory may be acceptable when it is isolated per user and never promoted to system/developer trust.
  • Vector deletes may not remove derived embeddings, summaries, caches, or replicated indexes.

Remediation Quality

  • Fix resolves the vulnerability
  • Fix doesn't introduce new security issues
  • Fix doesn't break functionality
  • Issues found: Add AG04 evidence gates for memory write authorization, provenance, trust-tiered retrieval, poisoning detection, quarantine/removal, cache invalidation, and audit retention.

Comparison to Other Tools

Tool Catches this? Notes
Vector DB access controls Partial Can restrict direct writes, but cannot prove agent-level memory promotion policy.
RAG evaluation suites Partial May detect bad retrieval quality but often miss trust-tier and containment evidence.
Prompt-injection tests Partial Exercise injection payloads but usually do not validate persistent memory cleanup and replay.

Overall Assessment

Strengths: Strong AG04 explanation and high-level mitigations around provenance, append-only integrity, trust separation, and memory decay.

Needs improvement: Add operational evidence requirements so reviewers can distinguish a documented memory concept from an enforceable memory integrity lifecycle.

Priority recommendations:

  1. Add an AG04 memory integrity evidence checklist for write, retrieval, and deletion paths.
  2. Require trust labels, source attribution, approval state, TTL, and integrity metadata for saved memories.
  3. Add output fields for memory stores, write sources, retrieval filters, quarantine/removal controls, and residual cache risk.

Sources Checked

Bounty Info

  • I have read and agree to the CONTRIBUTING.md bounty terms
  • Preferred payment method: GitHub Sponsors

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions