[REVIEW] agentic-top-10: add persistent memory integrity evidence gates

## Skill Being Reviewed
**Skill name:** `agentic-top-10`
**Skill path:** `skills/ai-security/agentic-top-10/`

## False Positive Analysis

**Benign-looking memory design that can be over-credited:**
```yaml
memory:
  store: pgvector
  write_policy: "agent may save useful facts"
  retrieval: "top_k: 8"
  delete_endpoint: "/memory/{id}"
```

**Why this is a false positive:**

The design acknowledges persistent memory but does not prove that memory writes are approved, provenance-tagged, integrity-protected, retrievable by trust tier, quarantinable, or removable after poisoning. A review can mark AG04 Memory Poisoning as mitigated because a vector store exists or memory is "reviewed periodically" while poisoned content can still be written, retrieved into future prompts, and retained indefinitely.

## Coverage Gaps

**Missed variant 1: untrusted write path becomes long-term context**

User-submitted documents or tool outputs are summarized and saved to persistent memory without trust labels, source attribution, or human approval.

**Missed variant 2: retrieval ignores trust boundaries**

System-provided memory, user-submitted context, and agent-generated notes are mixed in the same vector index and ranked only by similarity.

**Missed variant 3: poisoned memory cannot be contained**

The system has no quarantine, tombstone, re-embedding, audit replay, or downstream cache invalidation process after a poisoned memory entry is discovered.

## Edge Cases

- Ephemeral session memory can be lower risk than cross-session memory but still needs prompt-injection handling before reuse within the same task.
- User-authored personal memory may be acceptable when it is isolated per user and never promoted to system/developer trust.
- Vector deletes may not remove derived embeddings, summaries, caches, or replicated indexes.

## Remediation Quality

- [x] Fix resolves the vulnerability
- [x] Fix doesn't introduce new security issues
- [x] Fix doesn't break functionality
- **Issues found:** Add AG04 evidence gates for memory write authorization, provenance, trust-tiered retrieval, poisoning detection, quarantine/removal, cache invalidation, and audit retention.

## Comparison to Other Tools

| Tool | Catches this? | Notes |
|------|:---:|-------|
| Vector DB access controls | Partial | Can restrict direct writes, but cannot prove agent-level memory promotion policy. |
| RAG evaluation suites | Partial | May detect bad retrieval quality but often miss trust-tier and containment evidence. |
| Prompt-injection tests | Partial | Exercise injection payloads but usually do not validate persistent memory cleanup and replay. |

## Overall Assessment

**Strengths:** Strong AG04 explanation and high-level mitigations around provenance, append-only integrity, trust separation, and memory decay.

**Needs improvement:** Add operational evidence requirements so reviewers can distinguish a documented memory concept from an enforceable memory integrity lifecycle.

**Priority recommendations:**
1. Add an AG04 memory integrity evidence checklist for write, retrieval, and deletion paths.
2. Require trust labels, source attribution, approval state, TTL, and integrity metadata for saved memories.
3. Add output fields for memory stores, write sources, retrieval filters, quarantine/removal controls, and residual cache risk.

## Sources Checked

- OWASP Top 10 for LLM Applications 2025: https://owasp.org/www-project-top-10-for-large-language-model-applications/
- NIST AI RMF 1.0: https://www.nist.gov/itl/ai-risk-management-framework
- MITRE ATLAS Data Poisoning: https://atlas.mitre.org/techniques/AML.T0020/

## Bounty Info
- [x] I have read and agree to the [CONTRIBUTING.md](../../CONTRIBUTING.md) bounty terms
- **Preferred payment method:** GitHub Sponsors


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REVIEW] agentic-top-10: add persistent memory integrity evidence gates #1374

Skill Being Reviewed

False Positive Analysis

Coverage Gaps

Edge Cases

Remediation Quality

Comparison to Other Tools

Overall Assessment

Sources Checked

Bounty Info

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Tool	Catches this?	Notes
Vector DB access controls	Partial	Can restrict direct writes, but cannot prove agent-level memory promotion policy.
RAG evaluation suites	Partial	May detect bad retrieval quality but often miss trust-tier and containment evidence.
Prompt-injection tests	Partial	Exercise injection payloads but usually do not validate persistent memory cleanup and replay.

[REVIEW] agentic-top-10: add persistent memory integrity evidence gates #1374

Description

Skill Being Reviewed

False Positive Analysis

Coverage Gaps

Edge Cases

Remediation Quality

Comparison to Other Tools

Overall Assessment

Sources Checked

Bounty Info

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions