[REVIEW] model-supply-chain: add model promotion evidence gates

## Skill Being Reviewed
**Skill name:** `model-supply-chain`
**Skill path:** `skills/ai-security/model-supply-chain/`

## False Positive Analysis

**Benign-looking release flow that can be over-credited:**
```yaml
model:
  source: huggingface://org/model
  revision: main
evaluation:
  report: eval-results.json
approval:
  ticket: AI-1234
deploy:
  image: registry.example.com/inference:latest
  model_uri: s3://models/org/model/latest
```

**Why this is a false positive:**

The flow has a model source, evaluation report, approval ticket, and deployment target, but it does not prove that the evaluated artifact is the same artifact deployed to production. Mutable refs such as `main`, `latest`, and unpinned S3 paths can change between evaluation and release. A review can credit provenance, model card, and evaluation controls while missing model substitution during promotion.

## Coverage Gaps

**Missed variant 1: evaluated revision differs from deployed revision**

The evaluation report references a model repo name, while deployment pulls the current default branch or latest object path.

**Missed variant 2: approval is not bound to artifact identity**

The approval ticket names the model family but not the exact digest, commit SHA, artifact URI version, model card version, evaluation run ID, or signing attestation.

**Missed variant 3: rollback uses unverified artifacts**

Rollback points to a previous alias or bucket prefix without verifying its checksum, signature, evaluation status, and known vulnerability/backdoor test status.

## Edge Cases

- API-only hosted models may not expose weight digests, but still need provider version IDs, deployment IDs, or immutable snapshot references.
- LoRA/adapter releases must bind base model digest and adapter digest together.
- Canary and shadow deployments should record the exact artifact identity and evaluation gates used for each environment.

## Remediation Quality

- [x] Fix resolves the vulnerability
- [x] Fix doesn't introduce new security issues
- [x] Fix doesn't break functionality
- **Issues found:** Add model promotion evidence gates that bind model artifact identity, evaluation result, model card, approval, deploy manifest, and rollback target.

## Comparison to Other Tools

| Tool | Catches this? | Notes |
|------|:---:|-------|
| Model registry | Partial | Can track versions, but reviewers must verify promotion policies and deployment references. |
| MLflow / experiment tracker | Partial | Records runs and metrics, but may not enforce production artifact digest binding. |
| SLSA / attestations | Partial | Provides provenance when generated and verified; release policy must consume it. |

## Overall Assessment

**Strengths:** Strong coverage of model provenance, training pipeline integrity, model card completeness, and backdoor detection.

**Needs improvement:** Add release/promotion evidence so reviewers can prove that the model approved after evaluation is the immutable model deployed and rolled back in production.

**Priority recommendations:**
1. Add a model promotion gate checklist under provenance or fine-tuning pipeline review.
2. Require artifact digest/revision, evaluation run ID, model card version, approval ID, and deploy manifest to match.
3. Add output fields for promotion status, mutable references, rollback verification, and environment-specific artifact identity.

## Sources Checked

- SLSA v1.0: https://slsa.dev/spec/v1.0/
- OWASP Top 10 for LLM Applications 2025: https://owasp.org/www-project-top-10-for-large-language-model-applications/
- NIST AI RMF 1.0: https://www.nist.gov/itl/ai-risk-management-framework

## Bounty Info
- [x] I have read and agree to the [CONTRIBUTING.md](../../CONTRIBUTING.md) bounty terms
- **Preferred payment method:** GitHub Sponsors


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REVIEW] model-supply-chain: add model promotion evidence gates #1377

Skill Being Reviewed

False Positive Analysis

Coverage Gaps

Edge Cases

Remediation Quality

Comparison to Other Tools

Overall Assessment

Sources Checked

Bounty Info

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Tool	Catches this?	Notes
Model registry	Partial	Can track versions, but reviewers must verify promotion policies and deployment references.
MLflow / experiment tracker	Partial	Records runs and metrics, but may not enforce production artifact digest binding.
SLSA / attestations	Partial	Provides provenance when generated and verified; release policy must consume it.

[REVIEW] model-supply-chain: add model promotion evidence gates #1377

Description

Skill Being Reviewed

False Positive Analysis

Coverage Gaps

Edge Cases

Remediation Quality

Comparison to Other Tools

Overall Assessment

Sources Checked

Bounty Info

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions