# [REVIEW] sast-config: add polyglot monorepo workspace-scoped SARIF completeness gates

## Skill Being Reviewed
**Skill name:** `sast-config`
**Skill path:** `skills/devsecops/sast-config/`

## False Positive Analysis

**Benign-looking SAST maturity result that can be incorrectly scored as broad coverage:**

```yaml
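# .github/workflows/codeql.yml (excerpt, job-level keys)
strategy:
  matrix:
    language: [javascript]

# CodeQL configuration file passed to the init step (excerpt; file location assumed)
paths-ignore:
  - services/**
  - apps/**
  - packages/**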
```

```text
SAST Maturity Assessment Summary
- Active tool: CodeQL
- CWE Top 25 coverage: 18/25 for JavaScript
- CI gate: pass on PR
- Custom rules: 12 Semgrep rules in /security/semgrep
```

**Why this is a false positive:**

The skill can report healthy CWE coverage for "JavaScript" while the repository is a polyglot monorepo and the CI workflow only analyzes a narrow path or a single language slice. In the example, ignoring `services/`, `apps/`, and `packages/` means the assessment covers almost none of the production code. The skill checks whether rules exist and whether CWE rows are mapped, but not whether the analyzed workspace spans all language ecosystems and deployable components.
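A minimal sketch of the missing check, under stated assumptions: the file list comes from something like `git ls-files`, the extension map `EXT_TO_LANG` is hypothetical and incomplete, and `fnmatch` only approximates the scanner's real glob semantics. It reports, per language, how many files the configured scan can actually reach.

```python
from collections import Counter
from fnmatch import fnmatch
from pathlib import PurePosixPath

# Hypothetical extension map; CodeQL's "javascript" language also analyzes TypeScript.
EXT_TO_LANG = {
    ".js": "javascript",
    ".ts": "javascript",
    ".go": "go",
    ".py": "python",
    ".java": "java",
}


def language_reach(files, scanned_languages, ignore_globs):
    """Return {language: (reachable_files, total_files)} for the given scan setup."""
    totals, reached = Counter(), Counter()
    for path in files:
        lang = EXT_TO_LANG.get(PurePosixPath(path).suffix)
        if lang is None:
            continue  # not a source file this sketch knows about
        totals[lang] += 1
        # fnmatch's "*" also matches "/", so "apps/**" covers nested paths.
        ignored = any(fnmatch(path, glob) for glob in ignore_globs)
        if lang in scanned_languages and not ignored:
            reached[lang] += 1
    return {lang: (reached[lang], totals[lang]) for lang in sorted(totals)}
```

Any language whose first number is zero, or far below the second, is a candidate finding even when the CWE table for the scanned language looks healthy.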

## Coverage Gaps

**Missed variant 1: Per-package SARIF uploaded from subdirectory scan only**

```yaml
# turbo / nx monorepo
projects:
  - payments-api (Go)
  - web-checkout (TypeScript)
  - auth-worker (Python)
```

```bash
semgrep ci --config p/ci --subdir apps/web-checkout
# SARIF uploaded as full-repo scan result
```

**Why it should be caught:** ASVS mapping can appear complete for TypeScript while Go and Python services remain unscanned. The skill should require a component-to-scan-artifact matrix, not a single global coverage table.
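One possible shape for such a matrix, as a sketch only: the `Component` and `ScanArtifact` records are hypothetical, and the directory layout for `payments-api` and `auth-worker` in the usage note is assumed for illustration. The key idea is that each artifact records the directory the scanner was actually invoked on, not the label it was uploaded under.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class Component:
    name: str
    path: str       # repo-relative directory of the deployable component
    language: str


@dataclass(frozen=True)
class ScanArtifact:
    scanner: str
    scanned_root: str       # directory the scanner really ran on
    languages: frozenset


def covers(artifact, component):
    """True if the scan root contains the component and the language matches."""
    root = PurePosixPath(artifact.scanned_root)
    comp = PurePosixPath(component.path)
    in_scope = comp == root or root in comp.parents
    return in_scope and component.language in artifact.languages


def uncovered_components(components, artifacts):
    """Return names of components that no scan artifact covers."""
    return sorted(
        c.name for c in components
        if not any(covers(a, c) for a in artifacts)
    )
```

With the three projects above and a single Semgrep artifact whose `scanned_root` is `apps/web-checkout`, `uncovered_components` would return `payments-api` and `auth-worker`, regardless of how the SARIF upload was labeled.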

**Missed variant 2: Generated code and vendor subtree inflates pass rate**

```text
Scanned files: 4,812
Generated protobuf/grpc: 4,103
Handwritten source: 709
Gate result: pass (0 findings)
```

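**Why it should be caught:** Suppressed findings and low-signal generated files can hide missing coverage of handwritten code. In this example only 709 of the 4,812 scanned files (about 15%) are handwritten, and the result does not show what share of the repository's handwritten code was scanned at all. The skill should gate on `% handwritten LOC scanned` or equivalent build-target coverage.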
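A rough sketch of such a gate follows. It counts files rather than LOC as a simplification, the `GENERATED_GLOBS` patterns are assumptions that would need tuning per repository, and the 0.9 default threshold is arbitrary.

```python
from fnmatch import fnmatch

# Assumed patterns for generated or vendored code; adjust per repository.
GENERATED_GLOBS = ("*.pb.go", "*_pb2.py", "*_pb2_grpc.py", "vendor/*", "*/vendor/*")


def is_generated(path):
    """True if the path matches any generated/vendored pattern."""
    return any(fnmatch(path, glob) for glob in GENERATED_GLOBS)


def handwritten_coverage(repo_files, scanned_files):
    """Fraction of handwritten repo files that appear in the scanned set."""
    handwritten = {f for f in repo_files if not is_generated(f)}
    if not handwritten:
        return 1.0  # nothing handwritten to cover
    return len(handwritten & set(scanned_files)) / len(handwritten)


def passes_gate(repo_files, scanned_files, minimum=0.9):
    """Gate on handwritten coverage instead of on the raw finding count."""
    return handwritten_coverage(repo_files, scanned_files) >= minimum
```

The denominator is the handwritten code in the repository, not the scanned file count, so adding more generated files to the scan cannot raise the score.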

**Missed variant 3: CodeQL autobuild succeeds only for root app while failing silently for nested modules**

```text
CodeQL job summary:
- autobuild: success
- extracted: 1 Java database (root build.gradle only)
- modules not extracted: payments-core, ledger-worker
```

**Why it should be caught:** Existing reviews mention build completeness, but none require proof that every module inside the monorepo workspace boundary was actually extracted. A single successful autobuild should not satisfy coverage for all compiled modules.
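As an illustration of a workspace boundary check, the sketch below compares modules declared in a Gradle settings file against the files a scan reports as extracted. It assumes simple Groovy-style `include ':module'` lines and assumes the list of extracted file paths has already been obtained from the scanner's output; how to obtain that list depends on the setup and is not shown.

```python
import re


def declared_modules(settings_gradle_text):
    """Extract module directories from simple `include ':a', ':b'` lines."""
    modules = []
    for line in settings_gradle_text.splitlines():
        line = line.strip()
        # `include\b` deliberately skips `includeBuild`.
        if re.match(r"include\b", line):
            modules += re.findall(r"""['"]:?([\w\-/:.]+)['"]""", line)
    # Gradle project paths use ":" as a separator; map to directories.
    return [m.replace(":", "/") for m in modules]


def missing_modules(declared, extracted_files):
    """Return declared modules with no extracted file under their directory."""
    return [
        module for module in declared
        if not any(path.startswith(module + "/") for path in extracted_files)
    ]
```

A non-empty result from `missing_modules` would mean the autobuild's success status alone is not evidence of coverage for those modules.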

## Edge Cases

- **Bazel/Gradle composite builds:** Extraction may cover targets that are not shipped to production; the skill should map scan targets to release artifacts.
- **Fork PR scans:** `pull_request_target` workflows check out the base branch by default, so a scan may analyze base code only and misrepresent PR delta coverage.
- **Baseline suppression files:** A repo-wide `baseline` may hide findings in one package while giving another package a false clean bill of health.
- **Shared ruleset with language filters disabled:** One Semgrep config file is referenced everywhere, but only the Java rules are enabled in CI.

## Remediation Quality

- [x] Fix resolves the vulnerability
- [x] Fix doesn't introduce new security issues
- [x] Fix doesn't break functionality
- **Issues found:** Add monorepo completeness gates to Step 1 Discovery and Step 2 CWE coverage validation.

## Comparison to Other Tools

| Tool | Catches this? | Notes |
|------|:---:|-------|
| Semgrep AppSec platform | Partial | Has project/tag scoping if configured; skill does not require it |
| CodeQL dependency analysis | Partial | Shows extracted languages/LOC if reviewer inspects logs |
| SonarQube monorepo | Partial | Can do per-project gates; skill lacks equivalent requirement |
| GitHub Advanced Security code scanning | Partial | SARIF category metadata exists but skill ignores it |

## Overall Assessment

**Strengths:** Solid CWE/ASVS mapping framework, good discovery patterns, and useful severity-tuning guidance.

**Needs improvement:** Coverage is evaluated at repo level, not deployable-component level. Polyglot monorepos are the common case for the target audience, so false completeness is likely.

**Priority recommendations:**
1. Add a "Workspace Coverage Matrix" output: each production component, language, scanner, last successful scan commit, and LOC extracted.
2. Treat single-language green CI in a polyglot repo as a **High** finding until all release components are mapped.
3. Require SARIF/run logs to prove which paths were included/excluded; do not accept global CWE coverage without path evidence.
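The recommendations above could be prototyped roughly as follows. This is a sketch, not the skill's implementation: it reads the optional `runs[].artifacts[].location.uri` entries from parsed SARIF 2.1.0 logs, which not every scanner populates, and the component-to-path mapping is a hypothetical input supplied by the reviewer.

```python
def sarif_paths(sarif):
    """Yield (tool name, artifact URI) pairs from a parsed SARIF 2.1.0 log."""
    for run in sarif.get("runs", []):
        tool = run.get("tool", {}).get("driver", {}).get("name", "unknown")
        for artifact in run.get("artifacts", []):
            uri = artifact.get("location", {}).get("uri")
            if uri:
                yield tool, uri


def coverage_matrix(components, sarif_logs):
    """Build {component: {"path", "scanners", "files"}} from SARIF path evidence."""
    matrix = {
        name: {"path": path, "scanners": set(), "files": 0}
        for name, path in components.items()
    }
    for log in sarif_logs:
        for tool, uri in sarif_paths(log):
            for row in matrix.values():
                # Count the file only if it lies under the component directory.
                if uri.startswith(row["path"].rstrip("/") + "/"):
                    row["scanners"].add(tool)
                    row["files"] += 1
    return matrix


def unproven_components(matrix):
    """Components with no path evidence in any SARIF log."""
    return sorted(name for name, row in matrix.items() if row["files"] == 0)
```

A fuller matrix would also carry language, last successful scan commit, and extracted LOC per row; those fields are omitted here because their source depends on the scanner and CI setup.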

## Bounty Info
- [x] I have read and agree to the [CONTRIBUTING.md](https://github.com/UnitOneAI/SecuritySkills/blob/main/CONTRIBUTING.md) bounty terms
- **Preferred payment method:** PayPal

