Replies: 1 comment
-
|
A secure-skill rule would be useful if it treats skills as executable influence, not just documentation. A malicious or careless skill can change how an agent selects tools, handles secrets, edits files, or interprets user intent. The rule could check for several patterns:
For remediation, the rule should require a minimal manifest: purpose, owner, version, required tools, permission scope, expected inputs/outputs, and safety constraints. That makes skills auditable and gives reviewers something concrete to approve. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
As AI agents are increasingly being used not only to write code, but also to create and maintain agent skills, I think Project CodeGuard should consider adding a new rule focused on AgentSkills security.
Project CodeGuard already positions itself as a model-agnostic security framework for AI coding agents, with skills and rules intended to guide secure behavior before, during, and after code generation. It also already treats skills and rules as first-class artifacts in the workflow. That makes secure skill creation a natural extension of the project scope. :contentReference[oaicite:1]{index=1}
This feels especially timely because OWASP’s Agentic Skills Top 10 now highlights security risks specific to agent skills across ecosystems including OpenClaw, Claude Code, Cursor/Codex, and VS Code. OWASP describes skills as the execution/behavior layer that controls workflow orchestration, tool use, filesystem/network/shell access, safety guardrails, and persistent state, and it recommends practices such as verified publishers, automated scanning, permission review, and version pinning.
Why this could be useful
A CodeGuard rule here could provide best-practice guidance for agents when they are asked to create or modify skills, for example:
Possible scope
This could be introduced either as:
Open questions
I’d be interested in feedback on whether this fits the current CodeGuard roadmap and, if so, what the initial MVP for such a rule should include.
Beta Was this translation helpful? Give feedback.
All reactions