Blog

AI Agents Have a Supply Chain Problem Now

Agent skills are not just helpful instructions. They are a new software supply chain with access to code, credentials, tools, memory, and consequential workflows. Enterprises need to govern them before attackers do it for them.

2026-06-097 min readAI agentsagent securitysoftware supply chainapplication securityauthorizationenterprise AI

The skill ecosystem is already security-relevant

AI coding agents are no longer just autocomplete. Claude Code, Codex, Cursor, Gemini CLI, Copilot-style workflows, and local agent runtimes are becoming operating surfaces for software work.

That shift creates a second ecosystem around the agent: skills, recipes, context files, plugins, MCP servers, task packs, and reusable instructions that tell the agent how to behave in a particular environment.

The market wants to treat those artifacts as productivity accelerators. Security teams should treat them as a supply chain.

Skills are not just documentation

A normal README can mislead a developer. An agent skill can mislead the operator that reads the README, writes the code, opens files, runs commands, calls tools, and sends results elsewhere.

That is the difference. Agent skills do not merely describe work. They can shape the behavior of a system that has delegated authority inside a real environment.

If the agent can read repositories, see environment variables, use a shell, open a browser, create pull requests, send messages, or remember state across sessions, then a compromised skill is not a bad prompt. It is a potentially privileged dependency.

The numbers should get attention

Snyk's ToxicSkills research scanned 3,984 skills from ClawHub and skills.sh as of February 5, 2026. The results were not subtle.

  • 534 skills — 13.4% of the corpus — had at least one critical issue.
  • 1,467 skills — 36.82% — had some security flaw.
  • Snyk's researchers human-validated 76 malicious payloads.
  • 91% of malicious skills combined prompt injection with traditional malware behavior.
  • 8 malicious skills were still publicly available when the report was published.

This is software supply chain risk with a language layer

The familiar part is software supply chain compromise: obfuscated commands, credential theft, malware distribution, dependency trust, publisher identity, and update risk.

The new part is that the malicious instruction can live in language. A poisoned skill can tell the agent to hide behavior in normal outputs, append secrets to URLs, reinterpret user intent, or treat invisible Unicode as executable instruction. The attack does not have to look like code to become operational.

That makes the old controls necessary but incomplete. Static scanning matters. Repository provenance matters. Pinning versions matters. But teams also need to reason about what the agent is allowed to do after the skill has shaped its behavior.

The privilege level is the real issue

The risk is not that a skill contains strange text. The risk is that the agent consuming that text may sit near credentials, source code, customer data, deployment tools, ticketing systems, chat channels, and internal documents.

A compromised npm package can execute with the privileges granted to the install or runtime path. A compromised agent skill can influence a system that interprets work, selects tools, decides what evidence matters, and moves between systems in natural language.

That is why the question has to be control-plane specific: what can this AI-assisted system actually cause to happen if a skill is malicious, stale, or quietly updated?

The market response is useful but not enough

The early tool response is real. NVIDIA released SkillSpector. Cisco released skill-scanner. Snyk open-sourced agent-scan. Other vendors now offer skill-checking and agent-security scan surfaces.

Those tools are useful. They are also mostly point-in-time scanners. They do not, by themselves, solve publisher trust, continuous monitoring, install-time governance, enterprise policy, cross-registry aggregation, or the question of which skills are safe for which authority envelope.

That is the missing layer: not another scanner alone, but a trust and governance model for the agent skill supply chain.

What enterprise teams should do now

Teams do not need to wait for the market to settle before reducing risk. The first controls are boring and available.

  • Inventory every skill, plugin, MCP server, context pack, and agent instruction artifact used in development and operations.
  • Classify each artifact by the authority of the agent that consumes it: read-only assistance is different from shell access, credential access, production deployment, or customer communication.
  • Pin versions and review updates instead of allowing silent auto-upgrade of agent behavior.
  • Scan skills before install and after update, but do not mistake scanning for authorization design.
  • Prefer known publishers, signed releases, transparent repositories, and reproducible provenance over marketplace convenience.
  • Separate high-risk tools from general agent context so a poisoned skill cannot talk its way into money, identity, production, recovery, or trust-state mutation.
  • Log skill-mediated tool use as security-relevant behavior, not just developer productivity telemetry.

The bottom line

Agent skills are becoming part of how work gets done. That means they are becoming part of how compromise happens.

The right lesson is not to panic about every reusable instruction file. The lesson is to stop treating agent behavior as if it lives outside the software supply chain.

If an artifact can change how a system reads code, handles credentials, calls tools, writes messages, or mutates operational state, it belongs inside the security model.

AI agents have a supply chain problem now. The organizations that govern it early will move faster later, because they will know what their agents are allowed to trust — and what they are not allowed to cause.

Talk it through

Need help translating the lesson into operating discipline?

If you want to turn this into a budget, review, or rollout pattern that actually survives contact with the team, Luis can help.

Contact Luis