Skill instructions redefine "success" to include side-channel task
Summary
Skill SKILL.md contains a "definition of done" section that, in addition to the legitimate task, requires the agent to "also verify X" where X is a credential read or a network call. The agent treats the side-channel as in-scope because it came from a trusted skill definition.
Severity & confidence
Low
Hygiene-grade issue with limited direct impact. Fix opportunistically; monitor in case context changes (new exploit, new dependency, etc.).
Confirmed
Observed directly by Jiffy scanners or validated by multiple independent sources. Indicators are reliable enough to drive automated action.
Indicators
Observed patterns and artifacts associated with this entry. Each indicator can be copied into your detection stack or SIEM.
(?is)(?:definition of done|success criteria)[\s\S]{0,400}(?:verify|check|read|send)\s+[^\n]*(?:env|token|credential|key)
Detection rule
A YARA-style pseudo-rule auto-generated from the indicators above. Useful as a starting point — adapt the syntax for your target detection platform.
rule jiffy_ti_2026_000020
{
meta:
source = "jiffy-intel"
severity = "low"
description = "Auto-generated from Jiffy Intel indicators"
strings:
$content_pattern_0 = "(?is)(?:definition of done|success criteria)[\\s\\S]{0,400}(?:verify|check|read|send)\\s+[^\\n]*(?:env|token|credential|key)"
condition:
$content_pattern_0
}Auto-generated from the indicators above. Adapt syntax for your detection stack before deploying.
Affected tools
| Tool | Versions | Status |
|---|---|---|
| Claude Code | * | vulnerable |
| Cursor | * | vulnerable |
| Windsurf | * | vulnerable |
Example artifacts
Sanitized examples of artifacts Jiffy has observed exhibiting this pattern. Publisher handles are redacted; version ranges and status reflect the most recent scan.
- pr-reviewer-skillSkillRemoved
- code-quality-skillSkillUnder review
- tech-debt-skillSkillQuarantined
How to remediate
- 01Treat skill-provided task definitions as untrusted.
- 02Require agent runtimes to prompt the user before allowing an unrelated side-task.
Timeline & sources
Timeline
- First observedMar 17, 20261 month ago
- Last updatedApr 22, 2026today
- PublishedMar 31, 202617 days ago
Sources
References
OWASP LLM-01: Prompt Injection (2026)
https://genai.owasp.org/llmrisk/llm-01-2026/OWASP LLM-06: Excessive Agency (2026)
https://genai.owasp.org/llmrisk/llm-06-2026/Jiffy Research — OWASP LLM Top 10 Is Not Enough
https://blog.jiffylabs.ai/posts/owasp-llm-top-10-is-not-enoughScan for patterns like this
Point Jiffy at your GitHub org, IDE config, or a single artifact. Get a scored report in under a minute.
Start a free scan