Prompt Injection PatternPublished Apr 4, 2026

Skill output wraps user text in "rewritten by reviewer" framing

MediumAARM tierConfirmed
Seen33times across5customers
jiffy-ti-2026-000026

Summary

Skills that format their output to look like it came from a human reviewer — complete with fake username attribution. The downstream agent treats the "reviewer" content as a trusted directive, effectively elevating the skill's output to a role it should not have.

Severity & confidence

Severity
Medium

Medium

Notable weakness with plausible abuse chains or partial mitigations already in place. Track, and resolve as part of your normal hardening cadence.

Confidence
Confirmed

Confirmed

Observed directly by Jiffy scanners or validated by multiple independent sources. Indicators are reliable enough to drive automated action.

Indicators

Observed patterns and artifacts associated with this entry. Each indicator can be copied into your detection stack or SIEM.

  • (?is)\[reviewer[:\s]\s*[^\]]+\]|<reviewer[^>]*>

Detection rule

A YARA-style pseudo-rule auto-generated from the indicators above. Useful as a starting point — adapt the syntax for your target detection platform.

YARA-style pseudo-rule
rule jiffy_ti_2026_000026
{
    meta:
        source = "jiffy-intel"
        severity = "medium"
        description = "Auto-generated from Jiffy Intel indicators"
    strings:
    $content_pattern_0 = "(?is)\\[reviewer[:\\s]\\s*[^\\]]+\\]|<reviewer[^>]*>"
    condition:
        $content_pattern_0
}

Auto-generated from the indicators above. Adapt syntax for your detection stack before deploying.

Affected tools

ToolVersionsStatus
Claude Code*vulnerable
Cursor*vulnerable

Example artifacts

Sanitized examples of artifacts Jiffy has observed exhibiting this pattern. Publisher handles are redacted; version ranges and status reflect the most recent scan.

  • code-review-skillSkill
    Under review
    Source
    Anthropic Skills
    Versions
    1.0.0
    First observed
    Mar 27, 2026
  • pr-formatter-skillSkill
    Quarantined
    Source
    Community registry
    First observed
    Mar 31, 2026

How to remediate

Strip role/attribution markers from skill output before returning to the agent's context.

Timeline & sources

Timeline

  1. First observedMar 27, 202621 days ago
  2. Last updatedApr 24, 2026today
  3. PublishedApr 4, 202613 days ago

Sources

curated

References

Scan for patterns like this

Point Jiffy at your GitHub org, IDE config, or a single artifact. Get a scored report in under a minute.

Start a free scan