Skip to content

chore(llm): Better defense against prompt injection in triage skill#19410

Merged
chargome merged 8 commits intodevelopfrom
cg/multilingual-prompt-injection
Feb 20, 2026
Merged

chore(llm): Better defense against prompt injection in triage skill#19410
chargome merged 8 commits intodevelopfrom
cg/multilingual-prompt-injection

Conversation

@chargome
Copy link
Member

@chargome chargome commented Feb 19, 2026

Adds

  • Language filter: Reject non-English issues (detects accented characters)
  • Injection detection: Scan for malicious patterns with confidence scoring

Closes #19411 (added automatically)

@github-actions
Copy link
Contributor

github-actions bot commented Feb 19, 2026

Codecov Results 📊


Generated by Codecov Action

@github-actions
Copy link
Contributor

github-actions bot commented Feb 19, 2026

size-limit report 📦

Path Size % Change Change
@sentry/browser 25.61 kB - -
@sentry/browser - with treeshaking flags 24.12 kB - -
@sentry/browser (incl. Tracing) 42.42 kB - -
@sentry/browser (incl. Tracing, Profiling) 47.08 kB - -
@sentry/browser (incl. Tracing, Replay) 81.24 kB - -
@sentry/browser (incl. Tracing, Replay) - with treeshaking flags 70.86 kB - -
@sentry/browser (incl. Tracing, Replay with Canvas) 85.93 kB - -
@sentry/browser (incl. Tracing, Replay, Feedback) 98.09 kB - -
@sentry/browser (incl. Feedback) 42.33 kB - -
@sentry/browser (incl. sendFeedback) 30.28 kB - -
@sentry/browser (incl. FeedbackAsync) 35.28 kB - -
@sentry/browser (incl. Metrics) 26.78 kB - -
@sentry/browser (incl. Logs) 26.92 kB - -
@sentry/browser (incl. Metrics & Logs) 27.6 kB - -
@sentry/react 27.37 kB - -
@sentry/react (incl. Tracing) 44.76 kB - -
@sentry/vue 30.06 kB - -
@sentry/vue (incl. Tracing) 44.26 kB - -
@sentry/svelte 25.64 kB - -
CDN Bundle 28.16 kB - -
CDN Bundle (incl. Tracing) 43.25 kB - -
CDN Bundle (incl. Logs, Metrics) 29 kB - -
CDN Bundle (incl. Tracing, Logs, Metrics) 44.09 kB - -
CDN Bundle (incl. Replay, Logs, Metrics) 68.08 kB - -
CDN Bundle (incl. Tracing, Replay) 80.12 kB - -
CDN Bundle (incl. Tracing, Replay, Logs, Metrics) 80.99 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback) 85.56 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback, Logs, Metrics) 86.46 kB - -
CDN Bundle - uncompressed 82.33 kB - -
CDN Bundle (incl. Tracing) - uncompressed 128.05 kB - -
CDN Bundle (incl. Logs, Metrics) - uncompressed 85.17 kB - -
CDN Bundle (incl. Tracing, Logs, Metrics) - uncompressed 130.88 kB - -
CDN Bundle (incl. Replay, Logs, Metrics) - uncompressed 208.83 kB - -
CDN Bundle (incl. Tracing, Replay) - uncompressed 244.93 kB - -
CDN Bundle (incl. Tracing, Replay, Logs, Metrics) - uncompressed 247.75 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback) - uncompressed 257.73 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback, Logs, Metrics) - uncompressed 260.54 kB - -
@sentry/nextjs (client) 47.17 kB - -
@sentry/sveltekit (client) 42.88 kB - -
@sentry/node-core 52.18 kB +0.02% +8 B 🔺
@sentry/node 166.54 kB +0.01% +7 B 🔺
@sentry/node - without tracing 93.97 kB +0.01% +9 B 🔺
@sentry/aws-serverless 109.47 kB +0.01% +7 B 🔺

View base workflow run

@github-actions
Copy link
Contributor

github-actions bot commented Feb 19, 2026

node-overhead report 🧳

Note: This is a synthetic benchmark with a minimal express app and does not necessarily reflect the real-world performance impact in an application.

Scenario Requests/s % of Baseline Prev. Requests/s Change %
GET Baseline 8,834 - 8,690 +2%
GET With Sentry 1,577 18% 1,571 +0%
GET With Sentry (error only) 6,009 68% 6,075 -1%
POST Baseline 1,188 - 1,152 +3%
POST With Sentry 554 47% 567 -2%
POST With Sentry (error only) 1,022 86% 1,027 -0%
MYSQL Baseline 3,263 - 3,244 +1%
MYSQL With Sentry 421 13% 410 +3%
MYSQL With Sentry (error only) 2,651 81% 2,643 +0%

View base workflow run

Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.

(r"\b(ignore|disregard|forget)\s+(all\s+)?(previous|prior|above)\s+(instructions?|prompts?|rules?)", 8, "Instruction override"),

# Prompt extraction (8 points)
(r"\b(show|reveal|display|output|print)\s+(your\s+)?(system\s+)?(prompt|instructions?)", 8, "Prompt extraction attempt"),
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Prompt extraction regex matches common English phrases

High Severity

The "Prompt extraction attempt" pattern \b(show|reveal|display|output|print)\s+(your\s+)?(system\s+)?(prompt|instructions?) scores 8 points — exactly the rejection threshold — but both (your\s+)? and (system\s+)? are optional. This means perfectly innocuous phrases like "print instructions", "show instructions", or "display instructions" alone trigger full rejection. The project's own docs/changelog/v8.md even uses "we print instructions on how to prepare for the next major", so an issue quoting or paraphrasing that text would be falsely rejected. The intent was to catch "show your system prompt" or "reveal your instructions" but the double-optional qualifiers make the pattern far too broad for its high confidence score.

Fix in Cursor Fix in Web

@chargome chargome merged commit 4ee0fea into develop Feb 20, 2026
224 of 225 checks passed
@chargome chargome deleted the cg/multilingual-prompt-injection branch February 20, 2026 12:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

chore(llm): Better defense against prompt injection in triage skill

2 participants

Comments