## Grouped Bar Chart: Attack Success Rate (ASR) by Prompt Injection Attack Type and Defense Mechanism
### Overview
The image displays a 2x6 grid of grouped bar charts. Each subplot represents a specific text-based prompt injection attack technique. Within each subplot, the performance of five different AI security defense mechanisms is compared based on their Attack Success Rate (ASR). The ASR is a metric where a higher value indicates the attack was more successful against the defense, implying the defense was less effective.
### Components/Axes
* **Y-Axis (Common to all subplots):** Labeled "Attack Success Rate (ASR)". The scale runs from 0.0 to 1.0, with major gridlines at intervals of 0.2.
* **X-Axis (Per subplot):** The categorical variable is the defense mechanism, represented by colored bars. The specific defense names are not labeled on the x-axis but are defined in the legend.
* **Legend:** Located at the bottom center of the entire figure. It maps colors to defense mechanisms:
* **Teal/Green:** Azure Prompt Shield
* **Blue-Grey:** Protect AI v1
* **Light Green:** Meta Prompt Guard
* **Yellow:** Vijil Prompt Injection
* **Pink:** Protect AI v2
* **Subplot Titles (Attack Types):** Each of the 12 subplots has a title at the top:
1. Deletion Characters
2. Diacritics
3. Emoji Smuggling
4. Full Width Text
5. Homoglyphs
6. Numbers
7. Bidirectional Text
8. Spaces
9. Underline Accent Marks
10. Unicode Tags Smuggling
11. Upside Down Text
12. Zero Width
### Detailed Analysis
Below is the approximate ASR for each defense mechanism (color) within each attack type subplot. Values are estimated from bar height relative to the y-axis gridlines.
**Top Row (Left to Right):**
1. **Deletion Characters:**
* Azure Prompt Shield: ~0.5
* Protect AI v1: ~0.72
* Meta Prompt Guard: ~0.0 (bar not visible)
* Vijil Prompt Injection: ~0.0 (bar not visible)
* Protect AI v2: ~0.12
2. **Diacritics:**
* Azure Prompt Shield: ~0.38
* Protect AI v1: ~0.86
* Meta Prompt Guard: ~0.93
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.0 (bar not visible)
3. **Emoji Smuggling:**
* Azure Prompt Shield: ~1.0
* Protect AI v1: ~1.0
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~1.0
4. **Full Width Text:**
* Azure Prompt Shield: ~0.5
* Protect AI v1: ~0.73
* Meta Prompt Guard: ~0.0 (bar not visible)
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.17
5. **Homoglyphs:**
* Azure Prompt Shield: ~1.0
* Protect AI v1: ~0.92
* Meta Prompt Guard: ~0.49
* Vijil Prompt Injection: ~0.59
* Protect AI v2: ~0.0 (bar not visible)
6. **Numbers:**
* Azure Prompt Shield: ~0.99
* Protect AI v1: ~0.94
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.13
**Bottom Row (Left to Right):**
7. **Bidirectional Text:**
* Azure Prompt Shield: ~1.0
* Protect AI v1: ~0.93
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.0 (bar not visible)
8. **Spaces:**
* Azure Prompt Shield: ~0.83
* Protect AI v1: ~0.09
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.0 (bar not visible)
9. **Underline Accent Marks:**
* Azure Prompt Shield: ~0.93
* Protect AI v1: ~0.98
* Meta Prompt Guard: ~0.03
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.0 (bar not visible)
10. **Unicode Tags Smuggling:**
* Azure Prompt Shield: ~0.5
* Protect AI v1: ~1.0
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~1.0
11. **Upside Down Text:**
* Azure Prompt Shield: ~0.17
* Protect AI v1: ~1.0
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~1.0
* Protect AI v2: ~0.0 (bar not visible)
12. **Zero Width:**
* Azure Prompt Shield: ~0.83
* Protect AI v1: ~0.09
* Meta Prompt Guard: ~1.0
* Vijil Prompt Injection: ~0.97
* Protect AI v2: ~0.0 (bar not visible)
### Key Observations
* **Universal Vulnerability:** The "Emoji Smuggling" attack achieves a perfect or near-perfect ASR of ~1.0 against all five tested defenses.
* **Defense-Specific Strengths:**
* **Protect AI v2 (Pink)** shows very low ASR (often ~0.0 or <0.2) against most attacks, with the notable exceptions of "Emoji Smuggling" and "Unicode Tags Smuggling" where it fails completely (ASR ~1.0).
* **Meta Prompt Guard (Light Green)** is highly effective (ASR ~0.0) against "Deletion Characters", "Full Width Text", "Underline Accent Marks", and "Spaces", but is completely bypassed (ASR ~1.0) by "Bidirectional Text", "Numbers", "Spaces" (contradictory), "Unicode Tags Smuggling", "Upside Down Text", and "Zero Width".
* **Azure Prompt Shield (Teal)** has mixed results, performing best against "Upside Down Text" (~0.17) but failing against "Bidirectional Text", "Emoji Smuggling", "Homoglyphs", and "Numbers" (all ~1.0).
* **Attack Effectiveness:** "Bidirectional Text", "Emoji Smuggling", "Numbers", and "Unicode Tags Smuggling" are highly effective, achieving ASR ~1.0 against 4 out of 5 defenses in most cases.
* **Notable Anomaly:** The "Spaces" attack shows a drastic difference in effectiveness: it is highly successful against Azure, Meta, and Vijil defenses (ASR >0.8) but almost completely ineffective against Protect AI v1 (ASR ~0.09).
### Interpretation
This chart provides a comparative security analysis of different AI prompt injection attack vectors and defense mechanisms. The data suggests that:
1. **No Universal Defense:** No single defense mechanism is robust against all tested attack types. Each has specific vulnerabilities. Protect AI v2 appears the most resilient overall but has critical failures against specific, sophisticated encoding-based attacks (Emoji, Unicode Tags).
2. **Attack Sophistication Matters:** Attacks that exploit Unicode encoding, bidirectional text, or invisible characters (Zero Width) are generally more successful than simpler character manipulations. The perfect score for "Emoji Smuggling" across the board indicates it is a particularly potent and difficult-to-defend-against technique.
3. **Defense Specialization:** The stark contrasts in performance (e.g., Meta Prompt Guard's perfect defense vs. perfect failure on different attacks) imply that these defenses likely use different underlying methodologies—some may focus on pattern matching for specific character sets, while others might analyze semantic meaning, leading to blind spots.
4. **Practical Implication:** For a system designer, this chart argues for a **layered defense strategy**. Relying on a single vendor's solution (e.g., only using Azure Prompt Shield) would leave the system vulnerable to numerous attack vectors. Combining defenses that have complementary strengths (e.g., one strong against encoding attacks, another against structural text attacks) would provide more comprehensive protection.
The investigation reveals an ongoing "arms race" in AI security, where new attack methods constantly probe for weaknesses in defense systems, and no solution is yet foolproof.