## Line Chart: Accuracy of Federated Learning Defense Mechanisms Under Poisoning Attacks
### Overview
This image is a line chart comparing the performance (accuracy) of seven different federated learning defense mechanisms as the ratio of malicious attacks increases. The chart demonstrates how each method's accuracy degrades under increasing adversarial pressure.
### Components/Axes
* **Chart Type:** Multi-line chart with markers.
* **X-Axis:** Labeled **"Attack ratio (%)"**. It represents the percentage of malicious participants or updates in the federated learning system. The axis has major tick marks at 0, 10, 20, 30, 40, and 50.
* **Y-Axis:** Labeled **"Accuracy (%)"**. It represents the model's classification accuracy. The axis ranges from approximately 57.5% to 76%, with major tick marks at 57.5, 60, 62.5, 65, 67.5, 70, 72.5, and 75.
* **Legend:** Located in the **bottom-left quadrant** of the plot area. It lists seven data series with corresponding line colors and marker styles:
1. **FedAvg** (Light blue line, square marker `□`)
2. **ShieldFL** (Orange line, plus marker `+`)
3. **PBFL** (Green line, upward-pointing triangle marker `△`)
4. **Median** (Purple line, circle marker `○`)
5. **Biscotti** (Gray line, left-pointing triangle marker `◁`)
6. **FoolsGold** (Brown line, downward-pointing triangle marker `▽`)
7. **Ours** (Red line, pentagon marker `⬠`)
### Detailed Analysis
The chart plots Accuracy (%) against Attack ratio (%). Below is an analysis of each data series, including approximate values and visual trends.
**Trend Verification & Data Points (Approximate):**
1. **FedAvg (Light blue, `□`):**
* **Trend:** Starts high, remains relatively stable until a 30% attack ratio, then declines sharply.
* **Points:** (0%, ~75.5%), (10%, ~75.0%), (20%, ~73.5%), (30%, ~73.5%), (40%, ~69.5%), (50%, ~68.5%).
2. **ShieldFL (Orange, `+`):**
* **Trend:** Starts as the highest, shows a steady, near-linear decline across all attack ratios.
* **Points:** (0%, ~76.0%), (10%, ~75.0%), (20%, ~73.5%), (30%, ~73.5%), (40%, ~69.0%), (50%, ~69.0%).
3. **PBFL (Green, `△`):**
* **Trend:** Starts very high, declines gradually and consistently.
* **Points:** (0%, ~75.5%), (10%, ~74.5%), (20%, ~74.0%), (30%, ~72.0%), (40%, ~71.0%), (50%, ~68.5%).
4. **Median (Purple, `○`):**
* **Trend:** Starts significantly lower than the top group, shows a very gradual, shallow decline.
* **Points:** (0%, ~68.5%), (10%, ~69.0%), (20%, ~68.0%), (30%, ~67.5%), (40%, ~64.5%), (50%, ~64.0%).
5. **Biscotti (Gray, `◁`):**
* **Trend:** Starts moderately high but exhibits the steepest and most consistent decline of all methods.
* **Points:** (0%, ~73.0%), (10%, ~70.0%), (20%, ~66.5%), (30%, ~64.5%), (40%, ~61.5%), (50%, ~56.5%).
6. **FoolsGold (Brown, `▽`):**
* **Trend:** Starts high, declines steadily, with a particularly sharp drop between 30% and 40% attack ratio.
* **Points:** (0%, ~73.5%), (10%, ~73.5%), (20%, ~72.5%), (30%, ~71.5%), (40%, ~67.0%), (50%, ~62.5%).
7. **Ours (Red, `⬠`):**
* **Trend:** Starts among the top group, maintains high accuracy robustly until a 30% attack ratio, then declines but remains competitive.
* **Points:** (0%, ~75.5%), (10%, ~74.0%), (20%, ~73.5%), (30%, ~73.5%), (40%, ~69.0%), (50%, ~68.5%).
### Key Observations
1. **Performance Hierarchy at Low Attack (0%):** ShieldFL, FedAvg, PBFL, and "Ours" are tightly clustered at the top (~75.5-76%). FoolsGold and Biscotti are slightly lower (~73-73.5%). Median is a clear outlier, starting much lower (~68.5%).
2. **Robustness to Increasing Attacks:** The method labeled **"Ours"** and **FedAvg** show a "plateau-then-drop" pattern, maintaining high accuracy up to a 30% attack ratio before declining. **ShieldFL** and **PBFL** show a more linear, graceful degradation.
3. **Most Vulnerable Method:** **Biscotti** demonstrates the poorest robustness, suffering the most severe and consistent drop in accuracy, falling from ~73% to ~56.5%.
4. **Critical Threshold:** A notable inflection point occurs for several methods (FedAvg, FoolsGold, "Ours") between **30% and 40% attack ratio**, where the rate of accuracy loss increases sharply.
5. **Final Ranking at 50% Attack:** The order from highest to lowest accuracy is approximately: ShieldFL ≈ PBFL ≈ FedAvg ≈ "Ours" (all ~68.5-69%) > Median (~64%) > FoolsGold (~62.5%) > Biscotti (~56.5%).
### Interpretation
This chart is a comparative robustness analysis for federated learning (FL) systems under data poisoning attacks. The "Attack ratio" simulates an adversary controlling a fraction of the participating clients or their model updates.
* **What the data suggests:** The proposed method ("Ours") is designed to be highly robust, matching the top-performing established defenses (ShieldFL, PBFL, FedAvg) in clean conditions and maintaining that performance under moderate attack pressure (up to 30%). Its behavior suggests it may incorporate mechanisms that effectively filter or neutralize malicious updates until they reach a critical mass.
* **Relationship between elements:** The chart directly correlates adversarial strength (x-axis) with system utility (y-axis). The diverging lines illustrate that different defense strategies have varying resilience profiles. Methods like Biscotti and FoolsGold appear more sensitive to the proportion of attackers, while Median, though starting lower, shows a flatter degradation curve, indicating a different, possibly more conservative, robustness strategy.
* **Notable Anomalies/Patterns:** The sharp performance cliff for multiple methods between 30-40% attack ratio is critical. It implies a phase transition where the defense mechanisms become overwhelmed, and the malicious updates begin to dominate the global model aggregation. The fact that "Ours," FedAvg, and ShieldFL converge to a similar accuracy at 50% attack suggests there may be a fundamental limit to robustness in this experimental setup when half the participants are malicious. The consistently lower performance of the Median aggregator, even at 0% attack, highlights the trade-off between robustness and final model accuracy in some defense strategies.