# Technical Data Extraction: Performance Analysis of Qwen3 Models
## 1. Document Overview
This image is a line graph illustrating the relationship between an "Induced Error Rate" and "Turn 100 Accuracy" for four different versions of the "Qwen3" model family. The chart includes error bars for each data point, indicating variability or confidence intervals.
## 2. Component Isolation
### A. Header/Axes
* **Y-Axis Label:** `Turn 100 Accuracy`
* **Y-Axis Scale:** Linear, ranging from `0.0` to `1.0` with major tick marks every `0.2`.
* **X-Axis Label:** `Induced Error Rate`
* **X-Axis Scale:** Linear, ranging from `0.00` to `1.00` with major tick marks at `0.00`, `0.25`, `0.50`, `0.75`, and `1.00`.
### B. Legend (Spatial Grounding: Right Margin)
The legend is located on the right side of the plot area. It uses a color-coded gradient of blue to distinguish between model sizes.
* **Lightest Blue:** `Qwen3-4b`
* **Light-Medium Blue:** `Qwen3-8b`
* **Medium-Dark Blue:** `Qwen3-14b`
* **Darkest Blue:** `Qwen3-32b`
## 3. Data Series Analysis and Trend Verification
### Series 1: Qwen3-4b (Lightest Blue)
* **Visual Trend:** This series generally maintains the lowest accuracy relative to the others. It shows a slight decline from 0.00 to 0.25, stays relatively flat to 0.50, dips at 0.75, and recovers at 1.00.
* **Data Points (Approximate):**
* 0.00: ~0.70
* 0.25: ~0.66
* 0.50: ~0.76
* 0.75: ~0.62
* 1.00: ~0.76
### Series 2: Qwen3-8b (Light-Medium Blue)
* **Visual Trend:** Shows moderate volatility. It starts mid-range, dips slightly at 0.25, peaks significantly at 0.50, drops at 0.75, and rises again at 1.00.
* **Data Points (Approximate):**
* 0.00: ~0.76
* 0.25: ~0.74
* 0.50: ~0.86
* 0.75: ~0.76
* 1.00: ~0.84
### Series 3: Qwen3-14b (Medium-Dark Blue)
* **Visual Trend:** Relatively stable compared to the others. It starts at a similar level to the 8b model, experiences a slight downward trend until 0.75, and then sharpens upward at 1.00.
* **Data Points (Approximate):**
* 0.00: ~0.78
* 0.25: ~0.72
* 0.50: ~0.74
* 0.75: ~0.70
* 1.00: ~0.86
### Series 4: Qwen3-32b (Darkest Blue)
* **Visual Trend:** Highly volatile. It starts with high accuracy, peaks at 0.25, suffers a major drop at 0.50 (becoming the lowest performer at that specific point), and then recovers to a high plateau for 0.75 and 1.00.
* **Data Points (Approximate):**
* 0.00: ~0.84
* 0.25: ~0.94
* 0.50: ~0.66
* 0.75: ~0.88
* 1.00: ~0.88
## 4. Reconstructed Data Table (Estimated Values)
| Induced Error Rate | Qwen3-4b (Acc) | Qwen3-8b (Acc) | Qwen3-14b (Acc) | Qwen3-32b (Acc) |
| :--- | :--- | :--- | :--- | :--- |
| **0.00** | 0.70 | 0.76 | 0.78 | 0.84 |
| **0.25** | 0.66 | 0.74 | 0.72 | 0.94 |
| **0.50** | 0.76 | 0.86 | 0.74 | 0.66 |
| **0.75** | 0.62 | 0.76 | 0.70 | 0.88 |
| **1.00** | 0.76 | 0.84 | 0.86 | 0.88 |
## 5. Key Observations
* **Non-Linear Correlation:** There is no simple linear correlation between the Induced Error Rate and Accuracy across the models. Performance fluctuates significantly as the error rate increases.
* **Model Scaling:** Generally, the larger models (32b and 14b) perform better than the smaller models (4b), but this is inconsistent. For example, at an Error Rate of 0.50, the 8b model outperforms all others, while the 32b model performs significantly worse than its own baseline.
* **Error Bars:** All data points include vertical error bars extending approximately ±0.05 to ±0.10 from the mean, suggesting a notable margin of error or variance in the testing results.