# Technical Document Extraction: Performance Analysis Charts
## Chart 1: Stage-Based Performance Comparison
### Axes and Labels
- **X-axis (Stage):** CoT, APO, 1PO, 2TO, 3PO
- **Y-axis (Performance %):** 55–80 range
### Data Points
| Stage | Performance (%) |
|-------|-----------------|
| CoT | 63.5 |
| APO | 68.0 |
| 1PO | 74.2 |
| 2TO | 77.3 |
| 3PO | 78.8 |
### Legend
- **Colors:**
- Pink (#FF69B4) for CoT
- Light Green (#90EE90) for APO
- Purple (#E6E6FA) for 1PO
- Teal (#008080) for 2TO
- Light Blue (#ADD8E6) for 3PO
## Chart 2: Ablation Study Performance
### Axes and Labels
- **X-axis (Ablation):** w/o PO, w/o Prune, PO→TO
- **Y-axis (Performance %):** 62–68 range
### Box Plot Components
| Ablation | Median | Q1–Q3 Range | Outliers |
|---------------|--------|-------------|----------|
| w/o PO | ~64 | 63–65 | 62–66 |
| w/o Prune | ~66 | 65–67 | 64–68 |
| PO→TO | ~67 | 66–68 | 65–68 |
### Legend
- **Colors:**
- Pink (#FF69B4) for w/o PO
- Light Green (#90EE90) for w/o Prune
- Purple (#E6E6FA) for PO→TO
## Key Observations
1. **Stage Progression:** Performance increases monotonically from CoT (63.5%) to 3PO (78.8%).
2. **Ablation Impact:**
- Removing PO (w/o PO) results in the lowest performance (~64%).
- PO→TO ablation achieves the highest performance (~67%).
- Removing Prune (w/o Prune) shows intermediate performance (~66%).
## Color Consistency Check
- All legend colors match bar/box colors in both charts.
- No discrepancies detected between legend labels and visual elements.