# Technical Document Extraction: Translation Task Performance Analysis
## Chart 1: Loss Objective Comparison
### Axes and Labels
- **X-axis**: Loss Objective
Categories:
- `L_prefer`
- `L_NLL`
- `L_prefer + L_NLL`
- **Y-axis**: Avg. Ref-free Eval (80–90)
- **Legend**:
- `xx→en` (Blue)
- `en→xx` (Orange)
### Data Points and Trends
1. **`L_prefer`**
- `xx→en`: 82.81 (Blue)
- `en→xx`: 85.50 (Orange)
*Trend*: Orange bar exceeds blue by 2.69 points.
2. **`L_NLL`**
- `xx→en`: 83.78 (Blue)
- `en→xx`: 85.84 (Orange)
*Trend*: Orange bar exceeds blue by 2.06 points.
3. **`L_prefer + L_NLL`**
- `xx→en`: 84.29 (Blue)
- `en→xx`: 87.71 (Orange)
*Trend*: Orange bar exceeds blue by 3.42 points.
### Key Observations
- `en→xx` consistently outperforms `xx→en` across all loss objectives.
- Combining `L_prefer` and `L_NLL` yields the highest performance for both directions.
---
## Chart 2: Preference Data Comparison
### Axes and Labels
- **X-axis**: Preference Data
Categories:
- `ALMA + Ref`
- `GPT-4 + Ref`
- `All of them`
- **Y-axis**: Avg. Ref-free Eval (80–90)
- **Legend**:
- `xx→en` (Blue)
- `en→xx` (Orange)
### Data Points and Trends
1. **`ALMA + Ref`**
- `xx→en`: 83.70 (Blue)
- `en→xx`: 86.99 (Orange)
*Trend*: Orange bar exceeds blue by 3.29 points.
2. **`GPT-4 + Ref`**
- `xx→en`: 84.20 (Blue)
- `en→xx`: 86.66 (Orange)
*Trend*: Orange bar exceeds blue by 2.46 points.
3. **`All of them`**
- `xx→en`: 84.29 (Blue)
- `en→xx`: 87.71 (Orange)
*Trend*: Orange bar exceeds blue by 3.42 points.
### Key Observations
- `en→xx` outperforms `xx→en` in all preference data configurations.
- `All of them` achieves the highest performance for both directions.
---
## Cross-Chart Analysis
- **Language Direction**:
- `en→xx` (orange) consistently outperforms `xx→en` (blue) in both charts.
- **Performance Gains**:
- Combining loss objectives (`L_prefer + L_NLL`) and preference data (`All of them`) maximizes performance.
- **Notable Values**:
- Highest `en→xx` score: 87.71 (both charts).
- Highest `xx→en` score: 84.29 (both charts).
## Notes
- No non-English text detected in labels or axis titles.
- All numerical values extracted directly from bar annotations.
- Legend colors (blue/orange) match bar colors in both charts.