# Technical Document Extraction: Latency Analysis Chart
## Chart Type
Stacked bar chart comparing latency distribution across three scenarios:
1. Small (h=2560, a=20)
2. Large (h=16384, a=128)
3. Large + Flash (h=16384, a=128)
## Axis Labels
- **Y-axis**: "Percentage of Latency (%)" (0-100 scale)
- **X-axis**: Scenario labels with their h and a parameter values
## Legend
Positioned at the top right. Color-coded components:
1. QKV (blue)
2. Flash (cyan)
3. Score (orange)
4. AOV (green)
5. Linproj (yellow)
6. MLP h to 4h (purple)
7. MLP 4h to h (gray)
8. Non-GEMM (red)
## Key Trends
1. **Small Scenario (h=2560, a=20)**:
   - Non-GEMM dominates (45%)
   - MLP h to 4h (20%) and MLP 4h to h (15%) significant
   - QKV (10%) and Flash (5%) minimal
2. **Large Scenario (h=16384, a=128)**:
   - QKV becomes dominant (35%)
   - MLP h to 4h (25%) and MLP 4h to h (20%) persist
   - Non-GEMM reduced to 10%
3. **Large + Flash Scenario (h=16384, a=128)**:
   - QKV dominates (40%)
   - Flash component appears (5%)
   - MLP h to 4h (25%) and MLP 4h to h (20%) remain
   - Non-GEMM minimal (5%)
## Data Point Verification
| Scenario | QKV | Flash | Score | AOV | Linproj | MLP h→4h | MLP 4h→h | Non-GEMM |
|-------------------|-----|-------|-------|-----|---------|----------|----------|----------|
| Small | 10% | 5% | 10% | 5% | 5% | 20% | 15% | 45% |
| Large | 35% | 0% | 2% | 1% | 5% | 25% | 20% | 10% |
| Large + Flash | 40% | 5% | 2% | 1% | 5% | 25% | 20% | 5% |
## Spatial Grounding
- Legend positioned at [x=0.85, y=0.95] (top-right corner)
- Color consistency verified: All legend colors match bar segments exactly
## Component Isolation
1. **Header**: Chart title "Percentage of Latency (%)" (same text as the Y-axis label)
2. **Main Chart**:
   - Three vertical bars with segmented color coding
   - Each segment's height corresponds to its percentage contribution
3. **Footer**:
   - X-axis scenario labels with their h and a parameter values
   - Y-axis scale (0-100%)
## Language Note
All text appears in English. No non-English content detected.
## Trend Verification Logic
- Non-GEMM dominance in the Small scenario confirmed by red segment height (45%, the largest segment)
- QKV growth confirmed by the increasing blue segment (10% in Small, 35% in Large, 40% in Large + Flash)
- Flash (cyan segment) is absent in Large (0%) and present in Large + Flash (5%); the data table also lists 5% for Small
- MLP components stay within a narrow range across scenarios (h to 4h: 20-25%; 4h to h: 15-20%)
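The trend checks above can also be run mechanically against the transcribed table. The following is a minimal sketch, not part of the original extraction: `SHARES` and the helper functions are hypothetical names, and the values are copied from the Data Point Verification table.

```python
# Percentages transcribed from the Data Point Verification table.
# SHARES and all helper names below are illustrative, not from the source.
SHARES = {
    "Small": {"QKV": 10, "Flash": 5, "Score": 10, "AOV": 5, "Linproj": 5,
              "MLP h→4h": 20, "MLP 4h→h": 15, "Non-GEMM": 45},
    "Large": {"QKV": 35, "Flash": 0, "Score": 2, "AOV": 1, "Linproj": 5,
              "MLP h→4h": 25, "MLP 4h→h": 20, "Non-GEMM": 10},
    "Large + Flash": {"QKV": 40, "Flash": 5, "Score": 2, "AOV": 1,
                      "Linproj": 5, "MLP h→4h": 25, "MLP 4h→h": 20,
                      "Non-GEMM": 5},
}


def dominant_component(scenario):
    """Return the component with the largest share in a scenario."""
    row = SHARES[scenario]
    return max(row, key=row.get)


def component_series(component):
    """Return a component's share in table order (Small, Large, Large + Flash)."""
    return [row[component] for row in SHARES.values()]


def is_strictly_increasing(values):
    """True if every value is larger than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))


def spread(component):
    """Return the max-minus-min share of a component across scenarios."""
    series = component_series(component)
    return max(series) - min(series)


def scenarios_with(component):
    """Return the scenarios in which a component has a non-zero share."""
    return [name for name, row in SHARES.items() if row[component] > 0]


if __name__ == "__main__":
    for name in SHARES:
        print(name, "->", dominant_component(name))
    print("QKV increasing:", is_strictly_increasing(component_series("QKV")))
    print("Flash present in:", scenarios_with("Flash"))
```

On the table as transcribed, this reports Non-GEMM as the largest segment for Small and QKV for both Large scenarios, a strictly increasing QKV share, and a 5-point spread for each MLP component. It also reports a non-zero Flash share for both Small and Large + Flash.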