# Technical Document Extraction: Attention Key Query Score Chart
## Chart Title
**Attention Key Query Score, a=256**
---
## Axis Labels
- **X-Axis (Horizontal):**
Label: `Hidden Size`
Range: `0` to `32768` (logarithmic scale)
Tick Markers: `0`, `4096`, `8192`, `12288`, `16384`, `20480`, `24576`, `28672`, `32768`
- **Y-Axis (Vertical):**
Label: `Throughput (TFLOPs/s)`
Range: `0` to `100`
Tick Markers: `0`, `50`, `100`
---
## Legend
- **Placement:** Right side of the chart
- **Color-Coded Labels:**
- `1` (Blue)
- `2` (Orange)
- `4` (Green)
- `8` (Red)
- `16` (Purple)
- `32` (Brown)
- `64` (Pink)
- `h / a` (Key for ratio interpretation)
---
## Line Trends and Data Points
1. **Blue Line (h/a = 1):**
- **Trend:** Gradual upward slope with minor fluctuations.
- **Data Points:**
- At `Hidden Size = 32768`: `~35 TFLOPs/s`
2. **Orange Line (h/a = 2):**
- **Trend:** Similar to blue line but with sharper fluctuations.
- **Data Points:**
- At `Hidden Size = 32768`: `~70 TFLOPs/s`
3. **Green Line (h/a = 4):**
- **Trend:** Starts flat, sharp rise at `Hidden Size = 20480`, then stabilizes.
- **Data Points:**
- At `Hidden Size = 32768`: `~75 TFLOPs/s`
4. **Red Line (h/a = 8):**
- **Trend:** Steady upward slope.
- **Data Points:**
- At `Hidden Size = 32768`: `~85 TFLOPs/s`
5. **Purple Line (h/a = 16):**
- **Trend:** Steeper slope than red line.
- **Data Points:**
- At `Hidden Size = 32768`: `~95 TFLOPs/s`
6. **Brown Line (h/a = 32):**
- **Trend:** Steepest slope, linear increase.
- **Data Points:**
- At `Hidden Size = 32768`: `~90 TFLOPs/s`
7. **Pink Line (h/a = 64):**
- **Trend:** Highest slope, exceeds y-axis maximum.
- **Data Points:**
- At `Hidden Size = 32768`: `~105 TFLOPs/s`
---
## Key Observations
- **Inverse Relationship:** Higher `h/a` ratios (e.g., 64) achieve significantly higher throughput than lower ratios (e.g., 1).
- **Scalability:** Throughput increases non-linearly with `Hidden Size`, especially for larger `h/a` values.
- **Anomalies:** Green line (`h/a = 4`) shows a sharp spike at `Hidden Size = 20480`, suggesting an optimization point.
---
## Notes
- All textual information extracted from the chart.
- No additional languages or non-textual elements present.
- Data points extrapolated from visual trends; exact values may require interpolation.