# Technical Document Extraction: Speedup on Different Model Sizes
## Chart Title
**Speedup on different model sizes**
## Axis Labels
- **X-Axis (Categories):** Model Size
- Vicuna-7B
- Zephyr-7B
- Vicuna-13B
- Vicuna-33B
- **Y-Axis (Quantitative):** Tokens per Second
## Legend
- **Blue Bars:** w/o Medusa
- **Orange Bars:** Medusa-2
## Data Points and Trends
1. **Vicuna-7B**
- w/o Medusa: 45 tokens/second
- Medusa-2: 130 tokens/second
- Speedup: **2.83x**
2. **Zephyr-7B**
- w/o Medusa: 40 tokens/second
- Medusa-2: 110 tokens/second
- Speedup: **2.66x**
3. **Vicuna-13B**
- w/o Medusa: 35 tokens/second
- Medusa-2: 100 tokens/second
- Speedup: **2.83x**
4. **Vicuna-33B**
- w/o Medusa: 18 tokens/second
- Medusa-2: 45 tokens/second
- Speedup: **2.35x**
## Observations
- **Speedup Consistency:**
- Vicuna-7B and Vicuna-13B exhibit identical speedup multipliers (**2.83x**) despite differing model sizes.
- Zephyr-7B shows a slightly lower speedup (**2.66x**) compared to Vicuna variants.
- Vicuna-33B has the lowest speedup (**2.35x**), indicating diminishing returns at larger model sizes.
- **Performance Gains:**
- Medusa-2 consistently outperforms the baseline (w/o Medusa) across all model sizes.
- Larger models (e.g., Vicuna-33B) show reduced proportional improvement, suggesting scalability limitations.
## Structural Notes
- **Bar Colors:**
- Blue (w/o Medusa) and orange (Medusa-2) bars are visually distinct, aligning with the legend.
- **Speedup Multipliers:**
- Embedded text above orange bars provides direct quantitative comparisons.
## Data Table Reconstruction
| Model Size | w/o Medusa (Tokens/sec) | Medusa-2 (Tokens/sec) | Speedup (x) |
|--------------|-------------------------|-----------------------|-------------|
| Vicuna-7B | 45 | 130 | 2.83 |
| Zephyr-7B | 40 | 110 | 2.66 |
| Vicuna-13B | 35 | 100 | 2.83 |
| Vicuna-33B | 18 | 45 | 2.35 |
## Conclusion
The chart demonstrates that Medusa-2 significantly accelerates token generation across all tested model sizes, with speedup diminishing as model complexity increases. Vicuna-7B and Vicuna-13B achieve the highest efficiency gains, while Vicuna-33B exhibits the weakest proportional improvement.