Image 30f02166c25c...

EXPERT: gemini-3-flash-free VERSION 1

RUNTIME: nugit/gemini/gemini-3-flash-preview
INTEL_VERIFIED
# Technical Data Extraction: Model Compression Performance Comparison

This document contains a detailed extraction of data from a grouped bar chart comparing different model compression methods across two large language models (LLMs) and four task categories.

## 1. General Metadata
*   **Y-Axis Title:** Mean Accuracy (%)
*   **Y-Axis Scale:** 0 to 90 (increments of 10)
*   **X-Axis Categories (Tasks):** Coding, Math, Creative Writing, MC (Multiple Choice)
*   **Models Evaluated:**
    *   **GLM – 4.5 – Air** (Left Plot)
    *   **Qwen3 – 30B – A3B** (Right Plot)

## 2. Legend and Key
The chart uses a combination of colors and textures to represent different methods and compression ratios.

### Compression Ratio (Texture/Line)
*   **0%:** Indicated by a horizontal dashed grey line above each task group (Baseline).
*   **50%:** Indicated by bars with a **diagonal hatching pattern** (forward slash).
*   **25%:** Indicated by **solid color bars**.

### Methods (Color)
*   **Pruning methods:**
    *   **REAP (ours):** Blue
    *   **EAN:** Pink/Red
    *   **Frequency:** Green
*   **Merging methods:**
    *   **HC-SMoE:** Gold/Yellow
    *   **M-SMoE:** Light Blue/Cyan

---

## 3. Data Extraction: GLM – 4.5 – Air (Left Plot)

| Task | Baseline (0%) | Method | 25% Ratio (Solid) | 50% Ratio (Hatched) |
| :--- | :--- | :--- | :--- | :--- |
| **Coding** | ~60% | REAP (ours) | ~60% | ~55% |
| | | EAN | ~59% | ~51% |
| | | Frequency | ~57% | ~32% |
| | | HC-SMoE | ~58% | ~44% |
| | | M-SMoE | ~55% | ~30% |
| **Math** | ~88% | REAP (ours) | ~88% | ~81% |
| | | EAN | ~87% | ~61% |
| | | Frequency | ~87% | ~70% |
| | | HC-SMoE | ~87% | ~47% |
| | | M-SMoE | ~86% | ~39% |
| **Creative Writing** | ~84% | REAP (ours) | ~84% | ~75% |
| | | EAN | ~82% | ~70% |
| | | Frequency | ~79% | ~60% |
| | | HC-SMoE | ~78% | ~59% |
| | | M-SMoE | ~78% | ~39% |
| **MC** | ~75% | REAP (ours) | ~68% | ~56% |
| | | EAN | ~64% | ~51% |
| | | Frequency | ~65% | ~52% |
| | | HC-SMoE | ~70% | ~56% |
| | | M-SMoE | ~59% | ~45% |

---

## 4. Data Extraction: Qwen3 – 30B – A3B (Right Plot)

| Task | Baseline (0%) | Method | 25% Ratio (Solid) | 50% Ratio (Hatched) |
| :--- | :--- | :--- | :--- | :--- |
| **Coding** | ~58% | REAP (ours) | ~58% | ~56% |
| | | EAN | ~58% | ~55% |
| | | Frequency | ~58% | ~47% |
| | | HC-SMoE | ~53% | ~38% |
| | | M-SMoE | ~55% | ~41% |
| **Math** | ~89% | REAP (ours) | ~88% | ~86% |
| | | EAN | ~88% | ~87% |
| | | Frequency | ~88% | ~73% |
| | | HC-SMoE | ~85% | ~1% |
| | | M-SMoE | ~88% | ~72% |
| **Creative Writing** | ~81% | REAP (ours) | ~81% | ~72% |
| | | EAN | ~81% | ~70% |
| | | Frequency | ~81% | ~68% |
| | | HC-SMoE | ~50% | ~0% |
| | | M-SMoE | ~81% | ~72% |
| **MC** | ~72% | REAP (ours) | ~68% | ~55% |
| | | EAN | ~63% | ~53% |
| | | Frequency | ~63% | ~54% |
| | | HC-SMoE | ~67% | ~56% |
| | | M-SMoE | ~56% | ~45% |

---

## 5. Key Observations and Trends
*   **Performance Retention:** The "REAP (ours)" method consistently maintains the highest accuracy across almost all tasks and models, especially at the 50% compression ratio (hatched bars), where other methods often show significant degradation.
*   **Task Sensitivity:** "Math" and "Creative Writing" generally show higher baseline accuracies (~80-90%) compared to "Coding" and "MC" (~60-75%).
*   **Method Failure:** In the Qwen3 model, the "HC-SMoE" method shows a near-total failure (accuracy drops to near 0%) in the "Math" and "Creative Writing" tasks at the 50% compression ratio.
*   **Compression Impact:** Performance at 25% compression (solid bars) is generally very close to the 0% baseline for most methods, while 50% compression (hatched bars) reveals the robustness of the "REAP" method compared to "Frequency" or "Merging" methods.
DECODING INTELLIGENCE...
TECHNICAL ASSET FINGERPRINT

30f02166c25c436742e3831d

FOUND IN PAPERS

EXPERT: gemini-3-flash-free VERSION 1