# Chart Analysis: Entity Frequency Distribution
## Chart Type
Line chart comparing the entity frequency distributions of three series: ZeroGen, DemoGen, and Ground Truth.
## Axis Labels
- **X-axis**: "Entity ID's Sorted by Frequency" (logarithmic scale, 0–700)
- **Y-axis**: "Entity Frequency" (logarithmic scale, 10⁻⁴–10⁻¹)
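An x-axis of "Entity ID's Sorted by Frequency" implies a rank-frequency curve: entities are counted, normalized to relative frequencies, and ordered from most to least common. The sketch below shows one plausible way such a curve could be computed. The function name `rank_frequency` and the toy mention list are hypothetical and are not taken from the chart or its underlying data.

```python
from collections import Counter


def rank_frequency(mentions):
    """Return relative entity frequencies, most common entity first."""
    counts = Counter(mentions)
    total = sum(counts.values())
    # most_common() sorts by descending count; the position in the
    # returned list plays the role of the sorted Entity ID on the x-axis.
    return [count / total for _, count in counts.most_common()]


# Hypothetical toy corpus of entity mentions (not data from the chart).
mentions = ["aspirin"] * 5 + ["ibuprofen"] * 3 + ["warfarin"] * 2
freqs = rank_frequency(mentions)
print(freqs)  # [0.5, 0.3, 0.2]
```

Plotting one such list per series against its index, with a logarithmic y-axis, would give a chart of the type described here.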
## Legend
| Color | Label |
|--------|-------------|
| Blue | ZeroGen |
| Orange | DemoGen |
| Green | Ground Truth|
## Key Trends
1. **Initial Convergence**:
   - All three lines (ZeroGen, DemoGen, Ground Truth) start at ~10⁻¹ frequency for the first 50 Entity IDs.
   - The lines diverge sharply after Entity ID 50.
2. **Performance Divergence**:
   - **ZeroGen** (blue) and **DemoGen** (orange) drop below **Ground Truth** (green) by Entity ID 200.
   - Ground Truth declines more gradually overall than the other two series: it falls about 2.5 decades between Entity IDs 0 and 700, versus about 3 decades for ZeroGen and DemoGen.
3. **Long-Tail Behavior**:
   - Ground Truth retains higher frequency values for Entity IDs >300 (e.g., ~10⁻³ at ID 300 vs. ~10⁻⁴ for ZeroGen/DemoGen).
   - ZeroGen and DemoGen flatten near 10⁻⁴ frequency after Entity ID 200.
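The relative steepness of the curves can be checked against the approximate readings listed under Data Points. The following is a rough sketch, assuming those readings are taken at face value as log10(frequency) values; the helper names `total_drop` and `segment_slopes` are illustrative only.

```python
# Approximate log10(frequency) readings from the Data Points section.
readings = {
    "Ground Truth": {0: -1.0, 100: -2.0, 200: -2.8, 700: -3.5},
    "ZeroGen/DemoGen": {0: -1.0, 100: -2.5, 200: -3.2, 700: -4.0},
}


def total_drop(points):
    """Total decline in decades between the first and last reading."""
    ids = sorted(points)
    return points[ids[0]] - points[ids[-1]]


def segment_slopes(points):
    """Decline in decades per 100 Entity IDs between consecutive readings."""
    ids = sorted(points)
    return {
        (a, b): (points[a] - points[b]) / (b - a) * 100
        for a, b in zip(ids, ids[1:])
    }


for name, points in readings.items():
    slopes = {seg: round(s, 2) for seg, s in segment_slopes(points).items()}
    print(name, round(total_drop(points), 2), slopes)
```

Under these readings, most of the gap opens within the first 100 Entity IDs (about 1.5 decades lost by ZeroGen/DemoGen versus about 1.0 by Ground Truth); beyond that point the curves decline at broadly similar rates.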
## Data Points
- **Entity ID 0**: All models ≈10⁻¹ frequency.
- **Entity ID 100**:
  - Ground Truth ≈10⁻²
  - ZeroGen/DemoGen ≈10^(-2.5)
- **Entity ID 200**:
  - Ground Truth ≈10^(-2.8)
  - ZeroGen/DemoGen ≈10^(-3.2)
- **Entity ID 700**:
  - Ground Truth ≈10^(-3.5)
  - ZeroGen/DemoGen ≈10⁻⁴
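Fractional exponents are hard to compare by eye, so it can help to convert the readings above into decimal frequencies and into a ratio between Ground Truth and the other two series. This is a minimal sketch over the approximate values listed in this section; `to_frequency` and `gap_ratio` are hypothetical helper names.

```python
# (entity_id, log10 frequency for Ground Truth, log10 frequency for ZeroGen/DemoGen)
# Values are the approximate chart readings listed above.
readings = [
    (0, -1.0, -1.0),
    (100, -2.0, -2.5),
    (200, -2.8, -3.2),
    (700, -3.5, -4.0),
]


def to_frequency(exponent):
    """Convert a log10 reading into a decimal frequency."""
    return 10.0 ** exponent


def gap_ratio(gt_exp, gen_exp):
    """How many times higher the Ground Truth frequency is."""
    return 10.0 ** (gt_exp - gen_exp)


for entity_id, gt_exp, gen_exp in readings:
    print(
        f"ID {entity_id:>3}: "
        f"GT={to_frequency(gt_exp):.5f} "
        f"ZeroGen/DemoGen={to_frequency(gen_exp):.5f} "
        f"ratio={gap_ratio(gt_exp, gen_exp):.2f}x"
    )
```

A gap of half a decade, as at Entity IDs 100 and 700, corresponds to Ground Truth being roughly 3.2 times more frequent.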
## Observations
- Ground Truth retains higher long-tail frequencies than ZeroGen and DemoGen: at Entity ID 700 it sits about half a decade above them (≈10^(-3.5) vs. ≈10⁻⁴).
- ZeroGen and DemoGen exhibit similar performance, with DemoGen slightly outperforming ZeroGen in early Entity IDs (<100).
- Logarithmic scaling emphasizes frequency disparities at lower Entity IDs.