# Technical Document Analysis of Bar Chart
## 1. Labels and Axis Titles
- **X-Axis**: Datasets (Categorical)
  - LitCovid
  - CDR
  - MEDIQA-RQE
  - MQP
  - CHEMDNER
  - BC5CDR-D
- **Y-Axis**: CMD (Central Moment Discrepancy), numerical scale from 0.0 to 2.0
- **Legend**: Model classifications with color coding (Top-left placement)
  - ZeroGen: Dark purple
  - ClinGen w/KG: Red
  - ClinGen w/LLM: Orange
  - DemoGen: Maroon
  - ProGen: Pink
  - Ground Truth: Beige
## 2. Categories and Sub-Categories
- **Models** (Legend labels):
  - ZeroGen
  - ClinGen w/KG
  - ClinGen w/LLM
  - DemoGen
  - ProGen
  - Ground Truth
- **Datasets** (X-axis categories):
  - LitCovid
  - CDR
  - MEDIQA-RQE
  - MQP
  - CHEMDNER
  - BC5CDR-D
## 3. Data Points and Trends
### Dataset: LitCovid
- **ZeroGen**: ~0.82 (Dark purple)
- **ClinGen w/KG**: ~0.48 (Red)
- **ClinGen w/LLM**: ~0.47 (Orange)
- **DemoGen**: ~0.55 (Maroon)
- **ProGen**: ~0.53 (Pink)
- **Ground Truth**: ~0.41 (Beige)
- **Trend**: ZeroGen shows highest CMD; Ground Truth lowest.
### Dataset: CDR
- **ZeroGen**: ~1.22 (Dark purple)
- **ClinGen w/KG**: ~0.88 (Red)
- **ClinGen w/LLM**: ~0.82 (Orange)
- **DemoGen**: ~1.12 (Maroon)
- **ProGen**: ~1.14 (Pink)
- **Ground Truth**: ~0.79 (Beige)
- **Trend**: ZeroGen peaks; Ground Truth remains lowest.
### Dataset: MEDIQA-RQE
- **ZeroGen**: ~1.26 (Dark purple)
- **ClinGen w/KG**: ~0.78 (Red)
- **ClinGen w/LLM**: ~0.79 (Orange)
- **DemoGen**: ~1.13 (Maroon)
- **ProGen**: ~1.12 (Pink)
- **Ground Truth**: ~0.78 (Beige)
- **Trend**: ZeroGen highest; ClinGen models cluster near Ground Truth.
### Dataset: MQP
- **ZeroGen**: ~1.28 (Dark purple)
- **ClinGen w/KG**: ~0.85 (Red)
- **ClinGen w/LLM**: ~0.84 (Orange)
- **DemoGen**: ~1.23 (Maroon)
- **ProGen**: ~1.24 (Pink)
- **Ground Truth**: ~0.83 (Beige)
- **Trend**: ZeroGen and DemoGen/ProGen show similar high CMD.
### Dataset: CHEMDNER
- **ZeroGen**: ~1.95 (Dark purple)
- **ClinGen w/KG**: ~0.98 (Red)
- **ClinGen w/LLM**: ~0.95 (Orange)
- **DemoGen**: ~1.10 (Maroon)
- **ProGen**: ~1.17 (Pink)
- **Ground Truth**: ~0.88 (Beige)
- **Trend**: ZeroGen dramatically higher than others; Ground Truth lowest.
### Dataset: BC5CDR-D
- **ZeroGen**: ~1.52 (Dark purple)
- **ClinGen w/KG**: ~0.86 (Red)
- **ClinGen w/LLM**: ~0.84 (Orange)
- **DemoGen**: ~1.50 (Maroon)
- **ProGen**: ~1.48 (Pink)
- **Ground Truth**: ~0.78 (Beige)
- **Trend**: ZeroGen and DemoGen/ProGen show near-identical high CMD.
## 4. Key Observations
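- **ZeroGen** has the highest CMD on every dataset, by margins ranging from about 0.02 (BC5CDR-D, over DemoGen) to about 0.78 (CHEMDNER, over ProGen). Taking lower CMD as better, it is the weakest method throughout.
- **ClinGen w/KG** and **ClinGen w/LLM** perform comparably (within about 0.06 of each other) and stay within about 0.10 of the **Ground Truth** on every dataset.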
- **DemoGen** and **ProGen** perform similarly to each other (within about 0.07 on every dataset) but have higher CMD than both ClinGen variants on all six datasets, so they underperform the ClinGen models as well as Ground Truth.
- **Ground Truth** (beige bars) has the lowest CMD on every dataset (tied with ClinGen w/KG at ~0.78 on MEDIQA-RQE), serving as the reference point for the other methods.
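
The observations above can be checked against the approximate bar heights. The following is a minimal sketch, assuming the values read off the chart are accurate to about two decimals; the names `METHODS`, `CMD`, `highest_method`, and `mean_gap_to_ground_truth` are illustrative and do not come from the chart.

```python
# Approximate CMD values read off the chart (assumed accurate to ~0.01).
# Column order follows the legend; the last column is Ground Truth.
METHODS = ["ZeroGen", "ClinGen w/KG", "ClinGen w/LLM",
           "DemoGen", "ProGen", "Ground Truth"]
CMD = {
    "LitCovid":   [0.82, 0.48, 0.47, 0.55, 0.53, 0.41],
    "CDR":        [1.22, 0.88, 0.82, 1.12, 1.14, 0.79],
    "MEDIQA-RQE": [1.26, 0.78, 0.79, 1.13, 1.12, 0.78],
    "MQP":        [1.28, 0.85, 0.84, 1.23, 1.24, 0.83],
    "CHEMDNER":   [1.95, 0.98, 0.95, 1.10, 1.17, 0.88],
    "BC5CDR-D":   [1.52, 0.86, 0.84, 1.50, 1.48, 0.78],
}


def highest_method(values):
    """Return the name of the method with the largest CMD in one dataset row."""
    return METHODS[values.index(max(values))]


def mean_gap_to_ground_truth():
    """Average (method CMD - Ground Truth CMD) over all datasets, per method."""
    gaps = {}
    for i, method in enumerate(METHODS[:-1]):
        diffs = [row[i] - row[-1] for row in CMD.values()]
        gaps[method] = round(sum(diffs) / len(diffs), 3)
    return gaps


# ZeroGen is the tallest bar in every group.
zerogen_always_highest = all(highest_method(v) == "ZeroGen" for v in CMD.values())
# No method falls below Ground Truth (ties are allowed).
ground_truth_never_exceeded = all(min(v) == v[-1] for v in CMD.values())
gaps = mean_gap_to_ground_truth()

if __name__ == "__main__":
    print(zerogen_always_highest, ground_truth_never_exceeded)
    for method, gap in sorted(gaps.items(), key=lambda kv: kv[1]):
        print(f"{method:14s} mean gap to Ground Truth: {gap:.3f}")
```

Sorting by mean gap puts the two ClinGen variants closest to Ground Truth, followed by DemoGen and ProGen, with ZeroGen furthest away.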
## 5. Spatial Grounding and Color Verification
- **Legend Position**: Top-left corner, aligned with bar colors.
- **Color Consistency**: All bars match legend colors exactly (e.g., ZeroGen = dark purple, ClinGen w/KG = red).
## 6. Component Isolation
- **Main Chart**: Bar chart with grouped bars per dataset.
- **Legend**: Top-left, no overlap with data.
- **Axes**: Clearly labeled with dataset names (X) and CMD values (Y).
## 7. Data Table Reconstruction
| Dataset | ZeroGen | ClinGen w/KG | ClinGen w/LLM | DemoGen | ProGen | Ground Truth |
|---------------|---------|--------------|---------------|---------|--------|--------------|
| LitCovid | 0.82 | 0.48 | 0.47 | 0.55 | 0.53 | 0.41 |
| CDR | 1.22 | 0.88 | 0.82 | 1.12 | 1.14 | 0.79 |
| MEDIQA-RQE | 1.26 | 0.78 | 0.79 | 1.13 | 1.12 | 0.78 |
| MQP | 1.28 | 0.85 | 0.84 | 1.23 | 1.24 | 0.83 |
| CHEMDNER | 1.95 | 0.98 | 0.95 | 1.10 | 1.17 | 0.88 |
| BC5CDR-D | 1.52 | 0.86 | 0.84 | 1.50 | 1.48 | 0.78 |
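
For downstream use, the reconstructed table can be loaded into a nested dictionary. This is a sketch using only the standard library; `parse_markdown_table` and `ratio_to_ground_truth` are hypothetical helper names, and the numbers are the approximate readings from the table above, not exact source data.

```python
TABLE = """\
| Dataset | ZeroGen | ClinGen w/KG | ClinGen w/LLM | DemoGen | ProGen | Ground Truth |
|---|---|---|---|---|---|---|
| LitCovid | 0.82 | 0.48 | 0.47 | 0.55 | 0.53 | 0.41 |
| CDR | 1.22 | 0.88 | 0.82 | 1.12 | 1.14 | 0.79 |
| MEDIQA-RQE | 1.26 | 0.78 | 0.79 | 1.13 | 1.12 | 0.78 |
| MQP | 1.28 | 0.85 | 0.84 | 1.23 | 1.24 | 0.83 |
| CHEMDNER | 1.95 | 0.98 | 0.95 | 1.10 | 1.17 | 0.88 |
| BC5CDR-D | 1.52 | 0.86 | 0.84 | 1.50 | 1.48 | 0.78 |
"""


def parse_markdown_table(text):
    """Parse a pipe-delimited markdown table into {row_label: {column: float}}."""
    lines = [ln.strip() for ln in text.strip().splitlines()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = {}
    for ln in lines[2:]:  # skip the header row and the |---| separator row
        cells = [cell.strip() for cell in ln.strip("|").split("|")]
        rows[cells[0]] = {col: float(val) for col, val in zip(header[1:], cells[1:])}
    return rows


def ratio_to_ground_truth(table):
    """Express each method's CMD as a multiple of the Ground Truth CMD."""
    return {
        dataset: {
            method: round(value / row["Ground Truth"], 2)
            for method, value in row.items()
            if method != "Ground Truth"
        }
        for dataset, row in table.items()
    }


table = parse_markdown_table(TABLE)
ratios = ratio_to_ground_truth(table)

if __name__ == "__main__":
    for dataset, row in ratios.items():
        print(dataset, row)
```

Ratios make the groups comparable despite their different absolute scales: a value near 1.0 means the method's bar is about the same height as the Ground Truth bar for that dataset.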
## 8. Conclusion
The chart compares CMD for five generation methods and a Ground Truth reference across six biomedical datasets. ZeroGen has the highest CMD on every dataset, the two ClinGen variants stay within about 0.10 of Ground Truth, and DemoGen and ProGen fall in between, approaching ZeroGen on MQP and BC5CDR-D.