## Heatmap: ProOntoQA Ablation Study
### Overview
The heatmap shows ProOntoQA performance under different training settings and ablation conditions. Each row is a training setting and each column is an ablation condition. Color intensity encodes the score, with darker shades indicating higher scores.
### Components/Axes
- **Rows**: Represent different training settings (Commonsense - 2%, Anticommense - 2%, Noncommense - 2%, Commonsense - 5%, Anticommense - 5%, Noncommense - 5%, Commonsense - 90%, Anticommense - 90%, Noncommense - 90%).
- **Columns**: Represent different ablation studies (CAPT Setting, CAPT=null, CAPT=order, CAPT=random).
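- **X-Axis**: Lists the ablation conditions (the columns).
- **Y-Axis**: Lists the training settings (the rows).
- **Color scale**: Encodes the score, with darker shades indicating higher scores.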
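
The row and column layout listed above can be sketched as a small grid. This is a minimal illustration only: the labels are copied verbatim from the list, while the variable names (`DATA_TYPES`, `TRAIN_PERCENTAGES`, `COLUMNS`, `ROWS`, `grid`) are hypothetical and the cells are left empty because per-cell scores are not given in the text.

```python
from itertools import product

# Labels copied verbatim from the component list; names are illustrative.
DATA_TYPES = ["Commonsense", "Anticommense", "Noncommense"]
TRAIN_PERCENTAGES = [2, 5, 90]
COLUMNS = ["CAPT Setting", "CAPT=null", "CAPT=order", "CAPT=random"]

# Rows are ordered by training percentage first, then data type,
# which matches the order in which the rows are listed.
ROWS = [
    f"{data_type} - {pct}%"
    for pct, data_type in product(TRAIN_PERCENTAGES, DATA_TYPES)
]

# One cell per (row, column) pair; None marks a score not stated in the text.
grid = {row: {col: None for col in COLUMNS} for row in ROWS}

print(len(ROWS), "rows x", len(COLUMNS), "columns")
```

Keeping the data type and the training percentage as separate factors makes it easy to compare a single data type across percentages, or all data types at one percentage.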
### Detailed Analysis
- **Commonsense - 2% Training**: The highest scores appear in the CAPT Setting and CAPT=order columns, peaking around 95.
- **Anticommense - 2% Training**: Scores are generally lower, peaking around 85 in the CAPT Setting.
- **Noncommense - 2% Training**: As with Anticommense, the peak is around 85 in the CAPT Setting.
- **Commonsense - 5% Training**: Scores are slightly higher overall than at 2%, although the peak (around 90, in the CAPT Setting) is below the 2% peak of around 95.
- **Anticommense - 5% Training**: Scores are similar to the 2% setting, again peaking around 85 in the CAPT Setting.
- **Noncommense - 5% Training**: Scores are slightly higher than at 2%, with the peak unchanged at around 85 in the CAPT Setting.
- **Commonsense - 90% Training**: Scores are consistently high, peaking around 95 in the CAPT Setting.
- **Anticommense - 90% Training**: Scores are similar to the 2% and 5% settings, peaking around 85 in the CAPT Setting.
- **Noncommense - 90% Training**: Scores are slightly higher than at 2% and 5%, with the peak still around 85 in the CAPT Setting.
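
To compare the peaks listed above side by side, they can be tabulated in a few lines of Python. This is a rough sketch: the numbers are the approximate "around" values from the bullets, not exact readings from the heatmap, and the helper names `peaks_by_data_type` and `gap_to_commonsense` are hypothetical.

```python
# Approximate peak scores as stated in the bullets above ("around" values).
PEAKS = {
    ("Commonsense", 2): 95, ("Anticommense", 2): 85, ("Noncommense", 2): 85,
    ("Commonsense", 5): 90, ("Anticommense", 5): 85, ("Noncommense", 5): 85,
    ("Commonsense", 90): 95, ("Anticommense", 90): 85, ("Noncommense", 90): 85,
}


def peaks_by_data_type(peaks):
    """Group peak scores by data type, ordered by training percentage."""
    grouped = {}
    for (data_type, pct), score in sorted(peaks.items(), key=lambda kv: kv[0][1]):
        grouped.setdefault(data_type, []).append((pct, score))
    return grouped


def gap_to_commonsense(peaks):
    """Commonsense peak minus each other peak at the same training percentage."""
    return {
        (data_type, pct): peaks[("Commonsense", pct)] - score
        for (data_type, pct), score in peaks.items()
        if data_type != "Commonsense"
    }


for data_type, series in peaks_by_data_type(PEAKS).items():
    print(data_type, series)
print(gap_to_commonsense(PEAKS))
```

With these approximate values, the Commonsense peak sits 5 to 10 points above the other two data types at every training percentage, while the Anticommense and Noncommense peaks do not move as the training percentage grows.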
### Key Observations
- The CAPT Setting column holds the highest score in every training setting; for Commonsense at 2%, CAPT=order matches it.
- Scores are highest when the model is trained with 90% of the data, though the gain in peak score is small: Commonsense peaks around 95 at both 2% and 90%, and the other two data types peak around 85 at every training percentage.
- Scores are lowest when the model is trained with 2% of the data.
### Interpretation
The heatmap suggests that the CAPT Setting is the most effective configuration for ProOntoQA, regardless of how much training data is used. The highest scores are reached when the model is trained with 90% of the data, which indicates that a larger training set helps. The data type also matters: Commonsense rows peak around 90 to 95, while the other two data types peak around 85 at every training percentage. Replacing the CAPT Setting with CAPT=null, CAPT=order, or CAPT=random does not raise the peak score in any row.