## Chart Type: Stacked Bar Chart
### Overview
The image is a stacked bar chart titled "Success Rates". It compares the success rates of four different conditions: "w-ctx", "wo-ctx", "4o-NL", and "o3-NL". The success rates are broken down into three categories: "Proved" (teal), "Proof Gap" (purple), and "Rejected" (orange).
### Components/Axes
* **Title:** Success Rates
* **Y-axis:** Ranges from 0.0 to 1.0 in increments of 0.2. The y-axis represents the success rate.
* **X-axis:** Categorical axis with four categories: "w-ctx", "wo-ctx", "4o-NL", and "o3-NL".
* **Legend:** Located in the top-right corner, it identifies the colors corresponding to each category:
* Teal: Proved
* Purple: Proof Gap
* Orange: Rejected
### Detailed Analysis
Here's a breakdown of the data for each category:
* **w-ctx:**
* Proved (Teal): Approximately 0.7
* Proof Gap (Purple): Approximately 0.0 (visually negligible)
* Rejected (Orange): Approximately 0.3
* **wo-ctx:**
* Proved (Teal): Approximately 0.0 (visually negligible)
* Proof Gap (Purple): Approximately 0.0 (visually negligible)
* Rejected (Orange): Approximately 1.0
* **4o-NL:**
* Proved (Teal): Approximately 0.0 (visually negligible)
* Proof Gap (Purple): Approximately 0.4
* Rejected (Orange): Approximately 0.6
* **o3-NL:**
* Proved (Teal): Approximately 0.0 (visually negligible)
* Proof Gap (Purple): Approximately 0.0 (visually negligible)
* Rejected (Orange): Approximately 1.0
### Key Observations
* "w-ctx" has the highest "Proved" success rate.
* "wo-ctx" and "o3-NL" have a "Rejected" success rate of 1.0.
* "4o-NL" has the highest "Proof Gap" success rate.
### Interpretation
The chart compares the success rates of four different conditions, breaking down the results into "Proved", "Proof Gap", and "Rejected" categories. The "w-ctx" condition appears to be the most successful in terms of proofs, while "wo-ctx" and "o3-NL" are entirely rejected. The "4o-NL" condition shows a significant "Proof Gap", suggesting that it often encounters issues that prevent it from being fully proven or rejected. The data suggests that the "w-ctx" condition is the most reliable, while "wo-ctx" and "o3-NL" are the least.