## Pie Charts: Synthetic Problems Difficulty for SwS Variants
### Overview
This image contains four pie charts, each representing the distribution of "Synthetic Problems Difficulty" for different "SwS" variants. The variants are SwS-3B, SwS-7B, SwS-7B-Math, and SwS-32B. Each slice of the pie chart represents a difficulty level, indicated by a number (0 through 8), and its corresponding percentage of the total problems for that variant.
### Components/Axes
Each pie chart has the following characteristics:
- **Title**: Each chart is titled "Synthetic Problems Difficulty for [SwS Variant Name]".
- **Slices**: Each pie chart is divided into multiple slices, representing different difficulty levels.
- **Labels**: Each slice is labeled with a number representing the difficulty level and a percentage in parentheses, indicating its proportion of the total. The difficulty levels range from 0 to 8.
- **Colors**: Distinct colors are used for each difficulty level across all charts, though the specific mapping of color to difficulty level might vary slightly in hue between charts. A red outline is present around some of the smaller slices in each pie chart.
### Detailed Analysis
**1. Synthetic Problems Difficulty for SwS-3B**
| Difficulty Level | Percentage | Color |
| :--------------- | :--------- | :---------- |
| 0 | 32.2% | Teal |
| 1 | 11.0% | Orange |
| 2 | 6.9% | Blue-grey |
| 3 | 5.6% | Purple |
| 4 | 5.0% | Yellow |
| 5 | 4.9% | Tan |
| 6 | 5.3% | Light Brown |
| 7 | 7.1% | Grey |
| 8 | 22.0% | Light Blue |
**2. Synthetic Problems Difficulty for SwS-7B**
| Difficulty Level | Percentage | Color |
| :--------------- | :--------- | :---------- |
| 0 | 23.3% | Teal |
| 1 | 9.9% | Orange |
| 2 | 7.1% | Blue-grey |
| 3 | 6.0% | Purple |
| 4 | 5.6% | Yellow |
| 5 | 5.6% | Tan |
| 6 | 6.1% | Light Brown |
| 7 | 7.9% | Grey |
| 8 | 28.6% | Light Blue |
**3. Synthetic Problems Difficulty for SwS-7B-Math**
| Difficulty Level | Percentage | Color |
| :--------------- | :--------- | :---------- |
| 0 | 30.2% | Teal |
| 1 | 9.7% | Orange |
| 2 | 6.4% | Blue-grey |
| 3 | 5.3% | Purple |
| 4 | 4.9% | Yellow |
| 5 | 4.8% | Tan |
| 6 | 5.3% | Light Brown |
| 7 | 7.7% | Grey |
| 8 | 25.7% | Light Blue |
**4. Synthetic Problems Difficulty for SwS-32B**
| Difficulty Level | Percentage | Color |
| :--------------- | :--------- | :---------- |
| 0 | 18.8% | Teal |
| 1 | 9.2% | Orange |
| 2 | 6.9% | Blue-grey |
| 3 | 5.8% | Purple |
| 4 | 5.5% | Yellow |
| 5 | 5.5% | Tan |
| 6 | 6.2% | Light Brown |
| 7 | 8.2% | Grey |
| 8 | 33.8% | Light Blue |
### Key Observations
- **Dominant Difficulties**: Across all variants, difficulty level 0 and difficulty level 8 consistently represent the largest proportions of problems. Difficulty 0 is generally the largest slice, followed closely by difficulty 8.
- **SwS-3B**: Has the highest percentage for difficulty 0 (32.2%) and a significant portion for difficulty 8 (22.0%).
- **SwS-7B**: Shows the highest percentage for difficulty 8 (28.6%) and a moderate percentage for difficulty 0 (23.3%).
- **SwS-7B-Math**: Similar to SwS-3B, with a high percentage for difficulty 0 (30.2%) and a substantial portion for difficulty 8 (25.7%).
- **SwS-32B**: Exhibits the highest percentage for difficulty 8 (33.8%) and the lowest percentage for difficulty 0 (18.8%) among the four charts.
- **Mid-range Difficulties**: Difficulties 1 through 7 generally represent smaller, more evenly distributed percentages, typically ranging from approximately 4.8% to 11.0%.
- **Consistency**: The relative ordering and approximate percentages of difficulties 1 through 7 appear relatively consistent across the different SwS variants, with some minor variations.
### Interpretation
The data suggests that for all analyzed SwS variants, synthetic problems are predominantly concentrated at the two extremes of the difficulty scale: very easy (difficulty 0) and very hard (difficulty 8). This pattern implies a potential bimodal distribution of problem difficulty, with fewer problems existing in the intermediate difficulty ranges.
The variations between the SwS variants highlight differences in their problem generation or selection strategies. For instance, SwS-32B appears to generate a higher proportion of very difficult problems (difficulty 8) compared to the other variants, while SwS-3B and SwS-7B-Math lean more towards generating very easy problems (difficulty 0). The "Math" variant of SwS-7B shows a distribution very similar to the base SwS-3B, suggesting that the mathematical focus might not drastically alter the overall difficulty distribution compared to a non-math specific variant.
This bimodal distribution could indicate that the problem generation systems are optimized to create either trivial or challenging tasks, potentially for specific testing or training purposes. The red outlines around some of the smaller slices might be a visual cue to draw attention to these less frequent difficulty levels, or they could represent a specific category of problems within the dataset. Further investigation into the nature of "difficulty 0" and "difficulty 8" problems, as well as the context of these SwS variants, would be necessary to fully understand the implications of this observed distribution.