## Bar Chart: Edit Distance vs. Noise Level
### Overview
The chart compares the edit distance (a measure of string similarity) across four categories (All, Insertion, Deletion, Modify) at varying noise levels (5% to 30%). Each category is represented by a distinct patterned bar, with edit distance values on the y-axis (0.0–0.8) and noise levels on the x-axis (5%–30%).
### Components/Axes
- **X-axis**: Noise Level (%) with markers at 5%, 10%, 15%, 20%, 25%, and 30%.
- **Y-axis**: Edit Distance (0.0–0.8) in increments of 0.2.
- **Legend**: Located in the top-left corner, mapping categories to patterns/colors:
- **All**: Purple with diagonal stripes.
- **Insertion**: Red with diagonal stripes.
- **Deletion**: Blue with crosshatch.
- **Modify**: Green with dots.
### Detailed Analysis
- **5% Noise**:
- All: ~0.2
- Insertion: ~0.25
- Deletion: ~0.15
- Modify: ~0.2
- **10% Noise**:
- All: ~0.35
- Insertion: ~0.45
- Deletion: ~0.25
- Modify: ~0.35
- **15% Noise**:
- All: ~0.5
- Insertion: ~0.6
- Deletion: ~0.35
- Modify: ~0.5
- **20% Noise**:
- All: ~0.6
- Insertion: ~0.7
- Deletion: ~0.45
- Modify: ~0.6
- **25% Noise**:
- All: ~0.65
- Insertion: ~0.8
- Deletion: ~0.55
- Modify: ~0.7
- **30% Noise**:
- All: ~0.7
- Insertion: ~0.85
- Deletion: ~0.6
- Modify: ~0.8
### Key Observations
1. **Upward Trend**: All categories show increasing edit distance as noise level rises.
2. **Insertion Dominance**: Insertion consistently has the highest edit distance across all noise levels.
3. **Deletion Consistency**: Deletion has the lowest edit distance, remaining stable relative to other categories.
4. **Modify vs. All**: Modify and All categories exhibit similar trends, with Modify slightly outperforming All at higher noise levels (e.g., 25% and 30%).
### Interpretation
The data suggests that **insertion errors are the most disruptive** to edit distance, likely due to their direct impact on string alignment. Deletion errors are the least impactful, possibly because they remove characters without introducing new ones. Modify errors fall between the two, indicating that character substitutions are moderately disruptive. The "All" category aggregates all error types, showing a middle-ground trend. The consistent upward trend across noise levels highlights the sensitivity of edit distance to increasing corruption in input data. This could inform noise-robustness strategies in text processing systems.