# Technical Document Extraction: t-SNE Visualization of Hallucination Types
## 1. Image Overview
This image is a 2D scatter plot, likely generated using a dimensionality reduction technique such as t-SNE (t-distributed Stochastic Neighbor Embedding). It shows how three categories of data points (factual associations, associative hallucinations, and unassociated hallucinations) are distributed in the projected 2D space.
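For orientation, a plot of this kind could be produced roughly as follows. This is a minimal sketch, assuming scikit-learn's `TSNE` was the reduction method. The feature vectors, their dimensionality, the sample counts, and the perplexity are hypothetical stand-ins, not values recovered from the image.

```python
import numpy as np
from sklearn.manifold import TSNE

# Hypothetical stand-in data: 90 vectors of dimension 16, 30 per category.
rng = np.random.default_rng(0)
labels = np.repeat(["Factual Asso.", "Asso. Hallu.", "Unasso. Hallu."], 30)
features = rng.normal(size=(90, 16))

# Assumption for illustration: shift one group so that it separates.
features[labels == "Unasso. Hallu."] += 4.0

# Project to 2D; perplexity must be smaller than the number of samples.
tsne = TSNE(n_components=2, perplexity=15, init="pca", random_state=0)
embedding = tsne.fit_transform(features)

print(embedding.shape)  # (90, 2)
```

Each row of `embedding` would then be drawn as one semi-transparent marker, colored by its entry in `labels`.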
## 2. Component Isolation
### A. Header/Title
* **Content:** None present.
### B. Main Chart Area (Data Visualization)
* **Type:** Scatter Plot.
* **X-Axis:** Numerical scale ranging from approximately **-30 to +25**. Major tick marks are labeled at **-20, -10, 0, 10, 20**.
* **Y-Axis:** Numerical scale ranging from approximately **-25 to +25**. Major tick marks are labeled at **-20, -10, 0, 10, 20**.
* **Data Points:** Approximately 300-400 semi-transparent circular markers.
### C. Legend (Spatial Grounding: Bottom-Left [x≈-28, y≈-22])
The legend is enclosed in a white box with a grey border. It maps colors to specific categories:
| Color | Label | Full Name |
| :--- | :--- | :--- |
| Green | `Factual Asso.` | Factual Association |
| Blue | `Asso. Hallu.` | Associative Hallucination |
| Red | `Unasso. Hallu.` | Unassociated Hallucination |
---
## 3. Data Series Analysis and Trends
### Series 1: Factual Asso. (Green)
* **Visual Trend:** This series is widely dispersed across the upper and central regions of the plot. It shows a high degree of overlap with the "Asso. Hallu." (Blue) series.
* **Spatial Distribution:**
    * Concentrated primarily between X: [-20, 5] and Y: [-5, 20].
    * Outliers exist in the lower-right quadrant (X: 10, Y: -15).
* **Observation:** The green points act as a background distribution that overlaps heavily with the Blue series, while only a few green outliers fall near the Red cluster. This indicates that factual associations share feature space mainly with associative hallucinations.
### Series 2: Asso. Hallu. (Blue)
* **Visual Trend:** This series is predominantly located in the upper-left and center-left of the plot, with a secondary cluster in the bottom-center.
* **Spatial Distribution:**
    * Primary cluster: X: [-25, 5], Y: [0, 20].
    * Secondary cluster: X: [0, 10], Y: [-25, -15].
* **Observation:** There is significant intermingling with the Green series, suggesting that "Associative Hallucinations" are semantically close to "Factual Associations."
### Series 3: Unasso. Hallu. (Red)
* **Visual Trend:** This series shows the most distinct clustering behavior. While some points are scattered in the center, there is a very dense, isolated cluster on the far right.
* **Spatial Distribution:**
    * **Main Cluster:** X: [12, 23], Y: [-18, 2]. This cluster is relatively "clean" with very few green or blue points intermixed.
    * **Scattered Points:** A few points are located in the center-top (X: -7, Y: 23) and center-bottom (X: 5, Y: -25).
* **Observation:** The distinct cluster on the right suggests that most "Unassociated Hallucinations" have feature representations that set them apart from both factual associations and associative hallucinations, although a few red points still fall among the other two series.
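The "clean" description of the right-hand cluster could be checked numerically if the plotted coordinates were available. The sketch below counts the label shares inside the approximate bounding box reported above (X: [12, 23], Y: [-18, 2]). The helper names and the sample coordinates are hypothetical and are not extracted from the image.

```python
from collections import Counter

# Approximate bounds of the right-hand cluster, as read off the plot.
RED_BOX = {"x": (12.0, 23.0), "y": (-18.0, 2.0)}

def in_box(point, box):
    """Return True if the (x, y) point lies inside the box, edges included."""
    x, y = point
    return box["x"][0] <= x <= box["x"][1] and box["y"][0] <= y <= box["y"][1]

def box_composition(points, labels, box):
    """Return the share of each label among the points that fall inside the box."""
    inside = Counter(label for point, label in zip(points, labels) if in_box(point, box))
    total = sum(inside.values())
    return {label: count / total for label, count in inside.items()} if total else {}

# Hypothetical coordinates for illustration only.
points = [(15.0, -5.0), (20.0, -10.0), (18.0, 0.0), (14.0, -12.0), (-10.0, 10.0), (5.0, -25.0)]
labels = ["Unasso. Hallu.", "Unasso. Hallu.", "Unasso. Hallu.",
          "Factual Asso.", "Asso. Hallu.", "Unasso. Hallu."]

composition = box_composition(points, labels, RED_BOX)
print(composition)  # {'Unasso. Hallu.': 0.75, 'Factual Asso.': 0.25}
```

A share close to 1.0 for `Unasso. Hallu.` on the real coordinates would support the "very few green or blue points" reading.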
---
## 4. Summary of Findings
The visualization suggests two groupings among the three categories:
1. **Factual Associations (Green)** and **Associative Hallucinations (Blue)** overlap heavily in the left and center of the plot, which suggests that their underlying high-dimensional representations are similar.
2. **Unassociated Hallucinations (Red)** mostly form a distinct cluster on the right side of the plot, which suggests that their representations differ from those of the other two categories. Because t-SNE preserves local neighborhoods rather than global distances, the size of the gap between clusters should not be read as a measure of how far apart the categories are.
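Since a 2D projection can distort distances, these findings are best confirmed in the original feature space. One possible check, sketched below, is a k-nearest-neighbor label purity: for each point, the fraction of its nearest neighbors that share its label. The function name `knn_label_purity`, the value of `k`, and the toy 3D vectors are assumptions for illustration, not part of the source figure.

```python
import math

def knn_label_purity(vectors, labels, k=2):
    """Per label, average the fraction of each point's k nearest neighbors sharing its label."""
    per_label = {}
    for i, (vec, label) in enumerate(zip(vectors, labels)):
        # Rank all other points by Euclidean distance to the current one.
        others = sorted(
            (j for j in range(len(vectors)) if j != i),
            key=lambda j: math.dist(vec, vectors[j]),
        )
        neighbors = others[:k]
        share = sum(labels[j] == label for j in neighbors) / len(neighbors)
        per_label.setdefault(label, []).append(share)
    return {label: sum(shares) / len(shares) for label, shares in per_label.items()}

# Hypothetical vectors: green and blue interleaved, red far away.
vectors = [
    (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0),        # Factual Asso.
    (0.5, 0.0, 0.0), (0.5, 1.0, 0.0), (1.0, 1.0, 0.0),        # Asso. Hallu.
    (10.0, 10.0, 10.0), (10.0, 11.0, 10.0), (11.0, 10.0, 10.0),  # Unasso. Hallu.
]
labels = ["Factual Asso."] * 3 + ["Asso. Hallu."] * 3 + ["Unasso. Hallu."] * 3

purity = knn_label_purity(vectors, labels, k=2)
print(purity)
```

On this toy data the red group reaches a purity of 1.0 while the interleaved green and blue groups score lower, which mirrors the pattern described in the findings above.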