## Data Table: Hierarchical Classification of Mathematical Problems with Sample Counts
### Overview
The image displays a structured table that categorizes mathematical problems into a three-level hierarchy. The table is presented in Chinese, with English translations provided in parentheses for each category. It lists the number of samples available for each specific problem type under the third level (LEVEL3). The table appears to be a dataset taxonomy, likely for an educational or research dataset focused on elementary or middle school mathematics.
### Components/Axes
The table has four columns:
1. **LEVEL1**: The broadest category (e.g., Geometry, Application).
2. **LEVEL2**: A sub-category within LEVEL1 (e.g., Two-dimensional shapes, Fundamental Problem).
3. **LEVEL3**: The most specific problem type (e.g., Triangles, Add & Differential & Multiple).
4. **# Samples**: The count of samples or problems available for that specific LEVEL3 category.
The table is organized into four main sections corresponding to the LEVEL1 categories.
### Detailed Analysis
The table contains the following hierarchical data:
**Language**: The primary language is Chinese. English translations are provided in parentheses for every category label.
**Table Content**:
| LEVEL1 | LEVEL2 | LEVEL3 | # Samples |
| :--- | :--- | :--- | :--- |
| **几何 (Geometry)** | **平面图形 (Two-dimensional shapes)** | 三角形 (Triangles) | 20 |
| | | 圆 (Circle) | 20 |
| | | 平行四边形 (Parallelogram) | 20 |
| | | 梯形 (Trapezium) | 20 |
| | | 正方形 (Square) | 20 |
| | | 平面图形综合 (Synthesis Problem) | 20 |
| | | 角 (Angle) | 20 |
| | | 长方形 (Rectangle) | 20 |
| | **立体图形 (Three-dimensional Shapes)** | 圆柱 (Cylinder) | 20 |
| | | 正方体 (Cube) | 20 |
| | | 立体图形综合问题 (Synthesis Problem) | 20 |
| | | 长方体 (Cuboid) | 20 |
| **应用 (Application)** | **基础 (Fundamental Problem)** | 和差倍问题 (Add & Differential & Multiple) | 20 |
| | | 基础 (Basics) | 21 |
| | | 差倍问题 (Differential) | 20 |
| | | 归一问题 (Normalization) | 20 |
| | | 归总问题 (Induction) | 20 |
| | **经典问题 (Classical Problem)** | 利息问题 (Interest) | 20 |
| | | 周期问题 (Period) | 10 |
| | | 对折问题 (Folding) | 20 |
| | | 工程问题 (Engineering) | 20 |
| | | 年龄问题 (Age) | 20 |
| | | 折扣问题 (Discount) | 20 |
| | | 植树问题 (Planting) | 20 |
| | | 税率问题 (Tax) | 15 |
| | | 还原问题 (Reduction) | 20 |
| | | 页码问题 (Pagination) | 20 |
| | | 鸡兔同笼问题 (Chickens & Rabbits in the Same Cage) | 20 |
| | **路程问题 (Distance Problem)** | 相遇问题 (Encounter) | 20 |
| | | 行程问题 (Travel) | 20 |
| | | 追击问题 (Pursuit) | 20 |
| **度量与统计 (Measurement and Statistics)** | **度量 (Measurement)** | 人民币问题 (RMB) | 9 |
| | | 时间问题 (Time) | 20 |
| | | 浓度问题 (Concentration) | 20 |
| | | 温度问题 (Temperature) | 6 |
| | | 面积问题 (Area) | 17 |
| | **统计 (Statistics)** | 排列组合 (Permutation) | 20 |
| | | 统计指标 (Statistical Metrics) | 20 |
| | | 规律 (Law) | 18 |
| **数与代数 (Number and algebra)** | **分数运算 (Fractional Operation)** | 分数与小数 (Fraction & Decimal) | 20 |
| | | 分数应用 (Fractional Application) | 20 |
| | | 分数运算 (Fractional Operation) | 20 |
| | | 最简分数 (Simplest Fraction) | 16 |
| | **因数与倍数 Factors & Multiples** | 公倍数问题 (Common Multiples) | 16 |
| | | 公约数问题 (Common Divisors) | 11 |
| | | 因数问题 (Factor) | 20 |
| | | 因数与倍数综合问题 (Synthesis Problem) | 11 |
| | | 质数问题 (Prime Number) | 9 |
| | **基础运算 (Basic Operation)** | 乘法问题 (Multiplication) | 20 |
| | | 倒数问题 (Reciprocal Problem) | 16 |
| | | 四则运算 (Four-rule Operation) | 20 |
| | | 新运算定义 (New Operation Definition) | 20 |
| | | 方程问题 (Equation) | 20 |
| | | 除法问题 (Division) | 20 |
| | **比 (Ratio)** | 倍数问题 (Multiple) | 20 |
| | | 概率问题 (Probability) | 20 |
| | | 比例问题 (Proportion) | 20 |
| | | 百分率问题 (Percentage) | 20 |
### Key Observations
1. **Sample Count Consistency**: The vast majority of LEVEL3 categories have exactly 20 samples. This suggests a deliberate effort to balance the dataset.
2. **Notable Outliers**: Several categories have fewer than 20 samples:
* **周期问题 (Period)**: 10 samples.
* **税率问题 (Tax)**: 15 samples.
* **人民币问题 (RMB)**: 9 samples.
* **温度问题 (Temperature)**: 6 samples.
* **面积问题 (Area)**: 17 samples.
* **最简分数 (Simplest Fraction)**: 16 samples.
* **公倍数问题 (Common Multiples)**: 16 samples.
* **公约数问题 (Common Divisors)**: 11 samples.
* **因数与倍数综合问题 (Synthesis Problem)**: 11 samples.
* **质数问题 (Prime Number)**: 9 samples.
* **倒数问题 (Reciprocal Problem)**: 16 samples.
* **规律 (Law)**: 18 samples.
3. **Single Exception**: The category **基础 (Basics)** under **基础 (Fundamental Problem)** has 21 samples, one more than the standard 20.
4. **Hierarchical Structure**: The taxonomy is well-organized, moving from broad mathematical domains (Geometry, Application, etc.) to specific, teachable problem types.
### Interpretation
This table represents the schema of a curated dataset for mathematical problem-solving, likely intended for training or evaluating AI models, or for educational research. The structure reveals a comprehensive coverage of core elementary mathematics topics.
* **Data Suggestion**: The dataset is designed to be balanced, with a target of 20 samples per fine-grained problem type. The outliers indicate either difficulty in sourcing/generating problems for those specific topics (e.g., Temperature, RMB) or a conscious decision to allocate fewer resources to them.
* **Relationships**: The hierarchy shows how complex application problems (like "Chickens & Rabbits") are built upon fundamental concepts. The separation of "Measurement" and "Statistics" under one LEVEL1 category groups practical quantification skills together.
* **Anomalies & Trends**: The near-perfect uniformity of 20 samples is the dominant trend, making the deviations significant. The lowest counts (6 for Temperature, 9 for RMB and Prime Number) may represent niche or more advanced topics within this curriculum scope. The presence of "Synthesis Problem" categories under both 2D and 3D geometry, as well as Factors & Multiples, indicates the dataset includes problems that integrate multiple concepts, which are crucial for assessing higher-order understanding.
* **Purpose**: This is not raw data but a metadata table. Its primary value is in documenting the composition and scope of an underlying collection of math problems, ensuring transparency about the dataset's coverage and potential biases (e.g., under-representation of temperature-related problems).