## Prompt: GPT-4o Training Dataset Construction
### Overview
The image presents a prompt designed for GPT-4o to construct a training dataset. The prompt outlines the task of analyzing and evaluating multi-step solutions to multimodal mathematical problems, focusing on image alignment and reasoning logic. It also specifies the output format, including step-by-step analysis and correction of the first incorrect step.
### Components/Axes
The prompt is structured into several key sections:
- **Task**: Defines the objectives, including analyzing the purpose and correctness of each step in a solution.
- **Question**: Placeholder for the multimodal mathematical problem.
- **Solution Steps**: Placeholder for the multiple-step solution.
- **Output Format**: Specifies the structure for the output, including step intent analysis, image alignment analysis, reasoning logic analysis, and final judgement.
### Detailed Analysis or ### Content Details
The prompt provides detailed instructions for evaluating each step of a solution:
1. **Task**:
* Analyze the purpose of each step and the specific actions taken.
* Analyze each step's correctness in terms of image alignment and reasoning logic.
* Image alignment: Check if the information and reasoning used are consistent with the provided image.
* Reasoning logic: Check if the reasoning is logically sound, calculations are correct, and information used matches previous steps and the question.
* Output "Correct" or "Incorrect" for each judgement.
* Correct the first incorrect step based on error analysis and output the corrected step.
2. **Question**:
* The multimodal mathematical problem is as follows: `<Question>`
3. **Solution Steps**:
* The multiple-step solution is as follows: `<Solution Steps>`
4. **Output Format**:
* Step 1:
* Step intent analysis: \[Describe what the step aims to do and the specific actions]
* Image alignment analysis: \[Analyze the consistency of image alignment]
* Judgement of image alignment: \[Correct/Incorrect]
* Reasoning logic analysis: \[Analyze the rationality of logic, correctness of calculations and consistency with prior step]
* Judgement of reasoning logic: \[Correct/Incorrect]
* Final judgement of the current step: \[Correct/Incorrect]
* Step 2: ...
* Corrected step of the first incorrected step in solution:
* Step n: \[assume that the first incorrect step is step n, and fill in the corrected step n in the square bracket]
### Key Observations
- The prompt emphasizes the importance of both image alignment and reasoning logic in evaluating the correctness of each step.
- The output format is highly structured, requiring detailed analysis and judgement for each step.
- The prompt includes placeholders for the question and solution steps, indicating that these will be provided separately.
### Interpretation
The prompt is designed to guide GPT-4o in creating a training dataset for evaluating multi-step solutions to multimodal mathematical problems. By focusing on image alignment and reasoning logic, the prompt aims to ensure that the model can accurately assess the correctness of each step and provide appropriate corrections. The structured output format facilitates the creation of a high-quality training dataset that can be used to improve the model's performance on similar tasks.