## Diagram: AgentFlow Performance Comparison
### Overview
The image presents a comparison of the performance of an AgentFlow system before and after applying Flow-GRPO fine-tuning. The comparison is based on the agent's ability to compute the check digit for a given Tropicos ID as if it were an ISBN-10 number. The diagram illustrates the steps taken by the agent, the tools used, and the results obtained at each step.
### Components/Axes
* **Question:** "Compute the check digit the Tropicos ID for the Order Helotiales would have if it were an ISBN-10 number." Ground truth answer: 3.
* **Left Side:** AgentFlow (before Flow-GRPO Fine-tuning)
* **Right Side:** AgentFlow (after Flow-GRPO Fine-tuning)
* **Steps:** Numbered steps (1, 2, 3, and 4-9 on the left; 1, 2, 3, and 4 on the right) indicating the sequence of actions taken by the agent.
* **Tool Icons:** Icons representing the tools used (Wikipedia, Google Search, Python Coder).
* **Status Indicators:** Green checkmarks indicate success, red "X" marks indicate failure.
### Detailed Analysis
**Left Side: AgentFlow (before Flow-GRPO Fine-tuning)**
* **Step 1:**
* Selected tool: Wikipedia Search
* Sub-goal: Retrieve Tropicos ID of Order Helotiales from Wikipedia.
* Tool call: query="Tropicos ID for the Order Helotiales"
* Result: No results found for query: Tropicos ID for Order Helotiales. (Failure)
* **Step 2:**
* Selected tool: Google Search
* Sub-goal: Retrieve the Tropicos ID of the Order.
* Tool call: query="Tropicos ID for the Order Helotiales"
* Result: The Tropicos ID for the Order Helotiales is 100370510. (Success)
* **Step 3:**
* Selected tool: Python Coder
* Sub-goal: Calculate the check digit for the Tropicos ID 100370510 if it were an ISBN-10 number.
* Tool call: query="Calculate check digit for ISBN-10 number 100370510"
* Result: name 'isbn' is not defined. (Failure)
* **Steps 4-9:**
* Action steps 3 to 9 have similar sub-goals, commands, and results; the agent is stuck in an error loop and is unable to fix the naming issue.
* **Solution:** The Python tool failed repeatedly with the same 'isbn' variable error. The agent failed to calculate the check digit. (Failure)
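
The diagram does not show the code the Python Coder tool generated, so the exact cause of the `name 'isbn' is not defined` message is not visible. As a hedged illustration only, the sketch below reproduces that kind of error with a hypothetical snippet that reads a variable `isbn` before assigning it, and shows that defining the variable first clears the error. The helper `run_snippet` and both snippet strings are assumptions made for this example, not part of AgentFlow.

```python
def run_snippet(source: str) -> str:
    """Execute `source` in a fresh, empty namespace and report the outcome."""
    try:
        exec(source, {})
    except NameError as err:
        return f"NameError: {err}"
    return "ok"


# Hypothetical generated code: uses `isbn` without ever defining it.
broken = "total = sum(int(c) for c in isbn)"

# Same code with the missing assignment added in front.
repaired = 'isbn = "100370510"\n' + broken

print(run_snippet(broken))    # NameError: name 'isbn' is not defined
print(run_snippet(repaired))  # ok
```

Re-running the same undefined-name code, as the left-hand trace does in steps 3 to 9, produces the same message every time; only a change to the generated code or to the query removes it.
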
**Right Side: AgentFlow (after Flow-GRPO Fine-tuning)**
* **Step 1:**
* Selected tool: Wikipedia Search (Truncated...).
* Result: No results found. (Failure)
* **Step 2:**
* Selected tool: Google Search (Truncated...).
* Result: Tropicos ID: 100370510. (Success)
* **Step 3:**
* Selected tool: Python Coder
* Sub-goal: Write and execute a Python script to calculate the check digit for `tropicos_id` as if it were an ISBN-10 number.
* Tool call: query="Calculate the check digit for the ISBN-10 number" + tropicos\_id
* Result: Error in execute\_command: name 'tropicos\_id' is not defined. (Failure)
* **Step 4:**
* Selected tool: Python Coder
* Sub-goal: Write and execute a Python script to calculate the check digit for `tropicos_id` as if it were an ISBN-10 number.
* Tool call: query="Compute the check digit for 100370510 as an ISBN-10 number"
* Result: 3 (printed output). (Success)
* Python Code:
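
The code listing from the diagram is not reproduced in this description. As a stand-in, here is a minimal sketch of the standard ISBN-10 check-digit rule applied to the nine-digit Tropicos ID; it is not the agent's actual output, and the function name `isbn10_check_digit` is an assumption made for illustration. The rule weights the nine digits by 10 down to 2 and picks the check digit that makes the full weighted sum divisible by 11, with a value of 10 written as `X`.

```python
def isbn10_check_digit(first_nine: str) -> str:
    """Return the ISBN-10 check character for a nine-digit prefix."""
    if len(first_nine) != 9 or not (first_nine.isascii() and first_nine.isdigit()):
        raise ValueError("expected exactly nine decimal digits")
    # Weights run 10, 9, ..., 2 from the leftmost digit to the ninth digit.
    weighted_sum = sum(int(d) * w for d, w in zip(first_nine, range(10, 1, -1)))
    # The check digit brings the total weighted sum to a multiple of 11.
    check = (11 - weighted_sum % 11) % 11
    return "X" if check == 10 else str(check)


print(isbn10_check_digit("100370510"))  # 3
```

For `100370510` the weighted sum is 1·10 + 3·7 + 7·6 + 5·4 + 1·3 = 96, and 96 mod 11 = 8, so the check digit is 11 − 8 = 3, which agrees with the ground truth answer stated in the question.
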