# Technical Analysis: Language Model Sampling and Verification Strategies
This document provides a comprehensive extraction of the technical information contained in the provided diagram, which illustrates various methods for generating and verifying answers from a Language Model (LM).
## 1. Global Legend and Key
Located in the top-right header of the right-hand panel:
* **Dashed Blue Box:** "Apply Verifier"
* **Green Circle:** "Selected by verifier"
* **Red Circle:** "Rejected by verifier"
---
## 2. Left Panel: Sampling Methodologies
### Section: Parallel Sampling
* **Input (Blue Box):** "Q: If 4 daps = 7 yaps, and 5 yaps = 3 baps, how many daps equal 42 baps?"
* **Process:** The question is fed into a block labeled **"LM"**.
* **Flow:** The LM branches into three independent parallel paths.
* **Outputs:**
1. **Red Box:** "A: So 7/4 yap/dap ..."
2. **Red Box:** "A: We have 4 dap..."
3. **Green Box:** "A: If 7/4 yaps/dap ..."
* **Annotation:** "LM proposes answers independently, in parallel"
* **Visual Trend:** A one-to-many divergence where multiple complete answers are generated simultaneously.
### Section: Sequential Revisions
* **Input (Blue Box):** Same question as above regarding daps, yaps, and baps.
* **Process:** The question is fed into the **"LM"**.
* **Flow:** A linear chain of three boxes connected by arrows.
* **Outputs:**
1. **Red Box:** "A: We ..."
2. **Red Box:** "A: So ..."
3. **Green Box:** "A: If 7/4 ..."
* **Annotation:** "LM proposes a sequence of revisions, each conditioned on previous revisions"
* **Visual Trend:** A serial progression where each step modifies the previous output until a final (green) state is reached.
---
## 3. Right Panel: Using Revision Model + Verifier at Inference Time
### Section: Parallel Best-of-N
* **Structure:** A central "Question" node branches out to 7 parallel nodes.
* **Components:** Each node is enclosed in a dashed blue box ("Apply Verifier").
* **Data Points:**
* 6 nodes contain a **Red Circle** (Rejected).
* 1 node (the 3rd from the top) contains a **Green Circle** (Selected).
* **Annotation:** "Verifier selects the best answer"
* **Visual Trend:** A "shotgun" approach where many candidates are generated and a verifier picks the single successful one.
### Section: Sequential Revisions (with Verifier)
* **Structure:** A linear horizontal chain of 7 nodes following a "Question" input.
* **Components:** Each node is enclosed in a dashed blue box.
* **Data Points:**
* Nodes 1, 2, 3, 4, 6, and 7 contain **Red Circles**.
* Node 5 contains a **Green Circle**.
* **Annotation:** "Verifier selects the best answer"
* **Visual Trend:** The model iterates through steps; the verifier monitors the sequence and identifies the optimal state mid-chain or at a specific revision point.
### Section: Combining Sequential / Parallel
* **Structure:** A hybrid approach. The "Question" branches into two parallel horizontal chains. Each chain has 4 sequential nodes.
* **Internal Process:**
* **Top Chain:** Nodes 1, 2, and 4 are Red; Node 3 is Green.
* **Bottom Chain:** Nodes 1, 3, and 4 are Red; Node 2 is Green.
* **Annotation (Internal):** "Verifier selects the best answer within each chain"
* **Final Selection:** The two selected nodes (Green) from the chains are passed to a final verification stage.
* The top candidate is **Green**.
* The bottom candidate is **Red**.
* **Annotation (Final):** "Verifier selects the best answer across chains"
* **Visual Trend:** A hierarchical selection process. It uses parallel processing to explore different "paths" of sequential reasoning, then performs a final comparison to find the absolute best result.
---
## 4. Summary of Logic and Flow
The document describes a transition from simple generation (Left) to verified selection (Right).
* **Parallelism** increases the search space.
* **Sequencing** allows for iterative refinement.
* **Verification** acts as a filter to identify the correct mathematical logic (as evidenced by the "daps/yaps" word problem) among various failed attempts.