← All solutions

Reasoning Verification

Step-by-step verification of math, code, and logic.

Domain experts mark which steps in a chain-of-thought are valid and which break, producing the process-reward signal frontier reasoning models need.

Per-step process-reward annotation

Worqgrid's reasoning annotator splits a chain-of-thought into atomic steps. Reviewers mark each one as correct, flawed, or unverifiable, and explain why.

  • Auto-segmentation of model traces into discrete steps
  • Per-step verdict with rationale text
  • Optional final-answer correctness check
  • Export as PRM / ORM training pairs
Problem
Find ∫ x²·sin(x) dx using integration by parts.
  1. step 1 · Set u = x², dv = sin(x) dx.
    Correct
  2. step 2 · Then du = 2x dx, v = −cos(x).
    Correct
  3. step 3 · So ∫x²sin(x) dx = −x²cos(x) + 2∫x·cos(x) dx.
    Flawed
  4. step 4 ·
    Pending

For the domains where every step matters

Mathematics

Olympiad-level proofs, calculus, linear algebra, verified by PhDs.

Code & algorithms

Step-level checks for correctness, complexity claims, and edge cases.

Logic & formal reasoning

Predicate logic, set theory, and proof-style argument verification.

Quantitative finance & physics

Domain-specific reasoning audits with unit and dimensional checks.

Train models that reason, not just guess.