Why Critiqor?
Most evaluation tools ask the agent to grade its own output. That works until it doesn’t: hallucinated tool calls look fine in the response, infinite loops produce confident-sounding answers, and memory failures are invisible unless you were watching the trace. Critiqor solves five problems that agent self-reports cannot:

| Problem | What goes wrong without Critiqor |
|---|---|
| Unreliable evaluation | Agents grade their own output — high confidence despite clear failures |
| No runtime visibility | Tool calls, retries, and memory events happen invisibly |
| Weak explainability | You know the output is wrong; you don’t know why |
| No root cause analysis | Failures are observed but not traced to a cause |
| No evidence-backed diagnosis | Guesses replace data when debugging agents |
How It Works
When you run critiqor monitor openclaw, Critiqor:
- Attaches the bundled OpenClaw plugin before the agent starts
- Collects every observable runtime event into a session file
- After the session ends, runs the diagnosis engine over the evidence
- Writes a diagnosis file and opens a local dashboard with findings
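The collect-then-diagnose flow above can be sketched in a few lines of Python. This is only an illustration of the idea, not Critiqor's implementation: the JSON Lines session format, the file names, the event fields (`type`, `tool`, `ok`), and the helper names `record_event`, `diagnose`, and `run_demo` are all assumptions made for this example.

```python
import json
import tempfile
from collections import Counter
from pathlib import Path


def record_event(session_file: Path, event: dict) -> None:
    """Append one runtime event to the session file as a JSON line."""
    # Hypothetical format: one JSON object per line.
    with session_file.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(event) + "\n")


def diagnose(session_file: Path, diagnosis_file: Path) -> dict:
    """Read the recorded events back and write a summary to the diagnosis file."""
    lines = session_file.read_text(encoding="utf-8").splitlines()
    events = [json.loads(line) for line in lines if line.strip()]
    summary = {
        "event_count": len(events),
        "events_by_type": dict(Counter(e["type"] for e in events)),
    }
    diagnosis_file.write_text(json.dumps(summary, indent=2), encoding="utf-8")
    return summary


def run_demo() -> dict:
    """Record three made-up events, then diagnose them after the 'session' ends."""
    with tempfile.TemporaryDirectory() as tmp:
        session = Path(tmp) / "session.jsonl"      # hypothetical file name
        diagnosis = Path(tmp) / "diagnosis.json"   # hypothetical file name
        for event in (
            {"type": "tool_call", "tool": "search"},
            {"type": "tool_call", "tool": "search"},
            {"type": "memory_recall", "ok": False},
        ):
            record_event(session, event)
        return diagnose(session, diagnosis)


if __name__ == "__main__":
    print(run_demo())
```

The point of the sketch is the separation of concerns: events are written as they happen, and the diagnosis runs only over what was recorded, so findings rest on the evidence rather than on the agent's own account.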
What Critiqor Evaluates
Critiqor scores six reliability dimensions drawn from the OpenClaw failure taxonomy:
Loop Control
Detects repeated tool calls with identical arguments — infinite tool loops
Memory Integrity
Flags failed recall, ignored memory, and lost context events
Tool Output Utilization
Identifies tool outputs that were available but unused in decisions
Context Health
Catches context saturation and compaction that can hide important state
Cost Efficiency
Measures token waste and redundant execution overhead
Skill Adherence
Detects mis-selected, ignored, or failed OpenClaw skill invocations
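To make the Loop Control dimension concrete, here is a minimal sketch of how repeated tool calls with identical arguments could be flagged from a list of recorded events. It is an assumption-laden illustration, not Critiqor's actual scoring logic: the event shape (`type`, `tool`, `args`), the function name `find_repeated_calls`, and the default threshold of 3 are hypothetical.

```python
import json
from collections import Counter


def find_repeated_calls(events, threshold=3):
    """Return tool calls that occur at least `threshold` times with identical arguments."""
    counts = Counter()
    for event in events:
        if event.get("type") != "tool_call":
            continue  # ignore non-tool events such as memory or context events
        # Serialize args with sorted keys so that key order does not affect equality.
        key = (event["tool"], json.dumps(event.get("args", {}), sort_keys=True))
        counts[key] += 1
    return [
        {"tool": tool, "args": json.loads(args), "count": n}
        for (tool, args), n in counts.items()
        if n >= threshold
    ]


if __name__ == "__main__":
    # Made-up events for illustration only.
    sample = [
        {"type": "tool_call", "tool": "search", "args": {"q": "a"}},
        {"type": "tool_call", "tool": "search", "args": {"q": "a"}},
        {"type": "tool_call", "tool": "read_file", "args": {"path": "x.txt"}},
        {"type": "tool_call", "tool": "search", "args": {"q": "a"}},
        {"type": "memory_recall", "ok": False},
    ]
    print(find_repeated_calls(sample))
```

This version counts repeats across the whole session rather than only consecutive ones. A stricter detector might require the identical calls to be adjacent, or to return identical outputs, before calling it a loop.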
Quick Navigation
Installation
Install Critiqor with pip and get your environment ready
Quick Start
Run your first monitored session end-to-end in four steps
CLI Reference
Full reference for monitor, finalize, dashboard, and runs commands
Dashboard
Understand diagnosis sections and how to switch between runs
Evidence Types
Deep dive into every evidence category Critiqor collects
OpenClaw Integration
Full setup guide for the OpenClaw plugin