Benchmarking
Compare models on the same task.
Benchmarking lets you run the same prompt across multiple models, isolate changes, and apply the best result.
Start a benchmark (TUI)
In the TUI prompt:
/benchmark startThis creates a parent session and child sessions for each model you select.
Common commands
/benchmark stop
/benchmark next
/benchmark prev
/benchmark apply
/benchmark undoShortcuts (default)
- Next session:
ctrl+shift+right - Previous session:
ctrl+shift+left - Apply changes:
ctrl+alt+a - Undo changes:
ctrl+alt+u
Typical flow
- Run
/benchmark startand choose models - Switch children with
/benchmark next//benchmark prev - Apply a child’s changes with
/benchmark apply - Undo with
/benchmark undo - Exit with
/benchmark stop