The eval system is the reward function for the RL loop. It runs before and after an agent makes changes, and the delta determines whether the change is kept.Documentation Index
Fetch the complete documentation index at: https://docs.10et.ai/llms.txt
Use this file to discover all available pages before exploring further.
Eval Flow
Eval Store
All eval results are stored in.tenet/eval.jsonl:
Viewing Eval History
Eval Snapshots
When an agent starts, TENET freezes the eval script into a snapshot (SHA-based). This ensures the eval doesn’t change mid-run — the same script measures baseline and post-change. Snapshots are cached at~/.cache/tenet/eval-snapshots/<hash>/.
Writing Good Evals
See Eval Scripts for the complete guide on writing eval scripts that produce real gradient. Key principles:- Output JSON with a primary metric
- Use
AGENT_WORKTREEfor cross-repo agents - Ensure the metric has room to improve (not at ceiling)
- Keep evals fast (under 30s) and deterministic

