Eval Flow
Eval Store
All eval results are stored in.jfl/eval.jsonl:
Viewing Eval History
Eval Snapshots
When an agent starts, TENET freezes the eval script into a snapshot (SHA-based). This ensures the eval doesn’t change mid-run — the same script measures baseline and post-change. Snapshots are cached at~/.cache/jfl/eval-snapshots/<hash>/.
Writing Good Evals
See Eval Scripts for the complete guide on writing eval scripts that produce real gradient. Key principles:- Output JSON with a primary metric
- Use
AGENT_WORKTREEfor cross-repo agents - Ensure the metric has room to improve (not at ceiling)
- Keep evals fast (under 30s) and deterministic