Train a lightweight RL model on your project’s (state, action, reward) tuples. The trained policy biases agents toward actions that historically improved YOUR metrics.Documentation Index
Fetch the complete documentation index at: https://docs.10et.ai/llms.txt
Use this file to discover all available pages before exploring further.
Usage
How It Works
- Reads
.tenet/training-buffer.jsonl— tuples from agent rounds - Extracts features: action type, files affected, scope, prior score
- Trains a small neural network to predict reward from state+action
- Saves weights to
.tenet/policy-weights.json - Future agent runs consult the policy head before choosing actions
When to Train
- After accumulating 50+ tuples (minimum for useful signal)
- After a batch of overnight agent runs
- The overnight build script runs training automatically

