How It Works
Every night at 2 AM (configurable):
- Mine tuples — Extract training data from today’s journal entries
- Synthesize context — Generate product context summary for agents
- Strategic reasoning — Stratus decides which agents to run based on ROI
- Run agents — 5 rounds per agent, ~1 hour total
- Retrain policy head — If 50+ new tuples since last training
- Pick up backlog — Grab GitHub/Linear issues for autonomous PRs
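The step order and the two numeric gates stated on this page (retraining at 50+ new tuples, and skipping an agent with 0 improvements in its last 10 runs, as described under Strategic Reasoning) can be sketched in a few lines of Python. This is an illustrative sketch only, not the actual implementation: `AgentHistory`, `has_plateaued`, `should_retrain`, `plan_night`, and the agent names are hypothetical, and the real selection step is described as ROI-based, which this sketch reduces to the plateau rule alone.

```python
from dataclasses import dataclass, field

# Values taken from this page; the names themselves are hypothetical.
RETRAIN_MIN_NEW_TUPLES = 50   # retrain if 50+ new tuples since last training
PLATEAU_WINDOW = 10           # 0 improvements in the last 10 runs => skip
ROUNDS_PER_AGENT = 5          # default rounds per agent


@dataclass
class AgentHistory:
    """Hypothetical record of an agent's runs (True = the run improved something)."""
    name: str
    improved: list[bool] = field(default_factory=list)


def has_plateaued(history: AgentHistory, window: int = PLATEAU_WINDOW) -> bool:
    """Return True when the last `window` runs produced no improvement."""
    recent = history.improved[-window:]
    # Assumption: an agent with fewer than `window` runs is never skipped.
    return len(recent) == window and not any(recent)


def should_retrain(new_tuples: int, threshold: int = RETRAIN_MIN_NEW_TUPLES) -> bool:
    """Return True when enough new tuples have accumulated to retrain."""
    return new_tuples >= threshold


def plan_night(agents: list[AgentHistory], new_tuples: int) -> list[str]:
    """Return the ordered step descriptions for one nightly run."""
    steps = ["mine tuples", "synthesize context"]
    selected = [a.name for a in agents if not has_plateaued(a)]
    steps.append("strategic reasoning: run " + (", ".join(selected) or "no agents"))
    steps += [f"run {name} x{ROUNDS_PER_AGENT} rounds" for name in selected]
    if should_retrain(new_tuples):
        steps.append("retrain policy head")
    steps.append("pick up backlog")
    return steps


if __name__ == "__main__":
    agents = [
        AgentHistory("agent-a", [False] * 10),          # plateaued, skipped
        AgentHistory("agent-b", [False] * 9 + [True]),  # improved on last run
    ]
    for step in plan_night(agents, new_tuples=62):
        print(step)
```

Keeping each gate as its own small function makes the thresholds easy to change and test independently of the scheduling logic.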
Setup
OpenClaw Cron (Recommended)
If you have OpenClaw running, add a cron job:
System Cron
Manual Run
What to Expect
First run output:
Monitoring
Check Last Night’s Results
Scorecard Output
Tuning
Rounds Per Agent
Default is 5. Increase for agents with high gradient:
Strategic Reasoning
PP uses recent agent performance to decide who runs. If an agent has plateaued (0 improvements in last 10 runs), it gets skipped automatically.
Cost Control
Each round costs ~15/night. Use --budget to cap costs: