WorldState
RLState (Policy Head Input)
The world state is converted to a compact format for the policy head:How It’s Used
- Before each round — World state captured as the “before” snapshot
- Policy head scoring — RLState fed to transformer for action ranking
- Strategic reasoning — Peter Parker uses state to decide which agents to run
- Training — State included in training tuples for policy head learning
State Transitions
Each agent round creates a state transition:.jfl/telemetry/resource-transitions.jsonl for analysis.