Train the policy head from accumulated training data
tenet train transform # prepare tuples for training tenet train policy-head # train the policy head tenet train policy-head --force # retrain even if recent