Think Fast! Learning to Control Online Reasoning in Stochastic Environments, Supplementary Material