Reinforcement Learning from World Feedback (RLWF): A Preliminary Concept | Synapse