What question did this study set out to answer?

This research aims to introduce the Reinforcement Learning from World Feedback (RLWF) concept, emphasizing its unique learning process.

March 25, 2026Open Access

Reinforcement Learning from World Feedback (RLWF): A Preliminary Concept

Key Points

This research aims to introduce the Reinforcement Learning from World Feedback (RLWF) concept, emphasizing its unique learning process.
Introduced the RLWF framework by contrasting it with RLHF.
Explored the development of intelligence in biological neural networks.
Outlined the types of world feedback that influence learning.
Highlighted the distinction between RLWF and RLHF in learning models.
Established that RLWF includes various feedback types: physical, sensory, biochemical, emotional, and social.
Indicated potential implications of RLWF for advancing anthropomorphic AGI concepts.

Abstract

This paper introduces the concept of Reinforcement Learning from World Feedback (RLWF) to describe the continuous, embodied, and grounded learning process through which biological neural networks develop intelligence. Unlike Reinforcement Learning from Human Feedback (RLHF), which applies approval-based fine-tuning to a frozen artificial neural network architecture, RLWF begins at conception, approximately nine months before birth, and continues throughout the lifespan of the organism. The feedback signal in RLWF encompasses the full spectrum of world feedback: physical, sensory, biochemical, emotional, and social, including early social and approval signals from caregivers, all grounded in real consequences and inseparable from the co-evolving biological architecture that receives them. This distinction has profound implications for the anthropomorphic AGI project and for understanding the fundamental grounding gap between biological and artificial intelligence.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

T. Bass

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Reinforcement Learning from World Feedback (RLWF): A Preliminary Concept

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider