July 1, 2016 · Open Access

Active inference and learning

Abstract

This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity.
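The split between risk-sensitive (pragmatic) and ambiguity-sensitive (epistemic) behaviour can be illustrated with a small numerical sketch. It assumes the decomposition of expected free energy commonly used in the active inference literature, risk plus ambiguity, which is not spelled out in the abstract itself. The function names, the likelihood matrix `A`, and the preference vector `prior_o` are hypothetical choices made for this illustration and are not taken from the paper.

```python
import math


def kl(q, p):
    """KL divergence D[q || p] for discrete distributions (natural log)."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


def entropy(p):
    """Shannon entropy of a discrete distribution (natural log)."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)


def expected_free_energy(qs, A, prior_o):
    """Return (risk, ambiguity) for one future time step under one policy.

    qs      : predicted distribution over hidden states under the policy
    A       : likelihood, A[o][s] = P(outcome o | state s)  (assumed known)
    prior_o : preferred (prior) distribution over outcomes
    """
    n_o, n_s = len(A), len(qs)
    # Predicted outcomes: Q(o) = sum_s P(o | s) Q(s)
    qo = [sum(A[o][s] * qs[s] for s in range(n_s)) for o in range(n_o)]
    # Risk: divergence of predicted outcomes from preferred outcomes
    risk = kl(qo, prior_o)
    # Ambiguity: expected entropy of outcomes given states
    ambiguity = sum(
        qs[s] * entropy([A[o][s] for o in range(n_o)]) for s in range(n_s)
    )
    return risk, ambiguity


# An unambiguous (identity) likelihood: every state maps to one outcome.
A_clear = [[1.0, 0.0], [0.0, 1.0]]
# A maximally ambiguous likelihood: outcomes say nothing about states.
A_vague = [[0.5, 0.5], [0.5, 0.5]]

qs = [0.5, 0.5]
prior_o = [0.5, 0.5]

risk_clear, amb_clear = expected_free_energy(qs, A_clear, prior_o)
risk_vague, amb_vague = expected_free_energy(qs, A_vague, prior_o)
```

Under the identity likelihood the ambiguity term is zero, so only the risk term distinguishes policies. This is the regime the abstract refers to when it says the scheme reduces to a classical one in the absence of ambiguity. With the uninformative likelihood the ambiguity term equals ln 2 for this two-outcome example.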

Authors

Karl Friston

Thomas H. B. FitzGerald

Francesco Rigoli

Journals

Neuroscience & Biobehavioral Reviews

Institutions

University College London

California Institute of Technology

National Hospital for Neurology and Neurosurgery
