Key points are not available for this paper at this time.
This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity.
Building similarity graph...
Analyzing shared references across papers
Loading...
Karl Friston
Thomas H. B. FitzGerald
Francesco Rigoli
Neuroscience & Biobehavioral Reviews
University College London
California Institute of Technology
National Hospital for Neurology and Neurosurgery
Building similarity graph...
Analyzing shared references across papers
Loading...
Friston et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69dbc02250e1971baba3c70e — DOI: https://doi.org/10.1016/j.neubiorev.2016.06.022
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: