Los puntos clave no están disponibles para este artículo en este momento.
In this article we hypothesise that intelligence, and its associated abilities, can be understood as subserving the maximisation of reward. Accordingly, reward is enough to drive behaviour that exhibits abilities studied in natural and artificial intelligence, including knowledge, learning, perception, social intelligence, language, generalisation and imitation. This is in contrast to the view that specialised problem formulations are needed for each ability, based on other signals or objectives. Furthermore, we suggest that agents that learn through trial and error experience to maximise reward could learn behaviour that exhibits most if not all of these abilities, and therefore that powerful reinforcement learning agents could constitute a solution to artificial general intelligence.
Building similarity graph...
Analyzing shared references across papers
Loading...
David Silver
Satinder Singh
Doina Precup
Artificial Intelligence
Building similarity graph...
Analyzing shared references across papers
Loading...
Silver et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69e9cfd2186fc979e9a819e7 — DOI: https://doi.org/10.1016/j.artint.2021.103535