August 12, 2024Open Access

Body Transformer: Leveraging Robot Embodiment for Policy Learning

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

In recent years, the transformer architecture has become the de facto standard for machine learning algorithms applied to natural language processing and computer vision. Despite notable evidence of successful deployment of this architecture in the context of robot learning, we claim that vanilla transformers do not fully exploit the structure of the robot learning problem. Therefore, we propose Body Transformer (BoT), an architecture that leverages the robot embodiment by providing an inductive bias that guides the learning process. We represent the robot body as a graph of sensors and actuators, and rely on masked attention to pool information throughout the architecture. The resulting architecture outperforms the vanilla transformer, as well as the classical multilayer perceptron, in terms of task completion, scaling properties, and computational efficiency when representing either imitation or reinforcement learning policies. Additional material including the open-source code is available at https: //sferrazza. cc/botₛite.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Sferrazza et al. (Mon,) studied this question.

www.synapsesocial.com/papers/68e5ca6db6db64358756096e — DOI: https://doi.org/10.48550/arxiv.2408.06316

Authors

Carmelo Sferrazza

Dun-Ming Huang

Fangchen Liu

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Body Transformer: Leveraging Robot Embodiment for Policy Learning

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion