Key points are not available for this paper at this time.
Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology. Learning from visual observations further exacerbates this difficulty. In this work, we explore highly data-driven approaches to visual whole-body humanoid control based on reinforcement learning, without any simplifying assumptions, reward design, or skill primitives. Specifically, we propose a hierarchical world model in which a high-level agent generates commands based on visual observations for a low-level agent to execute, both of which are trained with rewards. Our approach produces highly performant control policies in 8 tasks with a simulated 56-DoF humanoid, while synthesizing motions that are broadly preferred by humans. Code and videos: https://nicklashansen.com/rlpuppeteer
Building similarity graph...
Analyzing shared references across papers
Loading...
Hansen et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68e68232b6db64358760b908 — DOI: https://doi.org/10.48550/arxiv.2405.18418
Nicklas Hansen
S V Jyothir
Vlad Sobal
Building similarity graph...
Analyzing shared references across papers
Loading...