June 25, 2024Open Access

Human-Object Interaction from Human-Level Instructions

Key Points

Key points are not available for this paper at this time.

Abstract

Intelligent agents need to autonomously navigate and interact within contextual environments to perform a wide range of daily tasks based on human-level instructions. These agents require a foundational understanding of the world, incorporating common sense and knowledge, to interpret such instructions. Moreover, they must possess precise low-level skills for movement and interaction to execute the detailed task plans derived from these instructions. In this work, we address the task of synthesizing continuous human-object interactions for manipulating large objects within contextual environments, guided by human-level instructions. Our goal is to generate synchronized object motion, full-body human motion, and detailed finger motion, all essential for realistic interactions. Our framework consists of a large language model (LLM) planning module and a low-level motion generator. We use LLMs to deduce spatial object relationships and devise a method for accurately determining their positions and orientations in target scene layouts. Additionally, the LLM planner outlines a detailed task plan specifying a sequence of sub-tasks. This task plan, along with the target object poses, serves as input for our low-level motion generator, which seamlessly alternates between navigation and interaction modules. We present the first complete system that can synthesize object motion, full-body motion, and finger motion simultaneously from human-level instructions. Our experiments demonstrate the effectiveness of our high-level planner in generating plausible target layouts and our low-level motion generator in synthesizing realistic interactions for diverse objects. Please refer to our project page for more results: https://hoifhli.github.io/.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Wu et al. (Tue,) studied this question.

www.synapsesocial.com/papers/68e636c5b6db6435875c8cfb — DOI: https://doi.org/10.48550/arxiv.2406.17840

Authors

Zhen Wu

Jiaman Li

C. Karen Liu

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Human-Object Interaction from Human-Level Instructions

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion