What type of study is this?

This is a Experimental Study study.

October 3, 2025Open Access

Compose by Focus: Scene Graph-based Atomic Skills

Key Points

This framework enhances the robustness of atomic skills in robotics through scene graph representation, improving task success.
Experiments showed that the integration with graph neural networks and diffusion-based learning led to significantly higher success rates.
The focus on task-relevant objects in scene graphs mitigates issues observed with traditional visuomotor policies under variations.
The approach combines focused scene graph skills with a vision-language model for better task planning and execution.

Abstract

A key requirement for generalist robots is compositional generalization - the ability to combine atomic skills to solve complex, long-horizon tasks. While prior work has primarily focused on synthesizing a planner that sequences pre-learned skills, robust execution of the individual skills themselves remains challenging, as visuomotor policies often fail under distribution shifts induced by scene composition. To address this, we introduce a scene graph-based representation that focuses on task-relevant objects and relations, thereby mitigating sensitivity to irrelevant variation. Building on this idea, we develop a scene-graph skill learning framework that integrates graph neural networks with diffusion-based imitation learning, and further combine "focused" scene-graph skills with a vision-language model (VLM) based task planner. Experiments in both simulation and real-world manipulation tasks demonstrate substantially higher success rates than state-of-the-art baselines, highlighting improved robustness and compositional generalization in long-horizon tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Qi Han

Changhe Chen

Heng Yang

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Compose by Focus: Scene Graph-based Atomic Skills

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider