What type of study is this?

This is a Quantitative Study study.

October 13, 2025Open Access

RoboDexVLM: Visual Language Model-Enabled Task Planning and Motion Control for Dexterous Robot Manipulation

Key Points

RoboDexVLM significantly enhances dexterous robot manipulation in long-horizon scenarios, with improved task execution.
The framework employs visual language models to interpret natural language commands for complex tasks.
A novel grasp perception algorithm aids in robust detection of diverse objects, enhancing the robot's dexterity.
Experimental validation shows that RoboDexVLM excels in challenging environments, demonstrating its potential for practical applications.

Abstract

This paper introduces RoboDexVLM, an innovative framework for robot task planning and grasp detection tailored for a collaborative manipulator equipped with a dexterous hand. Previous methods focus on simplified and limited manipulation tasks, which often neglect the complexities associated with grasping a diverse array of objects in a long-horizon manner. In contrast, our proposed framework utilizes a dexterous hand capable of grasping objects of varying shapes and sizes while executing tasks based on natural language commands. The proposed approach has the following core components: First, a robust task planner with a task-level recovery mechanism that leverages vision-language models (VLMs) is designed, which enables the system to interpret and execute open-vocabulary commands for long sequence tasks. Second, a language-guided dexterous grasp perception algorithm is presented based on robot kinematics and formal methods, tailored for zero-shot dexterous manipulation with diverse objects and commands. Comprehensive experimental results validate the effectiveness, adaptability, and robustness of RoboDexVLM in handling long-horizon scenarios and performing dexterous grasping. These results highlight the framework's ability to operate in complex environments, showcasing its potential for open-vocabulary dexterous manipulation. Our open-source project page can be found at https://henryhcliu.github.io/robodexvlm.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Haichao Liu

Sikai Guo

Patrick Mai

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

RoboDexVLM: Visual Language Model-Enabled Task Planning and Motion Control for Dexterous Robot Manipulation

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider