April 15, 2026 | Open Access

Multi-stage hierarchical multi-agent reinforcement learning for UAV-quadruped completing search and rescue

Key Points

The research aims to improve search and rescue operations using a collaborative system of UAVs and quadruped robots through advanced reinforcement learning techniques.
Developed the multi-stage hierarchical multi-agent reinforcement learning method (MHMARL).
Evaluated the system in a high-fidelity simulation environment using Isaac Lab.
Compared performance with traditional end-to-end MARL training.
Achieved more stable training processes with MHMARL compared to direct MARL.
Demonstrated reliable coordination between UAV and quadruped robots in various SAR scenarios.
Showed potential for successful exploration and target localization in complex environments.

Abstract

Search and rescue (SAR) operations in post-disaster environments often involve complex terrain, limited visibility, and high safety risks for human responders. While unmanned aerial vehicles (UAVs) are effective for rapid exploration and target localization, their limited payload capacity and endurance restrict their ability to perform sustained ground-level tasks. Quadruped robots, in contrast, are well suited to traversing unstructured terrain and carrying equipment, but they typically lack global situational awareness. Building on the multi-agent reinforcement learning (MARL) framework and the hierarchical multi-stage reinforcement learning (HMRL) algorithm, this paper proposes a multi-stage hierarchical multi-agent reinforcement learning method (MHMARL). The method trains a heterogeneous multi-agent collaborative system consisting of a UAV and a robotic dog (ANYmal-C), enabling the UAV to guide the dog through SAR operations in post-disaster rescue scenarios. The proposed approach is evaluated in a high-fidelity simulation environment built on Isaac Lab. Experimental results demonstrate that, compared with direct end-to-end MARL training, the proposed method achieves more stable training and reliable UAV-quadruped coordination across multiple SAR scenarios.

Authors

Chuan Chen

Shuhan Yan

Xinliang Zhou

Institutions

Nanyang Technological University

Beijing Jiaotong University
