What question did this study set out to answer?

How to improve the efficiency of multi-robot collaborative search while addressing sparse rewards and control oscillations.

April 10, 2026 | Open Access

Hierarchical Active Perception and Stability Control for Multi-Robot Collaborative Search in Unknown Environments

Key Points

Aimed to improve multi-robot collaborative search efficiency while addressing sparse rewards and control oscillations.
Developed a hierarchical active perception framework using multi-agent deep deterministic policy gradient (HAP-MADDPG).
Implemented global utility planning and local information aggregation to improve target discovery.
Introduced a stability control mechanism utilizing hysteresis logic and reward decay.
Achieved a success rate of 96.25% in target capture.
Reduced average search time to 216.3 steps.
Demonstrated smooth path trajectories, indicating effective navigation.
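
The global and local exploration rates named in the key points can be illustrated with a minimal sketch. The summary does not define these quantities, so this assumes a boolean occupancy-style grid in which the global rate is the fraction of explored cells on the whole map and the local rate is the same fraction inside a square window around a robot. The function names, the window shape, and the `radius` parameter are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: the grid representation, function names, and
# square window are assumptions, not the paper's definitions.

def global_exploration_rate(explored):
    """Fraction of all map cells that have been explored."""
    total = sum(len(row) for row in explored)
    seen = sum(1 for row in explored for cell in row if cell)
    return seen / total


def local_exploration_rate(explored, row, col, radius):
    """Fraction of explored cells in a square window centred on (row, col).

    The window is clipped at the map borders.
    """
    n_rows, n_cols = len(explored), len(explored[0])
    r0, r1 = max(row - radius, 0), min(row + radius + 1, n_rows)
    c0, c1 = max(col - radius, 0), min(col + radius + 1, n_cols)
    window = [explored[r][c] for r in range(r0, r1) for c in range(c0, c1)]
    return sum(1 for cell in window if cell) / len(window)


# 10 x 10 map whose top five rows have been explored.
grid = [[r < 5 for _ in range(10)] for r in range(10)]

global_rate = global_exploration_rate(grid)          # half the map is explored
local_near = local_exploration_rate(grid, 0, 0, 2)   # robot inside explored area
local_far = local_exploration_rate(grid, 9, 9, 2)    # robot inside unexplored area
```

Under these assumptions, a planner could use the global rate to decide where on the map to send robots and the local rate to decide how much new information remains near each robot. How HAP-MADDPG actually combines the two is described in the full paper, not here.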

Abstract

Multi-robot systems (MRS) have attracted considerable research attention because they can be applied in a wide range of environments. However, multi-robot collaborative search tasks often suffer from two problems: sparse rewards for capturing targets, and control oscillations. To address these issues, this paper proposes the hierarchical active perception multi-agent deep deterministic policy gradient (HAP-MADDPG) framework. The framework guides robots to explore maps and discover targets efficiently by combining global utility planning, based on the global exploration rate, with local information aggregation, based on the local exploration rate. A stability control mechanism comprising hysteresis logic and reward decay suppresses control oscillations. Experimental results show that HAP-MADDPG achieves a target-capture success rate of 96.25% and an average search time of 216.3 steps. The resulting path trajectories are smooth, which supports the effectiveness of the proposed approach.
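
To make the two stability ingredients concrete, the following is a minimal sketch of generic hysteresis logic and a generic exponential reward decay. The abstract does not say which signal the hysteresis is applied to, what thresholds are used, or what form the decay takes, so the class `HysteresisSwitch`, the function `decayed_reward`, the thresholds 0.4 and 0.6, and the exponential form are all assumptions chosen for illustration.

```python
# Illustrative sketch only: names, thresholds, and the exponential decay form
# are assumptions, not the mechanism specified in the paper.

class HysteresisSwitch:
    """Two-threshold switch: turns on above `high`, turns off below `low`.

    Values between the thresholds leave the state unchanged, which prevents
    rapid toggling when the input hovers around a single cut-off.
    """

    def __init__(self, low, high, initial=False):
        if low >= high:
            raise ValueError("low must be strictly less than high")
        self.low = low
        self.high = high
        self.state = initial

    def update(self, value):
        if self.state and value < self.low:
            self.state = False
        elif not self.state and value > self.high:
            self.state = True
        return self.state


def decayed_reward(base_reward, step, decay=0.99):
    """Reward that shrinks geometrically with the number of elapsed steps."""
    return base_reward * decay ** step


switch = HysteresisSwitch(low=0.4, high=0.6)
signal = [0.45, 0.55, 0.65, 0.55, 0.45, 0.35, 0.45]
states = [switch.update(v) for v in signal]
# states == [False, False, True, True, True, False, False]
```

A single threshold at 0.5 would flip the state four times on this signal, whereas the two-threshold switch flips only twice. That is the general sense in which hysteresis damps oscillation; a decaying reward similarly reduces the incentive to keep repeating the same behaviour over many steps.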
