March 3, 2026Open Access

A multi-UAV cooperative path planning method based on HCA-MAPPO deep reinforcement learning

Key Points

The proposed method improves cooperative path planning efficiency for multi-UAV systems, enhancing exploration in sparse reward scenarios.
Key improvements include a hierarchical cross-attention network that processes heterogeneous information, boosting situational awareness.
Modeling the planning problem as a partially observable Markov decision process allows effective algorithmic solutions.
Simulation experiments validate the effectiveness and superiority of the proposed approach in real-world applications.

Abstract

针对多无人机协同路径规划中存在的感知信息处理扁平化、稀疏奖励下探索效率低下等挑战, 本文提出了一种新的基于多智能体深度强化学习的协同路径规划方法. 本方法将多无人机协同路径规划问题建模为部分可观测的马尔可夫决策过程, 并采用深度强化学习算法进行求解. 为了增强智能体对多源异构观测信息的特征提取与态势感知能力, 本文设计了一种基于分层交叉注意力的执行者网络, 通过序贯处理流程实现了对多源异构信息的优先级感知与处理. 此外, 构建了一种基于协同一致性门控的内在好奇心模块, 将个体探索与集群协同目标有效耦合, 提升了算法在稀疏奖励环境下的有效学习效率. 最后, 通过仿真实验验证了所提出算法的有效性和优越性.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Zhenxi Lin

Zhengyu Guo

Jian Zhang

Journals

Scientia Sinica Technologica

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

A multi-UAV cooperative path planning method based on HCA-MAPPO deep reinforcement learning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study