홈
탐색
nav.journalClub
트렌드
더보기
synapse
⌘+K
언어
한국어
CMA-MAPPO: Integrating Covariance Matrix Adaptation Evolution Strategy with Multi-Agent Proximal Policy Optimization for enhanced exploration in sparse-reward environments | Synapse
March 3, 2026
CMA-MAPPO: Integrating Covariance Matrix Adaptation Evolution Strategy with Multi-Agent Proximal Policy Optimization for enhanced exploration in sparse-reward environments
AK
A.H. Khatami
Key Points
Improved exploration in sparse-reward environments was achieved through the CMA-MAPPO method.
Key evidence shows significant enhancement in performance metrics under specific setups.
Theoretical model integrates covariance matrix adaptation and multi-agent proximal policy optimization for effective learning.
This new method may enable better performance in complex multi-agent scenarios; however, further validation is necessary.
Mark Helpful
Like
Save
Bookmark
Relay
Share
Mark Helpful
Like
Save
Bookmark
Relay
Share
Cite This Study
Copy
A.H. Khatami (Tue,) studied this question.
synapsesocial.com/papers/69a761bfc6e9836116a2fce2
https://doi.org/https://doi.org/10.1016/j.swevo.2026.102330