Key points are not available for this paper at this time.
Sample efficiency is a critical challenge in reinforcement learning. Model-based RL has emerged as a solution, but its application has largely been confined to single-agent scenarios. In this work, we introduce CoDreamer, an extension of the Dreamer algorithm for multi-agent environments. CoDreamer leverages Graph Neural Networks for a two-level communication system to tackle challenges such as partial observability and inter-agent cooperation. Communication is separately utilised within the learned world models and within the learned policies of each agent to enhance modelling and task-solving. We show that CoDreamer offers greater expressive power than a naive application of Dreamer, and we demonstrate its superiority over baseline methods across various multi-agent environments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Toledo et al. (Wed,) studied this question.
www.synapsesocial.com/papers/68e642a2b6db6435875d4677 — DOI: https://doi.org/10.48550/arxiv.2406.13600
Edan Toledo
Amanda Prorok
Building similarity graph...
Analyzing shared references across papers
Loading...