This presentation introduces a new learning technique for planning in cooperative multi-agent systems (MAS), proposing a taxonomy for MAS based on rationality and optimality, and formally defining cooperative, competitive, and mixed matrix games (MGs). It presents the Cooperative Multi-agent Markov Decision Process (CMMDP) as a mathematical framework and introduces the Extended-Q algorithm, which integrates reinforcement learning with game-theoretic equilibrium concepts like Nash equilibrium to solve coordination problems. The algorithm is extended to handle weakly competitive scenarios and is enhanced with neural network-based generalization (Neuro-Extended-Q) for large state spaces. Experimental validation using grid games demonstrates its effectiveness, while future work includes convergence proofs, extensions to competitive MAS, partial observability, and improved exploration techniques.
Building similarity graph...
Analyzing shared references across papers
Loading...
Walid Gomaa (Fri,) studied this question.
www.synapsesocial.com/papers/6966f31d13bf7a6f02c00ccd — DOI: https://doi.org/10.5281/zenodo.18202514
Walid Gomaa
Egypt-Japan University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...