Q-CMAPO: A quantum-classical framework for balancing exploration and exploitation in multi-agent reinforcement learning | Synapse