Collaborative optimisation of multi-agent reinforcement learning in enterprise digital supply chain | Synapse