A reinforcement learning approach to solve the stochastic Container Trans-Marshalling Problem | Synapse