Modelling herd behaviour in traffic jams using Markov chains-based reinforcement learning | Synapse