Key points are not available for this paper at this time.
In this paper, we investigate the application of two reinforcement learning methods, known as the Dueling Double Deep Q-Network and Soft Actor-Critic to discover bus scheduling strategies and compare them against conventional approaches. In particular, we look into real-time control strategies where buses may choose to stay or leave at bus stops. We explore both waiting time and travel time as the optimization objectives. The results for uniform bus frequency show that average waiting time can be reduced by allowing buses to stay longer at stops with higher passengers’ arrival rate but at the cost of increased average travel time. This is also supported by our analytical calculation on a theoretical bus loop model. We then apply our method to a model based on a real world bus loop in Nanyang Technological University. The results highlight the potential benefit of reinforcement learning methods to find novel strategies that can be better than conventional approaches. The similar performance of the two distinct reinforcement learning methods also serves as independent verification of the validity of the strategies obtained. This is an extended version of our ICCS 2025 conference paper “Bus Loop Scheduling with Dueling Double Deep Q Network” Pradana and Chew (2025), with the main addition of the application of the Soft Actor-Critic method which has to be modified to handle the optimization problem described in this paper.
Building similarity graph...
Analyzing shared references across papers
Loading...
Andri Pradana
Lock Yue Chew
Journal of Computational Science
Nanyang Technological University
Building similarity graph...
Analyzing shared references across papers
Loading...
Pradana et al. (Wed,) studied this question.
www.synapsesocial.com/papers/6a0a541d5b6facdebcb4e779 — DOI: https://doi.org/10.1016/j.jocs.2026.102807