This article investigates the optimal tracking control problem for multiplayer differential game systems (MDGS) with unknown dynamics. A two-stage asynchronous learning scheme is proposed that reaches Nash equilibrium solutions without requiring initial admissible control policies. In the first stage, stabilizing control policies are constructed through a homotopy-based iterative process. In the second stage, an asynchronous policy iteration (PI) method is employed in which players update their policies sequentially using partial real-time information, which improves convergence efficiency compared with synchronous approaches. The scheme is then extended to a data-driven framework, removing the need for explicit knowledge of the system dynamics. Convergence is proven theoretically under stabilizability and detectability conditions. Two simulation examples demonstrate the effectiveness of the proposed method in tracking a sinusoidal reference, and comparison experiments highlight its advantages.
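To make the sequential update idea concrete, the following is a minimal sketch of one possible reading of asynchronous policy iteration, not the authors' algorithm. It assumes a scalar linear-quadratic regulation game with known dynamics, dx/dt = a x + sum_i b_i u_i, costs J_i = ∫ (q_i x² + r_i u_i²) dt, and linear policies u_i = -k_i x. It also assumes the initial gains are already stabilizing, so it corresponds only to the second stage. Tracking and the data-driven extension are omitted. All function and parameter names are hypothetical.

```python
def async_policy_iteration(a, b, q, r, k, sweeps=50):
    """Sequential (Gauss-Seidel style) policy iteration for a scalar LQ game.

    Assumed model (illustrative only): dx/dt = a*x + sum_i b[i]*u_i,
    u_i = -k[i]*x, cost_i = integral of q[i]*x^2 + r[i]*u_i^2.
    The initial gains k must make the closed loop stable.
    """
    k = list(k)
    p = [0.0] * len(k)
    for _ in range(sweeps):
        for i in range(len(k)):
            # Closed-loop pole using the latest gains of all players,
            # including those already updated earlier in this sweep.
            a_cl = a - sum(bj * kj for bj, kj in zip(b, k))
            if a_cl >= 0:
                raise ValueError("closed loop is not stable; gains are not admissible")
            # Policy evaluation: solve 2*a_cl*p_i + q_i + r_i*k_i^2 = 0 for p_i.
            p[i] = (q[i] + r[i] * k[i] ** 2) / (-2.0 * a_cl)
            # Policy improvement for player i only.
            k[i] = b[i] * p[i] / r[i]
    return p, k


def riccati_residuals(a, b, q, r, p):
    """Residuals of the coupled scalar Riccati equations at value weights p."""
    k = [bi * pi / ri for bi, pi, ri in zip(b, p, r)]
    a_cl = a - sum(bi * ki for bi, ki in zip(b, k))
    return [2.0 * a_cl * pi + qi + ri * ki ** 2
            for pi, qi, ri, ki in zip(p, q, r, k)]


if __name__ == "__main__":
    a, b, q, r = -1.0, [1.0, 1.0], [1.0, 2.0], [1.0, 1.0]
    p, k = async_policy_iteration(a, b, q, r, k=[0.0, 0.0])
    print("value weights:", p)
    print("feedback gains:", k)
    print("Riccati residuals:", riccati_residuals(a, b, q, r, p))
```

In this sketch each player evaluates its cost against the most recent gains of the others, so the second player already reacts to the first player's update within the same sweep. A synchronous variant would instead evaluate all players against the gains from the previous sweep.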
Qing Yang
Anhui University of Technology
Jiacheng Wu
State Key Laboratory of Industrial Control Technology
Yì Wáng
University of Stuttgart
IEEE Transactions on Cybernetics
Yeungnam University
Anhui University of Technology
State Key Laboratory of Industrial Control Technology
Yang et al. studied this question.
synapsesocial.com/papers/69d894ce6c1944d70ce05b29 — DOI: https://doi.org/10.1109/tcyb.2026.3678031