What question did this study set out to answer?

To improve the efficiency and thoroughness of software testing using reinforcement learning techniques.

April 18, 2026

STRL: a reinforcement learning framework for efficient and comprehensive software testing

Key Points

To improve the efficiency and thoroughness of software testing using reinforcement learning techniques.
Developed a novel framework named STRL using reinforcement learning principles.
Employed the Proximal Policy Optimization (PPO) algorithm for dynamic testing strategy.
Tested and compared STRL against traditional automated and manual testing methods.
STRL significantly improves state coverage in software testing.
Reduced testing time compared to manual and traditional automated testing.
Enhanced identification of critical states and transitions during testing.

Abstract

Reinforcement learning (RL) has become a significant research focus in machine learning due to its ability to generate dynamic data through interaction with the environment, without requiring large-scale labeled datasets. This characteristic makes RL particularly suitable for applications where data is scarce or difficult to obtain. In the realm of software testing, traditional methods such as regression testing often suffer from long execution times and low state coverage, which can hinder the efficiency and thoroughness of the testing process. To address these challenges, this paper proposes a novel software testing framework named STRL (Software Testing with Reinforcement Learning). The framework employs the Proximal Policy Optimization (PPO) algorithm, a powerful RL technique known for its efficiency in balancing exploration and exploitation. PPO enables STRL to dynamically adapt its testing strategy based on real-time feedback from the software environment, thereby optimizing the testing process. Experimental results show that STRL significantly improves state coverage and reduces testing time compared to both manual testing and traditional automated script testing. By leveraging RL, STRL can more effectively identify critical states and transitions, leading to more comprehensive and efficient testing outcomes. This study demonstrates the potential of RL in enhancing software testing and suggests that STRL could serve as a valuable tool for improving the quality and efficiency of software development processes.

Bookmark

Cite This Study

Hanwen et al. (Wed,) studied this question.

synapsesocial.com/papers/69e3209340886becb653fb3a https://doi.org/https://doi.org/10.1049/icp.2026.0963

Bookmark