• Integrates TD(0) RL into BACO for adaptive heuristic learning. • Extremized probabilistic path enhances exploration-exploitation. • Validated on 21 datasets (14–2000 features), 30 runs, 10-fold cross-validation. • Outperformed 7 state-of-the-art FS algorithms. • Maintains high precision, recall and F1 with <10 As an essential preprocessing technique in machine learning and pattern recognition, feature selection has become a focus of attention. Its primary goal is to select a subset of features that contain rich information in order to reduce dimensionality and improve accuracy. Intelligent algorithms have been successfully applied to feature selection, but their key parameters cannot be efficiently and dynamically adjusted during the computation process, resulting in the iteration stagnating in a local optimum or failing to converge. In this study, a novel feature selection wrapper method is proposed based on the ant colony optimization algorithm called TPACO. Unlike traditional methods, TPACO dynamically learns and updates heuristic information from experience during the search process, enabling more effective exploration of feature combinations. In addition, an extremized probability-based ant path strategy is introduced to guide feature selection toward higher-quality subsets. Experimental results on 21 benchmark classification datasets demonstrate that TPACO consistently outperforms seven state-of-the-art or classical FS methods in terms of accuracy and stability. These findings highlight the effectiveness of combining reinforcement learning with swarm intelligence for adaptive feature selection.
Hou et al. (Sun,) studied this question.