Convolutional neural network (CNN) pruning has traditionally relied on heuristically designed importance criteria, often leading to limited generalizability and inconsistent performance. In this article, we propose a novel framework centered around filter replacement (FR), introducing pruning as a process of replacing selected filters with zero filters. Through a rigorous analysis, we derive an upper bound on the absolute error in the output of the subsequent layer and use this bound to define an efficient importance function. This importance function exhibits -weakly submodular properties, enabling the development of a simple, low-complexity, and data-free oblivious algorithm for selecting filters to prune. In addition, we extend the FR framework to include nonzero filter alternatives, leveraging a best-approximation technique to construct optimal replacements for the pruned filters. Extensive experiments on benchmark networks and datasets validate the effectiveness of our method. The proposed approach achieves state-of-the-art results, with a complexity comparable to basic techniques such as l₂ -norm pruning. Notably, our pruning method achieves 76. 52% accuracy (ACC) in ResNet-50 on the ImageNet dataset, surpassing the baseline of 75. 15%, while reducing network parameters by 25. 5%. Our proposed resource efficiency (RE) metric assesses that the layer interdependence-aware pruning (LIAP) method is up to 10^11 times more efficient than existing techniques, setting a new standard for resource-aware CNN pruning.
Tofigh et al. (Thu,) studied this question.