Action recognition has long been a fundamental and compelling problem in the field of computer vision. However, one aspect that has been overlooked so far is that current action recognition approaches often produce an unfavourable multi-peaked distribution when identifying the action class of a given motion sequence, which is ambiguous and hard to learn for neural networks. Moreover, current methods heavily rely on neural networks to extract action features for differentiating actions, lacking theoretical constraints ensuring that action-specific features are selectively extracted and ambiguous features common to multiple actions are effectively reduced. These shortcomings culminate in inadequate action recognition accuracy. Motivated by this, in this paper we seek to tackle the problem from three aspects: (1) We try to eliminate ambiguity by enforcing a smooth single-peaked distribution instead of a multi-peaked one for action-class prediction. (2) We theoretically analyze the lower bound of the label prediction log-likelihood and derive a training objective, which focuses on the extraction of action-specific features and the reduction of ambiguous features. (3) We further advocate feeding the model with richer information, including positive information like body-part structures and negative information like masked inputs. Empirically, our approach sets the new state-of-the-art performance on five large-scale benchmarks. Our code is released at https://github.com/ActionR-Group/DPM to facilitate future research and is available in the supplementary material.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yingying Jiao
Haipeng Chen
Yingda Lyu
IEEE Transactions on Image Processing
Jilin University
Zhejiang University of Technology
Zhejiang University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Jiao et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69d0ae68659487ece0fa45cd — DOI: https://doi.org/10.1109/tip.2026.3678019