Unsupervised video anomaly detection (VAD) methods learn from normal data and identify anomalies as deviations from the learned patterns. However, they often struggle to model multi-scale features and to distinguish between normal and abnormal instances. To address these limitations, we propose a Multi-scale U-shaped Adaptive Clustering Learning (MS-UACL) framework. We build on the U-Net architecture and redesign it as an autoencoder with a 3D encoder and a 2D decoder. In the encoder, we introduce a Dual-scale Feature Cascading Module (IDCN), which adopts a pseudo-branch fusion mechanism to model multi-scale spatiotemporal features and thereby strengthen the model’s representational capability. To sharpen the distinction between normal and anomalous patterns, we propose an MLP-based Adaptive Clustering Algorithm (MLP-ACA). Specifically, MLP-ACA employs an initial mapping mechanism to align cluster centers with the underlying normal data distribution, facilitating more accurate feature reconstruction. Additionally, we introduce an adaptive clustering update strategy that optimizes the cluster centers by tuning only the parameters of the MLP. This lets the cluster centers converge toward optimal feature representations on their own, accelerating clustering convergence and improving pattern separability. Extensive experiments on three benchmark datasets demonstrate that the proposed MS-UACL framework outperforms most existing methods on small- and medium-scale datasets.
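The idea of updating cluster centers only through the parameters of an MLP can be illustrated with a small, self-contained sketch. It is not the authors' MLP-ACA implementation; everything in it is an assumption made for illustration: the toy 2D features, the fixed seed vectors sampled from normal data (standing in for the initial mapping), the residual form `center = seed + MLP(seed)`, the nearest-center squared-distance loss, and all function names.

```python
import math
import random


def mlp_centers(seeds, params):
    """Return (hidden activations, centers) with center_k = seed_k + MLP(seed_k)."""
    W1, b1, W2, b2 = params
    hiddens, centers = [], []
    for seed in seeds:
        h = [math.tanh(sum(w * s for w, s in zip(row, seed)) + b)
             for row, b in zip(W1, b1)]
        out = [sum(w * x for w, x in zip(row, h)) + b for row, b in zip(W2, b2)]
        hiddens.append(h)
        centers.append([s + o for s, o in zip(seed, out)])
    return hiddens, centers


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def anomaly_score(feature, centers):
    """Squared distance to the nearest center (higher means more anomalous)."""
    return min(sq_dist(feature, c) for c in centers)


def clustering_loss(features, seeds, params):
    _, centers = mlp_centers(seeds, params)
    return sum(anomaly_score(f, centers) for f in features) / len(features)


def train_step(features, seeds, params, lr=0.02):
    """One gradient step on the MLP parameters only; the seeds stay fixed."""
    W1, b1, W2, b2 = params
    hiddens, centers = mlp_centers(seeds, params)
    dim, n = len(seeds[0]), len(features)
    # Gradient of the loss with respect to each center (hard assignment).
    grad_c = [[0.0] * dim for _ in seeds]
    for f in features:
        k = min(range(len(centers)), key=lambda i: sq_dist(f, centers[i]))
        for d in range(dim):
            grad_c[k][d] += 2.0 * (centers[k][d] - f[d]) / n
    # Backpropagate through the MLP, accumulating over all seeds.
    gW1 = [[0.0] * dim for _ in W1]
    gb1 = [0.0] * len(b1)
    gW2 = [[0.0] * len(b1) for _ in W2]
    gb2 = [0.0] * dim
    for seed, h, g in zip(seeds, hiddens, grad_c):
        for o in range(dim):
            gb2[o] += g[o]
            for j in range(len(h)):
                gW2[o][j] += g[o] * h[j]
        for j in range(len(h)):
            gh = sum(g[o] * W2[o][j] for o in range(dim)) * (1.0 - h[j] ** 2)
            gb1[j] += gh
            for i in range(dim):
                gW1[j][i] += gh * seed[i]
    # Apply the update in place, after all gradients are computed.
    for j in range(len(b1)):
        b1[j] -= lr * gb1[j]
        for i in range(dim):
            W1[j][i] -= lr * gW1[j][i]
    for o in range(dim):
        b2[o] -= lr * gb2[o]
        for j in range(len(b1)):
            W2[o][j] -= lr * gW2[o][j]


def run_demo(steps=300, hidden=4, rng_seed=0):
    """Train on two toy 'normal' blobs; return (loss before, loss after, centers)."""
    rng = random.Random(rng_seed)
    blobs = [(0.0, 0.0), (3.0, 3.0)]
    features = [[cx + rng.gauss(0, 0.3), cy + rng.gauss(0, 0.3)]
                for cx, cy in blobs for _ in range(50)]
    # Assumed initial mapping: one sampled normal feature per blob as a seed.
    seeds = [features[0][:], features[50][:]]
    params = (
        [[rng.uniform(-0.1, 0.1) for _ in range(2)] for _ in range(hidden)],
        [0.0] * hidden,
        [[rng.uniform(-0.1, 0.1) for _ in range(hidden)] for _ in range(2)],
        [0.0, 0.0],
    )
    before = clustering_loss(features, seeds, params)
    for _ in range(steps):
        train_step(features, seeds, params)
    after = clustering_loss(features, seeds, params)
    _, centers = mlp_centers(seeds, params)
    return before, after, centers
```

Calling `run_demo()` returns the clustering loss before and after training together with the learned centers; `anomaly_score` then gives a simple distance-based score for a new feature. In this sketch the centers are never stored as free parameters, so any change to them comes from the MLP weights alone.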
Shaoming Qiu
Lin He
Hanhan Dang
Electronics
Dalian University
Chinese People's Liberation Army
Jiangsu Provincial Academy of Environmental Science
Qiu et al. studied this question.
www.synapsesocial.com/papers/69d8968f6c1944d70ce08128 — DOI: https://doi.org/10.3390/electronics15081558