What question did this study set out to answer?

The aim is to enhance vehicle classification performance in low-resolution surveillance settings.

March 3, 2026Open Access

Vehicle Classification in Low-Resolution Surveillance Images Using RepViT and KernelWarehouse with Composite Loss

Key Points

The aim is to enhance vehicle classification performance in low-resolution surveillance settings.
Proposed the KRepIncep-AF model using InceptionNeXt-Tiny backbone and RepViT modules.
Applied a compound loss function combining linear adaptive cross-entropy and focal loss.
Conducted comparative experiments with a vehicle dataset of six classes at 100 × 100 pixels resolution.
Achieved an accuracy rate of 99.58%.
Macro-average F1, precision, and recall values exceeded 99.5%.
Outperformed several competitive baselines.

Abstract

Vehicle classification within low-resolution surveillance scenarios remains a challenging task due to the subtle differences between classes and the lack of clear visual cues. This study aimed to improve vehicle classification performance under low-resolution surveillance scenarios.To this end, we proposed KRepIncep-AF, a convolutional neural network model that employed the backbone of InceptionNeXt-Tiny, RepViT modules, and a KernelWarehouse block for prioritized assimilation of spatial cues and contextual information. A compound loss function that combined linear adaptive cross-entropy and focal loss was applied to effectively address class imbalance and reinforce robustness. Comparative experiments were carried out using a vehicle dataset consisting of six classes and a resolution of 100 × 100 pixels. The proposed model attained an outstanding accuracy rate of 99.58%, with macro-average F1, precision, and recall values exceeding 99.5%, and outperformed several competitive baselines. These results demonstrate the effectiveness of the proposed architecture in constrained surveillance environments. Visual examination via heatmaps further established that the model highlighted silhouette-specific features such as bumpers and trailers. These observations indicated that improvements in model structure and the domain-specific application of loss functions could lead to considerable gains in classification accuracy, with meaningful implications for real-world traffic surveillance scenarios.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

H. Zhang

Stony Brook University

Zhihong Fan

Journals

Tehnicki vjesnik - Technical Gazette

Actions

Institutions

Stony Brook University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Vehicle Classification in Low-Resolution Surveillance Images Using RepViT and KernelWarehouse with Composite Loss

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study