Despite over two decades of advancement in object detection, achieving high accuracy for small target detection in practical applications remains an unresolved challenge. This paper proposes a novel small-object detection model to address this issue. The model incorporates three key innovations: first, the RCSOSA module, which optimizes feature information transmission through dynamic channel interaction and multi-scale feature coordination; second, the HFPN module, a three-branch multi-scale feature fusion network that integrates local and global features by combining CNN and Transformer architectures to enhance semantic details; and third, the NWD-CIoU loss function, which dynamically adjusts the weights of NWD and CIoU losses based on the training phase. Experimental results on the COCO dataset demonstrate that our model improves detection accuracy by 4% over YOLOv11 and achieves state-of-the-art performance among mainstream models while maintaining a real-time inference speed of no less than 60 FPS. Furthermore, validation on the VisDrone dataset confirms the model’s strong generalization capability. The proposed algorithm significantly enhances small target detection accuracy, effectively mitigating a critical limitation in current practical object detection applications.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69df2ae6e4eeef8a2a6afe0b — DOI: https://doi.org/10.3390/a19040306
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:
Hong Zhang
Runzhen Liu
Zeyi Zhu
Algorithms
Civil Aviation University of China
Building similarity graph...
Analyzing shared references across papers
Loading...