Detecting small unmanned aerial vehicles (UAVs) in complex airspace presents significant challenges due to their minimal pixel footprint, resemblance to birds, and frequent occlusion. To address these issues, we propose YOLOv11-ResCBAM, a novel real-time detection framework that integrates a Residual Convolutional Block Attention Module (ResCBAM) and a high-resolution P2 detection head into the YOLOv11 architecture. ResCBAM enhances channel and spatial feature refinement while preserving original feature contexts through residual connections, and the P2 head maintains fine spatial details crucial for small-object localization. Evaluated on a custom dataset of 4917 images (11,733 after augmentation) across three classes (drone, bird, airplane), our model achieves a mean average precision at the 0.5–0.95 IoU threshold (mAP@0.5–0.95) of 0.845, representing a 7.9% improvement over the baseline YOLOv11n, while maintaining real-time inference at 50.51 FPS. Cross-dataset validation on VisDrone2019-DET and UAVDT benchmarks demonstrates promising generalization trends. This work demonstrates the effectiveness of the proposed approach for UAV surveillance systems, balancing detection accuracy with computational efficiency for deployment in security-critical environments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chuang Han
Md Redwan Ullah
Amrul Kayes
Journal of Imaging
Southwest Jiaotong University
Hebei University of Technology
Harbin University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Han et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69be37726e48c4981c677161 — DOI: https://doi.org/10.3390/jimaging12030140
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: