Accurate and efficient detection is pivotal for tiny objects in remote sensing. However, achieving a favorable accuracy-efficiency trade-off remains challenging due to the few informative pixels of small targets, frequent occlusions, cluttered backgrounds, and detail degradation introduced by downsampling and multi-scale fusion. To address these challenges, we propose WEYOLO, a wavelet-enhanced detector that explicitly models frequency components and adaptively strengthens high-frequency cues to improve tiny-object robustness while maintaining competitive efficiency in inference speed and model size for remote-sensing deployment. To preserve edges and textures when spatial resolution is reduced, we design a Frequency-Aware Lifting Haar (FaLH) backbone that decomposes features into directional sub-bands and retains them during downsampling, preventing the loss of high-frequency information. Next, to address the blurring and detail loss caused by conventional pooling during multi-scale fusion, we introduce a Frequency-Domain Pyramid-Pooling (FDPP) module that performs wavelet-based multi-resolution analysis for frequency-aware feature-pyramid fusion. Additionally, we propose a stable size-aware quality focal regression loss that unifies Focaler-CIoU and size-aware DFL into a single objective, improving robustness and overall accuracy for small objects. Comprehensive experiments show that WEYOLO improves precision and recall over the baseline by 3.2%/4.2% on VisDrone and 2.6%/9.7% on TT100K; on AI-TOD, it achieves 47.5% mAP@0.5 and 21.3% mAP@0.5:0.95. Meanwhile, it reduces the parameter count by 60%, achieving a strong accuracy-efficiency balance for practical aerial sensing deployment.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xu et al. (Wed,) studied this question.
synapsesocial.com/papers/69d896166c1944d70ce074bb — DOI: https://doi.org/10.3390/rs18081109
Weifan Xu
Yong Hu
Remote Sensing
Nanjing University of Aeronautics and Astronautics
Anhui Institute of Information Technology
Building similarity graph...
Analyzing shared references across papers
Loading...