Rotation invariance is essential for precise object level segmentation in UAV aerial imagery, where targets can have arbitrary orientations and exhibit fine scale details. Conventional segmentation architectures like UNet rely on convolution operators that are not rotation-invariant, leading to degraded segmentation accuracy across varying viewpoints. Rotation invariance can be achieved by expanding the filter bank across multiple orientations; however, this significantly increases computational cost and memory requirement. In this paper, we introduce a GPU-optimized rotation-invariant convolution framework that eliminates the traditional data lowering (im2col) step required for matrix multiplication based convolution. By exploiting structured data sharing among symmetrically rotated filters, our method achieves multi-orientation convolution with greatly reduced memory requirements and computational redundancy. We further generalize the approach to accelerate convolution with arbitrary (non-symmetric) rotation angles. Integrated into a UNet segmentation model, the framework yields up to a 5.7% improvement in accuracy over the non-rotation-aware baseline. Across extensive benchmarks, the proposed convolution achieves 20–57% faster training and 15–45% lower energy consumption than cuDNN, while maintaining accuracy comparable to state-of-the-art rotation-invariant methods. Because the scatter-based operator greatly reduces intermediate feature dimensionality, the efficiency of our design also enables practical sixteen-orientation convolution and pooling, yielding further accuracy gains that are infeasible for conventional rotation-invariant implementations. Our sixteen-orientation approach achieves competitive accuracy on multiple datasets compared with state-of-the-art UAV segmentation networks. These results demonstrate that the proposed method provides an effective and efficient alternative to existing rotation-invariant convolution frameworks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Manduhu et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69a76175c6e9836116a2f76c — DOI: https://doi.org/10.1109/tgrs.2026.3664979
Manduhu Manduhu
Alexander Dow
Gerard Dooly
IEEE Transactions on Geoscience and Remote Sensing
University of Glasgow
University of Limerick
Building similarity graph...
Analyzing shared references across papers
Loading...