What question did this study set out to answer?

The aim is to improve oriented object detection in RGB-infrared imagery by integrating semantic information during fusion.

March 2, 2026Open Access

SGFNet: Semantic-Guided Fusion Network with Closed-Loop Feedback for RGB-Infrared Oriented Object Detection

Read Full Paperexternally

Key Points

The aim is to improve oriented object detection in RGB-infrared imagery by integrating semantic information during fusion.
Developed SGFNet with three main modules: Frequency-aware Disentanglement Module, Semantic-Guided Module, Adaptive Geometric Convolution.
Utilized RGB-IR datasets from DroneVehicle benchmark with 28,439 image pairs.
Applied detection-level feedback to enhance the fusion process for oriented objects.
Achieved 82.0% mean average precision (mAP) at 0.5 IoU, outperforming the previous leading method by 3.2 percentage points.
Reduced mean angular error from 7.4° to 6.2°, indicating a 16% improvement in accuracy.

Abstract

In oriented object detection from drone imagery, many existing RGB-infrared (RGB-IR) fusion methods derive modality weights from input statistics alone, without regard for downstream detection objectives. We present SGFNet, a Semantic-Guided Fusion Network that feeds detection-level semantics back into the fusion stage through learned importance masks. SGFNet comprises three modules: (1) a Frequency-aware Disentanglement Module (FDM) that separates high-frequency textures from low-frequency thermal structures through Laplacian and Gaussian filtering; (2) a Semantic-Guided Module (SGM) that generates P5-level semantic masks to steer fusion toward detection-critical regions; and (3) an Adaptive Geometric Convolution (AGC) whose rotation-aware sampling matches receptive fields to arbitrarily oriented objects. On the DroneVehicle benchmark (28,439 RGB-IR pairs, five vehicle categories), SGFNet achieves 82.0% mAP@0.5, surpassing the runner-up DMM by 3.2 percentage points while lowering mean angular error from 7.4° to 6.2° (−16%). Ablation analysis attributes the largest single-module gain (+1.7 pp) to the semantic feedback path.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Liang Zhang

Yueqiu Jiang

Wei Yang

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

SGFNet: Semantic-Guided Fusion Network with Closed-Loop Feedback for RGB-Infrared Oriented Object Detection

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study