What question did this study set out to answer?

May 20, 2026 | Open Access

A Learnable Feature Processing Front-End Based Multimodal Fusion Network for SAR Ship Classification

Key Points

This research aims to enhance ship classification in synthetic aperture radar (SAR) imagery using a novel multimodal fusion network.
Developed a multimodal fusion network (LFPF-MFN) built on a learnable feature preprocessing front-end for improved feature extraction.
Integrated polarimetric, textural, and geometric information in a single framework.
Implemented a bidirectional cross-attention mechanism for effective multimodal fusion.
Achieved state-of-the-art performance in three-class and six-class ship classification tasks.
Validated the effectiveness of each LFPF-MFN module through extensive experiments on the OpenSARShip 2.0 dataset.

Abstract

Ship classification in synthetic aperture radar (SAR) imagery is essential for maritime surveillance but remains challenging due to limited resolution, insufficient textural details, and difficulties in effectively fusing multimodal information. Existing methods either rely on handcrafted features with limited adaptability or employ simplistic fusion strategies that fail to fully exploit the complementary guidance across modalities. To address these issues, we propose a multimodal fusion network based on a learnable feature preprocessing front-end (LFPF-MFN), which integrates polarimetric, textural, and geometric information in an end-to-end learnable manner. Specifically, LFPF-MFN introduces a learnable preprocessing front-end to embed scattering and enhanced textural features. Meanwhile, geometric information from the Automatic Identification System (AIS) is incorporated through textual embedding, and effective multimodal fusion is achieved via a bidirectional cross-attention mechanism. Extensive experiments on the OpenSARShip 2.0 dataset demonstrate that the proposed method achieves state-of-the-art performance in both three-class and six-class classification tasks, validating the effectiveness of each designed module and the superiority of the multimodal fusion strategy.
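The abstract names bidirectional cross-attention as the fusion step but gives no architectural details, so the following is only a minimal sketch of the general technique, not the authors' implementation. It assumes two token sequences of equal feature width (one standing in for image-derived features, one for embedded AIS text), uses plain scaled dot-product attention with identity projections, and adds residual connections and mean pooling purely for illustration. All function names and dimensions here are hypothetical.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply matrix a (n x k) by matrix b (k x m), both lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def mean_pool(tokens):
    """Average a token sequence (n x d) into a single d-dimensional vector."""
    return [sum(col) / len(tokens) for col in zip(*tokens)]


def cross_attention(queries, context):
    """Each query token attends over all context tokens.

    Learned query/key/value projections are omitted (identity), which is a
    simplifying assumption of this sketch.
    """
    d = len(queries[0])
    scores = matmul(queries, transpose(context))  # (n_q x n_c)
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, context), weights


def bidirectional_cross_attention(image_tokens, text_tokens):
    """Run cross-attention in both directions and fuse the results.

    Direction 1: image tokens query the text tokens.
    Direction 2: text tokens query the image tokens.
    The residual add and mean-pool-then-concatenate fusion are assumptions.
    """
    img_att, w_img = cross_attention(image_tokens, text_tokens)
    txt_att, w_txt = cross_attention(text_tokens, image_tokens)
    img_out = [[a + b for a, b in zip(r, s)] for r, s in zip(image_tokens, img_att)]
    txt_out = [[a + b for a, b in zip(r, s)] for r, s in zip(text_tokens, txt_att)]
    fused = mean_pool(img_out) + mean_pool(txt_out)  # list concatenation, length 2*d
    return fused, w_img, w_txt


# Usage with random stand-in features: 3 image tokens and 2 text tokens, width 4.
random.seed(0)
image_tokens = [[random.gauss(0, 1) for _ in range(4)] for _ in range(3)]
text_tokens = [[random.gauss(0, 1) for _ in range(4)] for _ in range(2)]
fused, w_img, w_txt = bidirectional_cross_attention(image_tokens, text_tokens)
print(len(fused))
```

In a trained network the fused vector would feed a classification head; here it only shows how each modality is re-weighted by the other before the two are combined.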


Cite This Study

Wang et al. studied this question.

synapsesocial.com/papers/6a0d5098f03e14405aa9c8a9 https://doi.org/10.3390/rs18101610
