Remote sensing (RS) images, characterized by their large size and rich texture, require algorithms capable of effectively integrating both global and local features for compression. However, existing Learned Image Compression (LIC) approaches face distinct bottlenecks. While Transformer-based architectures typically suffer from heavy computational loads, standard State Space Models (SSMs) often incur prohibitive memory costs when processing high-resolution inputs. To address these limitations, we propose MambaLIC, a novel RS image compression network that integrates the efficient long-range modeling of SSMs with the local modeling ability of CNNs. In this paper, we introduce an innovative Remote Sensing State Space Model (RS-SSM) module, which combines visual SSM with dynamic convolution for remote sensing image compression. This integration facilitates effective interaction between local and global information, thereby enhancing the performance of RS image compression. Furthermore, we propose an SSM attention-based (SSA-based) spatial-channel context model for better entropy modeling. Compared to Transformer-CNN mixed architectures, MambaLIC reduces computational complexity by 63.9% and achieves superior rate-distortion (RD) performance. Consequently, compared to the latest SS2D-based method MambaIC, MambaLIC achieves substantial efficiency gains, saving 78.8% in memory usage. Experimental results demonstrate that MambaLIC achieves state-of-the-art (SOTA) performance, outperforming VVC (VTM-17.0) by 14.22%, 18.48%, and 17.47% in BD-rate on UC-Merced, LoveDA, and xView datasets, respectively.
Building similarity graph...
Analyzing shared references across papers
Loading...
Haobo Xiong
Kai Liu
Huachao Xiao
Remote Sensing
Xidian University
China Academy of Space Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Xiong et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69b4ba0818185d8a398028bb — DOI: https://doi.org/10.3390/rs18060881