What question did this study set out to answer?

The aim is to develop a computationally efficient network for image super-resolution while maintaining high accuracy.

March 22, 2026Open Access

A Swin-style shifted pooling cross-aggregation network for efficient image super-resolution

Key Points

The aim is to develop a computationally efficient network for image super-resolution while maintaining high accuracy.
Proposed the Swin-style shifted pooling cross-aggregation network (SPCAN) for image super-resolution.
Utilized max pooling for downsampling to replace window-based self-attention (WSA).
Implemented a shifted pooling mechanism within a CNN framework to emulate the Swin transformer's capabilities.
Introduced a cross-aggregation module to enhance inter-region feature interaction.
Achieved competitive reconstruction accuracy on public super-resolution benchmarks.
Demonstrated significant improvement in computational efficiency over existing state-of-the-art methods.

Abstract

Abstract Swin transformer-based methods have achieved impressive performance in image super-resolution (SR) due to their ability to effectively model long-range spatial dependencies. However, the core component, window-based self-attention (WSA), introduces considerable computational overhead, which limits their applicability on resource-constrained devices. To address these issues, we propose a Swin-style shifted pooling cross-aggregation network (SPCAN) for image SR, which achieves high computational efficiency while maintaining excellent reconstruction quality. Specifically, we adopt max pooling-based downsampling as a lightweight alternative to WSA for extracting low-frequency features and introduce a shifted pooling mechanism that emulates the shifted window strategy of Swin transformers within a convolutional neural network (CNN) framework. This mechanism is embedded within a cross-aggregation module to facilitate efficient inter-region feature interaction. Moreover, we generalize the pooling operation from square to rectangular regions to enhance the model’s ability to capture spatial dependencies across different orientations. Extensive experiments on public SR benchmarks demonstrate that the proposed method achieves competitive reconstruction accuracy while offering significantly better efficiency compared with existing state-of-the-art methods. The source code and pretrained models are available at: https://github.com/hms-source/SPCAN .

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Rui He

Zhenyang Zhu

Xiaoyang Mao

Journals

The Visual Computer

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

A Swin-style shifted pooling cross-aggregation network for efficient image super-resolution

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study