What type of study is this?

This is a quantitative study.

October 17, 2025

Open Access

DiNAT-IR: Exploring Dilated Neighborhood Attention for High-Quality Image Restoration

Key Points

DiNAT-IR improves image restoration by integrating local and global context effectively.
DiNAT-IR achieves competitive results across multiple image restoration benchmarks.
The approach utilizes dilated neighborhood attention to expand the receptive field without significant overhead.
Incorporating channel-aware modules aids in maintaining pixel-level precision during the restoration process.

Abstract

Transformers, with their self-attention mechanisms for modeling long-range dependencies, have become a dominant paradigm in image restoration tasks. However, the high computational cost of self-attention limits scalability to high-resolution images, making efficiency-quality trade-offs a key research focus. To address this, Restormer employs channel-wise self-attention, which computes attention across channels instead of spatial dimensions. While effective, this approach may overlook localized artifacts that are crucial for high-quality image restoration. To bridge this gap, we explore Dilated Neighborhood Attention (DiNA) as a promising alternative, inspired by its success in high-level vision tasks. DiNA balances global context and local precision by integrating sliding-window attention with mixed dilation factors, effectively expanding the receptive field without excessive overhead. However, our preliminary experiments indicate that directly applying this global-local design to the classic deblurring task hinders accurate visual restoration, primarily due to the constrained global context understanding within local attention. To address this, we introduce a channel-aware module that complements local attention, effectively integrating global context without sacrificing pixel-level precision. The proposed DiNAT-IR, a Transformer-based architecture specifically designed for image restoration, achieves competitive results across multiple benchmarks, offering a high-quality solution for diverse low-level computer vision problems.
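The two attention patterns contrasted in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names, the single-head layout, the masking of out-of-bounds neighbors, and the `1/sqrt(H*W)` scaling in the channel variant are all assumptions made for illustration. The first function restricts each query pixel to a small grid of neighbors spaced `dilation` pixels apart, so a larger dilation widens the receptive field at the same cost. The second builds a C x C attention map across channels, which sees the whole image but has no notion of spatial locality.

```python
import numpy as np


def dilated_neighborhood_attention(q, k, v, kernel_size=3, dilation=1):
    """Single-head dilated neighborhood attention on (H, W, C) feature maps.

    Each query attends to a kernel_size x kernel_size grid of neighbors
    spaced `dilation` pixels apart. Out-of-bounds neighbors are simply
    dropped (an assumption of this sketch; real implementations may
    handle borders differently).
    """
    H, W, C = q.shape
    r = kernel_size // 2
    offsets = [(dy * dilation, dx * dilation)
               for dy in range(-r, r + 1) for dx in range(-r, r + 1)]
    scale = 1.0 / np.sqrt(C)
    out = np.zeros(v.shape, dtype=np.float64)
    for i in range(H):
        for j in range(W):
            # Keep only neighbors that fall inside the feature map.
            coords = [(i + dy, j + dx) for dy, dx in offsets
                      if 0 <= i + dy < H and 0 <= j + dx < W]
            ys, xs = np.array(coords).T
            keys, vals = k[ys, xs], v[ys, xs]        # (n, C) each
            logits = (keys @ q[i, j]) * scale        # (n,)
            logits = logits - logits.max()           # numerical stability
            weights = np.exp(logits)
            weights = weights / weights.sum()        # softmax over neighbors
            out[i, j] = weights @ vals
    return out


def channel_attention(q, k, v):
    """Channel-wise attention: a C x C map instead of an HW x HW map.

    The 1/sqrt(H*W) scaling is an assumption of this sketch.
    """
    H, W, C = q.shape
    qc = q.reshape(H * W, C).T                       # (C, HW)
    kc = k.reshape(H * W, C).T
    vc = v.reshape(H * W, C).T
    logits = (qc @ kc.T) / np.sqrt(H * W)            # (C, C)
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=1, keepdims=True)
    return (weights @ vc).T.reshape(H, W, C)


# Mixed dilation factors: a dense local pass, a sparse wide pass, and a
# channel-wise pass for global context, all on the same toy feature map.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8, 4))
local = dilated_neighborhood_attention(x, x, x, kernel_size=3, dilation=1)
wide = dilated_neighborhood_attention(x, x, x, kernel_size=3, dilation=2)
channel = channel_attention(x, x, x)
```

With `kernel_size=3`, the `dilation=2` pass spans a 5 x 5 region while still attending to at most nine neighbors per pixel, which is the sense in which dilation expands the receptive field without extra attention cost.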

Cite this study

Liu et al. studied this question.

www.synapsesocial.com/papers/68f19f20de32064e504dddd8 — DOI: https://doi.org/10.48550/arxiv.2507.17892

Authors

Hanzhou Liu

Binghan Li

Chengkai Liu
