What question did this study set out to answer?

The aim is to improve image restoration from adverse weather by utilizing explicit semantic and perceptual guidance.

February 14, 2026Open Access

Towards Adaptive Adverse Weather Removal via Semantic and Low-Level Visual Perceptual Priors

Key Points

The aim is to improve image restoration from adverse weather by utilizing explicit semantic and perceptual guidance.
Developed AWR-VIP, a prior-guided framework for adverse weather removal.
Employed a frozen vision–language model to extract semantic and weather-type information.
Generated global and local restoration guidance based on identified semantics.
AWR-VIP outperformed existing state-of-the-art methods in image restoration.
The VLM-derived priors can be integrated into other restoration systems for enhanced performance.
Restoration process effectively addressed issues of over-smoothing and residual artifacts.

Abstract

Adverse weather removal aims to restore images degraded by haze, rain, or snow. However, existing unified models often rely on implicit degradation cues, making them vulnerable to inaccurate weather perception and insufficient semantic guidance, which leads to over-smoothing or residual artifacts in real scenes. In this work, we propose AWR-VIP, a prior-guided adverse weather removal framework that explicitly extracts semantic and perceptual priors using a frozen vision–language model (VLM). Given a degraded input, we first employ a degradation-aware prompt extractor to produce a compact set of semantic tags describing key objects and regions, and simultaneously perform weather-type perception by prompting the VLM with explicit weather definitions. Conditioned on the predicted weather type and selected tags, the VLM further generates two levels of restoration guidance: a global instruction that summarizes image-level enhancement goals (e.g., visibility/contrast) and local instructions that specify tag-aware refinement cues (e.g., recover textures for specific regions). These textual outputs are encoded by a text encoder into a pair of priors (Pglobal and Plocal), which are injected into a UNet-based restorer through global-prior-modulated normalization and instruction-guided attention, enabling weather-adaptive and content-aware restoration. Extensive experiments on a combined benchmark show that AWR-VIP consistently outperforms state-of-the-art methods. Moreover, the VLM-derived priors are plug-and-play and can be integrated into other restoration backbones to further improve performance.

Bookmark

View Full Paper

Cite This Study

Dong et al. (Thu,) studied this question.

synapsesocial.com/papers/6990112b2ccff479cfe57a95 https://doi.org/https://doi.org/10.3390/make8020045

Bookmark

View Full Paper