With the increasing demand for high-quality imaging in consumer electronics, image aesthetics assessment (IAA) has been widely applied to electronic cameras and display devices. Although the deformable attention mechanism has been introduced into IAA due to its perceptual capabilities, enabling models to refine attention regions by learning interest points and their corresponding offsets, existing methods often lack guidance from aesthetic composition features during the offset generation process, which limits their performance in aesthetic evaluation tasks. To address this issue, we propose a graph neural network (GNN)-guided deformable attention module that incorporates composition information into the generation of interest points by modeling image features as graphs and applying the GNN to guide interest point selection. In addition, we design an improved transformer model that employs neighborhood attention to further enhance IAA performance. We evaluate the proposed model on two aesthetic datasets, AVA and TAD66K, and the experimental results demonstrate its effectiveness in improving overall model performance.
Building similarity graph...
Analyzing shared references across papers
Loading...
Lin Li
Jesse Zhu
Mingxing Jiang
Electronics
Hefei University of Technology
Hefei University
Chaohu University
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69d893626c1944d70ce046eb — DOI: https://doi.org/10.3390/electronics15071534