What question did this study set out to answer?

The research aims to improve gaze estimation accuracy in dynamic scenes with significant head movements and gaze shifts.

April 1, 2026Open Access

DGAGaze: Gaze Estimation with Dual-Stream Differential Attention and Geometry-Aware Temporal Alignment

Key Points

The research aims to improve gaze estimation accuracy in dynamic scenes with significant head movements and gaze shifts.
Developed a gaze estimation framework called DGAGaze.
Utilized a difference-driven spatiotemporal attention mechanism.
Implemented a geometry-aware temporal alignment module for head movement compensation.
Used pose estimation and affine feature warping to decouple head and eye motion.
Conducted experiments on EyeDiap and Gaze360 datasets.
Achieved improved gaze estimation accuracy compared to state-of-the-art methods.
Maintained a lightweight architecture based on ResNet-18.
Demonstrated effectiveness under dynamic conditions with rigid head movements.

Abstract

Gaze estimation plays a crucial role in human-computer interaction and behavior analysis. However, in dynamic scenes, rigid head movements and rapid gaze shifts pose significant challenges to accurate gaze prediction. Most existing methods either process single-frame images independently or rely on long video sequences, making it difficult to simultaneously achieve strong performance and high computational efficiency. To address this issue, we propose DGAGaze, a gaze estimation framework based on a difference-driven spatiotemporal attention mechanism. This framework uses a geometry-aware temporal alignment module to mitigate interference from rigid head movements, compensating for them through pose estimation and affine feature warping, thereby achieving explicit decoupling between global head motion and local eye motion. Based on the aligned features, inter-frame differences are used to adjust spatial and channel attention weights, enhancing motion-sensitive representations without introducing an additional temporal modeling layer. Extensive experiments on the EyeDiap and Gaze360 datasets demonstrate the effectiveness of the proposed approach. DGAGaze achieves improved gaze estimation accuracy while maintaining a lightweight architecture based on a ResNet-18 backbone, outperforming existing state-of-the-art methods.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Wen Zhang

Pengcheng Li

Journals

Applied Sciences

Actions

Institutions

University of South China

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

DGAGaze: Gaze Estimation with Dual-Stream Differential Attention and Geometry-Aware Temporal Alignment

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study