What question did this study set out to answer?

The study aims to improve short-term forecasting of 3D facial landmarks by preserving the motion dynamics that models trained with pointwise reconstruction losses tend to weaken.

April 10, 2026 · Open Access

PAGF: Short-Horizon Forecasting of 3D Facial Landmarks

Key Points

Developed a peak-aware gated recurrent unit (GRU) framework.
Separated forecasting into a peak planning stage and a peak-conditioned trajectory generation stage.
Trained with reconstruction loss, peak supervision, peak-integrity regularization, and correlation-based temporal-shape regularization.
Evaluated on 3D facial landmarks from the MEAD dataset under a subject-independent protocol.
Demonstrated a clear trade-off between distortion and dynamics preservation.
Outperformed static and sequence-to-sequence baselines in maintaining peak facial dynamics.
Achieved competitive accuracy in 24-step forecasting.

Abstract

Short-term facial landmark forecasting is important for anticipatory facial behavior in human–robot interaction, yet models trained with pointwise reconstruction losses often suffer from mean reversion, producing low-error predictions with weakened motion dynamics. To address this issue, we propose a peak-aware gated recurrent unit (GRU) framework that separates forecasting into peak planning and peak-conditioned trajectory generation. The planning stage estimates the timing and intensity of a salient motion peak within the forecast horizon together with a global motion direction, and the generation stage produces short-horizon landmark displacements through temporal gating and structured motion composition. The model is trained with reconstruction loss, peak supervision, peak-integrity regularization, and correlation-based temporal-shape regularization. Experiments on the MEAD dataset using 3D facial landmarks under a subject-independent protocol show a clear distortion–dynamics trade-off. Compared with static and sequence-to-sequence baselines, the proposed method better preserves peak-related facial dynamics while maintaining competitive 24-step prediction accuracy.
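The distortion–dynamics trade-off described in the abstract can be illustrated with a toy example. The sketch below is not the paper's implementation: the data layout (a forecast as a list of per-step displacement vectors), the function names, and the exact form of each quantity are assumptions chosen for illustration. It extracts a peak's timing and intensity from a 24-step horizon and computes a correlation-based shape term as one minus the Pearson correlation of the per-step motion magnitudes.

```python
import math

def motion_magnitude(seq):
    """Per-step L2 norm of the displacement vector (hypothetical layout:
    seq[t] is a flat list of landmark coordinates at forecast step t)."""
    return [math.sqrt(sum(c * c for c in step)) for step in seq]

def peak_target(seq):
    """Return (step index, intensity) of the largest motion in the horizon."""
    mags = motion_magnitude(seq)
    t = max(range(len(mags)), key=mags.__getitem__)
    return t, mags[t]

def mse(pred, true):
    """Pointwise reconstruction error averaged over steps and coordinates."""
    total = sum((p - q) ** 2 for ps, qs in zip(pred, true) for p, q in zip(ps, qs))
    return total / sum(len(step) for step in true)

def shape_loss(pred, true, eps=1e-8):
    """Assumed form of a correlation-based shape term: 1 - Pearson r
    between the predicted and true motion-magnitude profiles."""
    a, b = motion_magnitude(pred), motion_magnitude(true)
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return 1.0 - cov / (math.sqrt(va * vb) + eps)

HORIZON = 24  # forecast length used in the abstract

def bump(center, scale=1.0, sigma=2.0):
    """Synthetic one-coordinate motion with a single peak at `center`."""
    return [[scale * math.exp(-((t - center) ** 2) / (2 * sigma ** 2))]
            for t in range(HORIZON)]

true_seq = bump(center=12)           # ground-truth motion, peak at step 12
damped = bump(center=12, scale=0.4)  # right timing, weakened peak
shifted = bump(center=16)            # full peak intensity, 4 steps late

for name, pred in [("damped", damped), ("shifted", shifted)]:
    step, intensity = peak_target(pred)
    print(name, "mse=%.3f" % mse(pred, true_seq),
          "shape=%.3f" % shape_loss(pred, true_seq),
          "peak_step=%d" % step, "peak_intensity=%.2f" % intensity)
```

In this toy setting the damped forecast has the lower pointwise error even though its peak reaches only 40% of the true intensity, which is the kind of low-error, weak-dynamics output the abstract attributes to mean reversion. Its shape term is also near zero because correlation ignores scale, so a correlation-based term on its own would not penalize the weakened peak; that is one plausible reason to pair it with explicit peak supervision.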
