We study the reconstruction of high-quality 3D head avatars. Our goal is to reduce the reliance on dense capture data required by most existing approaches, which limits their practicality. Recent advances have attempted to address this using single or few input images by either training a prior model or fine-tuning multi-view diffusion models to generate pseudo training points. However, these methods fall short in producing multi-view-consistent, high-fidelity results aligned with the input data. This motivates us to explore a more practical and user-friendly input setting. Modern smartphones such as Apple's Face ID already guide users to slowly rotate their heads in front of a single camera, enabling the capture of facial data across varying viewpoints with minimal effort. This simple and intuitive scanning motion has become a widely accepted user habit and provides sufficient geometric information-highlighting a natural opportunity for 3D head avatar creation from monocular videos of neutral expression. In this paper, we introduce R ^2 Avatar, a lightweight and user-friendly framework for generating expressive 3D head avatars under this setting. Our method adopts a Reconstruction-by-Restoration strategy that avoids large-scale model pretraining while achieving high-quality animatable avatars. Specifically, a geometry-guided warping module first synthesizes coarse expression variations from the neutral input. Then, a restoration module refines the warped results by recovering high-frequency facial details, including mouth interior, with the help of a data-driven 2D animation prior. These restored images serve as supervision targets to optimize the final avatar. Experiments demonstrate that our method produces realistic avatars with improved expression diversity and view consistency compared to baseline approaches. Our Code and data will be released upon acceptance.
Building similarity graph...
Analyzing shared references across papers
Loading...
Qing Chang
Yao-Xiang Ding
Kun Zhou
IEEE Transactions on Visualization and Computer Graphics
Zhejiang University
Building similarity graph...
Analyzing shared references across papers
Loading...
Chang et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69fd7d94bfa21ec5bbf05f58 — DOI: https://doi.org/10.1109/tvcg.2026.3690559