March 3, 2026Open Access

Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction

Key Points

Pose estimation and 3D tissue reconstruction are crucial for effective monocular minimally invasive surgery, with enhanced accuracy noted.
The proposed MAPIS-Depth module integrates robust scale initialisation with efficient per-frame depth prediction for precise outcomes.
Integration of temporal constraints and adaptive blending aids in reducing artefacts from physiological motion and tissue deformation.
The framework utilizes advanced optimization techniques, allowing for coherent 3D tissue surface extraction and reliable registration.

Abstract

Accurate endoscope pose estimation and 3D tissue reconstruction are essential for enhancing navigation and spatial awareness in monocular minimally invasive surgery. However, these tasks remain challenging due to depth ambiguity, physiological tissue deformation, inconsistent endoscope motion, limited texture fidelity, and the restricted field of view. To address these limitations, a unified monocular reconstruction framework is proposed that integrates scale-aware depth prediction with temporally constrained perceptual refinement. The proposed MAPIS-Depth module combines Depth Pro for robust scale initialisation with Depth Anything for efficient per-frame prediction, followed by L-BFGS-B optimisation to obtain pseudo-metric depth. Temporal consistency is further improved using RAFT-based pixel correspondences and LPIPS-guided adaptive blending, reducing artefacts caused by motion and deformation. For reliable registration of the synthesised pseudo-RGBD frames, the WEMA-RTDL module is introduced, which jointly optimises rotation and translation. Finally, truncated signed distance fusion and marching cubes are used to extract coherent 3D tissue surfaces. Experiments on the HEVD and SCARED datasets, supported by ablation studies and comparisons with state-of-the-art methods, demonstrate the robustness and superior accuracy of the proposed approach.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Muzammil Khan

Enzo Kerkhof

Matteo Fusaglia

Journals

IEEE Access

SHILAP Revista de lepidopterología

Actions

Institutions

The Netherlands Cancer Institute

University of Twente

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study