We propose a method for noninvasive heart rate (HR) estimation from ultra-high-definition (4K) UAV-recorded videos, leveraging general-purpose instance segmentation without retraining or manual annotation. The method integrates Fast Segment Anything (FastSAM) with silhouette tracking and signal processing techniques. To address segmentation reliability, we introduce a lightweight heuristic metric ( C s e g ) that detects abrupt changes in the region of interest, enabling the rejection of unreliable frames. Furthermore, a genetic algorithm is further employed to fine-tune preprocessing and estimation parameters. The method was evaluated on two datasets: the indoor, controlled UBFC-rPPG dataset and a challenging UAV dataset recorded outdoors. On the UBFC-rPPG dataset, our approach achieved up to 76% improvement in heart rate estimation accuracy compared to a baseline algorithm. On the UAV dataset, results indicate more modest gains, limited by motion artifacts, compression, and lighting variability. This work highlights the potential of generic foundation segmentation models in remote physiological monitoring and identifies key challenges for real-world UAV-based applications. Of particular importance here is the fact that the studied objects are moving, while previous studies mainly have focused on stationary objects. Our findings demonstrate that carefully designed segmentation and parameter tuning strategies can enhance rPPG performance in favorable conditions, while also emphasizing the need for future work in motion compensation, segmentation validation, and deployment on embedded systems. • A method is proposed for remote heart rate estimation using ultra-high-definition (4K) UAV videos that integrates general-purpose segmentation (FastSAM) with silhouette tracking. • The method was evaluated on both indoor (UBFC-rPPG) and challenging outdoor (UAV) datasets, highlighting the limits and potential of generic segmentation in practical HR monitoring scenarios. • This work highlights the potential of generic segmentation models in remote physiological monitoring and identifies key challenges for real-world UAV-based applications. • A heuristic segmentation correctness metric ( C s e g ) is introduced to filter unreliable frames without requiring ground-truth masks. • Genetic optimization is applied to tune preprocessing and estimation parameters, significantly improving accuracy on the UBFC-rPPG dataset (up to 76%).
Przybyło et al. (Mon,) studied this question.