March 3, 2026 | Open Access

Progressive semi-supervised learning for multimodal pneumonia diagnosis

Key Points

The proposed model achieves an accuracy of 97.85% and an F1-score of 97.80%, indicating strong diagnostic performance.
Contrastive and diversity losses enhance the model's generalization capability with limited labeled data during training.
The framework incorporates time-series data, spectrograms, and wavelet transforms for effective pneumonia detection.
This method could support timely pneumonia screening in low-resource environments and may influence care practices there.

Abstract

This study aimed to develop a multimodal machine learning framework for pneumonia detection from lung sound recordings, addressing the challenge of timely and affordable diagnosis in resource-limited settings. The authors designed a progressive semi-supervised learning model that processes lung sound signals in three forms: time-series data, spectrogram representations, and wavelet transforms. Contrastive and diversity losses were introduced during progressive training to improve generalization and reduce overfitting when labeled data are limited. The proposed framework achieved state-of-the-art performance, with an accuracy of 97.85% and an F1-score of 97.80%, outperforming existing unimodal and multimodal benchmarks. The approach shows strong potential as a reliable, efficient, noninvasive screening tool for pneumonia, offering robust performance with a minimal computational footprint.
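
As an illustration of the three input forms named in the abstract, the sketch below derives a time-series view, a magnitude spectrogram, and a wavelet decomposition from a single signal. It is a minimal sketch and not the authors' pipeline: the function names, the frame length and hop size, the Hann window, and the choice of the Haar wavelet are all assumptions made for illustration, and the input is a synthetic sine wave rather than a lung sound recording.

```python
import cmath
import math


def normalize(x):
    """Time-series view: remove the mean and scale to unit peak amplitude."""
    mean = sum(x) / len(x)
    centered = [v - mean for v in x]
    peak = max(abs(v) for v in centered) or 1.0
    return [v / peak for v in centered]


def magnitude_spectrogram(x, frame_len=8, hop=4):
    """Spectrogram view: Hann-windowed frames, direct DFT, magnitudes only.

    frame_len and hop are illustrative values, not taken from the paper.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    spec = []
    # Trailing samples that do not fill a whole frame are dropped.
    for start in range(0, len(x) - frame_len + 1, hop):
        frame = [x[start + n] * window[n] for n in range(frame_len)]
        bins = []
        for k in range(frame_len // 2 + 1):  # non-negative frequencies
            acc = sum(v * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n, v in enumerate(frame))
            bins.append(abs(acc))
        spec.append(bins)
    return spec


def haar_dwt(x, levels=2):
    """Wavelet view: multi-level Haar decomposition (assumed wavelet).

    Returns [approximation, detail_coarsest, ..., detail_finest].
    Assumes len(x) is divisible by 2**levels.
    """
    approx = list(x)
    details = []
    for _ in range(levels):
        pairs = range(0, len(approx) - 1, 2)
        detail = [(approx[i] - approx[i + 1]) / math.sqrt(2) for i in pairs]
        approx = [(approx[i] + approx[i + 1]) / math.sqrt(2) for i in pairs]
        details.insert(0, detail)
    return [approx] + details


def three_views(x):
    """Bundle the three representations computed from one signal."""
    series = normalize(x)
    return {
        "time_series": series,
        "spectrogram": magnitude_spectrogram(series),
        "wavelet": haar_dwt(series),
    }


# Synthetic stand-in for a recording: 32 samples of a sine wave.
signal = [math.sin(2 * math.pi * 3 * n / 32) + 0.5 for n in range(32)]
views = three_views(signal)
```

In a multimodal model each of these views would typically feed its own encoder branch; how the paper fuses the branches and applies the contrastive and diversity losses is not described in this summary, so that part is left out here.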

Cite This Study

Jobayer et al. studied this question.

synapsesocial.com/papers/69a76714badf0bb9e87df89a
https://doi.org/10.1016/j.bspc.2026.109750
