What question did this study set out to answer?

The aim was to explore the use of explainable machine learning techniques to predict voice recovery following thyroid surgery.

March 7, 2026 · Open Access

A Feasibility Study of Explainable Machine Learning on Small-Scale Postoperative Voice Data

Key Points

The aim was to explore the use of explainable machine learning techniques to predict voice recovery following thyroid surgery.
Voice recordings were collected from female patients before thyroid surgery and one month after it.
Acoustic and glottal features, such as Quasi Open Quotient and Speed Quotient, were extracted from the recordings.
Three machine learning models were applied: Random Forest, Support Vector Machines, and Logistic Regression.
Model stability and interpretability were evaluated across cross-validation folds.
SHAP analysis was used to estimate how much each feature contributed to the predictions.
Performance metrics varied from fold to fold.
Predictions should be interpreted with caution because results on small datasets are statistically fragile.
The analysis identified acoustic and glottal features relevant to predicting voice recovery.
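
The fold-to-fold variability noted above can be illustrated with a minimal sketch. It is not the study's pipeline: the data are synthetic, the sample size of 30 is an assumption, and a nearest-centroid classifier stands in for Random Forest, Support Vector Machines, and Logistic Regression so that the example needs only the Python standard library. All function names are hypothetical.

```python
import random
import statistics

def make_toy_data(n_per_class=15, seed=0):
    """Synthetic two-class, two-feature data (illustrative, not study data)."""
    rng = random.Random(seed)
    data = []
    for label, centre in ((0, 0.0), (1, 1.0)):
        for _ in range(n_per_class):
            features = [rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)]
            data.append((features, label))
    return data

def stratified_folds(data, k, seed=0):
    """Split sample indices into k folds, keeping class proportions similar."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted({y for _, y in data}):
        indices = [i for i, (_, y) in enumerate(data) if y == label]
        rng.shuffle(indices)
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds

def centroid(rows):
    """Column-wise mean of a list of feature vectors."""
    return [sum(column) / len(rows) for column in zip(*rows)]

def nearest_centroid_accuracy(train, test):
    """Fit class centroids on train, then score accuracy on test."""
    centroids = {
        label: centroid([x for x, y in train if y == label])
        for label in sorted({y for _, y in train})
    }
    correct = 0
    for x, y in test:
        predicted = min(
            centroids,
            key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])),
        )
        correct += predicted == y
    return correct / len(test)

def cross_validate(data, k=5, seed=0):
    """Return one accuracy score per held-out fold."""
    scores = []
    for held_out in stratified_folds(data, k, seed):
        held = set(held_out)
        train = [row for i, row in enumerate(data) if i not in held]
        test = [data[i] for i in held_out]
        scores.append(nearest_centroid_accuracy(train, test))
    return scores

scores = cross_validate(make_toy_data(), k=5)
print("per-fold accuracy:", [round(s, 2) for s in scores])
print("mean:", round(statistics.mean(scores), 2),
      "sd:", round(statistics.stdev(scores), 2))
```

With only six held-out samples per fold, a single misclassification moves a fold's accuracy by about 17 percentage points, which is one reason per-fold scores on small datasets are better reported individually than summarized by the mean alone.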

Abstract

Voice dysfunction is a common complication following thyroid surgery, yet the application of explainable machine learning to predicting postoperative voice recovery remains largely unexplored. We therefore examined voice recovery using acoustic, objective, and glottal features. Voice recordings were collected from female patients before surgery and one month after surgery. Acoustic and glottal parameters, including Quasi Open Quotient, Speed Quotient, age, and others, were automatically extracted from the recordings. Random Forest, Support Vector Machines, and Logistic Regression with Sequential Feature Selection were applied to examine model behavior and identify feature importance. Model stability and interpretability were evaluated across cross-validation folds. Performance metrics varied across folds, highlighting the exploratory and statistically fragile nature of predictions in small datasets. SHAP (SHapley Additive exPlanations) analysis revealed variability in feature contributions, emphasizing the need for cautious interpretation and detailed methodological reporting. Our findings provide preliminary guidance for applying explainable machine learning to small biomedical datasets and demonstrate the importance of careful methodological design.
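
SHAP attributions are grounded in Shapley values, which can be computed exactly by enumeration when there are only a few features. The sketch below is a minimal illustration under stated assumptions, not the study's analysis: `toy_model` is a hypothetical scoring function, the three inputs are invented numbers, and absent features are replaced by a baseline value, which is one common convention among several.

```python
from itertools import combinations
from math import factorial

def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance x.

    Features outside a coalition are set to their baseline value.
    The cost grows as 2**n, so this suits only a handful of features.
    """
    n = len(x)

    def value(coalition):
        # Model output when only the coalition's features take their real values.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    contributions = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = set(subset)
                total += weight * (value(coalition | {i}) - value(coalition))
        contributions.append(total)
    return contributions

def toy_model(z):
    """Hypothetical score with one interaction term (features 0 and 2)."""
    return 0.5 * z[0] - 0.3 * z[1] + 0.2 * z[0] * z[2]

x = [1.0, 2.0, 3.0]          # invented feature values for one instance
baseline = [0.0, 0.0, 0.0]   # assumed reference point

phi = shapley_values(toy_model, x, baseline)
for index, contribution in enumerate(phi):
    print(f"feature {index}: {contribution:+.3f}")

# Efficiency property: contributions sum to f(x) - f(baseline).
print("sum of contributions:", round(sum(phi), 6))
print("f(x) - f(baseline):  ", round(toy_model(x) - toy_model(baseline), 6))
```

Because the attributions depend on the fitted model, refitting on a different cross-validation fold can change them, which is consistent with the variability in feature contributions reported in the abstract.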
