This study proposed the integration of a laser Doppler vibrometer sensing with a Variational Inference with adversarial learning for Text-to-Speech-based voice conversion system to enhance automatic speech recognition for individuals with dysarthria in noisy environments. The proposed framework combines the noise robustness of laser Doppler vibrometer and generative modeling capabilities of Variational Inference with adversarial learning for Text-to-Speech to transform dysarthric speech into intelligible acoustic outputs. Experimental results demonstrated significant gains in automatic speech recognition accuracy compared with conventional acoustic methods, even at low signal-to-noise ratios. These findings establish a foundation for future clinical applications of augmentative and alternative communication systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yu-Chuan Lee
Wei-Zhong Zheng
Jia-Wei Chen
JASA Express Letters
National Yang Ming Chiao Tung University
Taipei Veterans General Hospital
Building similarity graph...
Analyzing shared references across papers
Loading...
Lee et al. (Sun,) studied this question.
www.synapsesocial.com/papers/69c37afeb34aaaeb1a67cf99 — DOI: https://doi.org/10.1121/10.0043006
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: