Speech is inevitably distorted in real-life communication, which degrades its quality. Conventionally, the perceptual quality of the speech produced by a speech system has been assessed using a time-consuming subjective metric, the mean opinion score. A number of objective metrics can be used instead of the mean opinion score to assess the perceptual quality of a speech signal. The objective of this paper is to propose and validate a new objective metric, the spectral entropy-based metric (SEM), designed to evaluate the perceptual quality and naturalness of speech by quantifying spectral coherence. While other metrics focus on intelligibility, this study aims to fill a gap in naturalness assessment. The core novelty of this work lies in offering a diagnostic perspective on spectral coherence, an indicator of speech naturalness that other metrics often do not address explicitly. To demonstrate the effectiveness of the proposed metric in evaluating perceptual quality, we consider fixed-beam and steerable-beam first-order differential microphone arrays. The proposed SEM is shown to be more sensitive than other objective metrics to spectral coherence, a predominant indicator of the naturalness of the output speech signal of a speech system.
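The abstract does not give the definition of the SEM, so the following is only a minimal sketch of the generic quantity its name points to: the normalized Shannon entropy of a short-time power spectrum. The function name `frame_spectral_entropy`, the frame length, hop size, and Hann window are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def frame_spectral_entropy(signal, frame_len=512, hop=256, eps=1e-12):
    """Normalized Shannon entropy of each frame's power spectrum.

    Generic textbook spectral entropy, NOT the paper's SEM.
    frame_len, hop, and the Hann window are assumed values.
    """
    signal = np.asarray(signal, dtype=float)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")
    window = np.hanning(frame_len)
    n_frames = 1 + (signal.size - frame_len) // hop
    entropies = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        # Treat the power spectrum as a probability distribution over bins.
        p = power / (power.sum() + eps)
        h = -np.sum(p * np.log(p + eps))
        # Divide by the maximum possible entropy so the result is near [0, 1].
        entropies[i] = h / np.log(p.size)
    return entropies

# Hypothetical test signals: a pure tone and white noise at an assumed 16 kHz rate.
fs = 16000
t = np.arange(fs) / fs
tone = np.sin(2 * np.pi * 1000 * t)
noise = np.random.default_rng(0).standard_normal(fs)

tone_entropy = frame_spectral_entropy(tone).mean()
noise_entropy = frame_spectral_entropy(noise).mean()
```

In this sketch a spectrum concentrated in a few bins (the tone) gives a value near 0, while a flat spectrum (the noise) gives a value near 1. How the paper maps such a quantity to spectral coherence and naturalness is not stated in the abstract.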
Ali Sarafnia
M. Omair Ahmad
M. N. S. Swamy
Signals
Concordia University
Sarafnia et al. studied this question.
www.synapsesocial.com/papers/69ba424e4e9516ffd37a262b — DOI: https://doi.org/10.3390/signals7020027