What question did this study set out to answer?

The research aims to enhance automatic speaker recognition by using mel-frequency cepstral coefficients and spectrum-based features.

February 6, 2026 · Open Access

Mel-frequency cepstral coefficients and spectrum based additional features in automatic speaker recognition

Key Points

Evaluated speaker recognition on two speech databases.
Used 21 mel-frequency cepstral coefficients in the feature vector.
Incorporated up to three additional features from the amplitude spectrum.
Tested on the CHAINS database and S-ADAPT emotional speech database.
Achieved a maximum mean recognition accuracy of 97.11% on the CHAINS database, using 21 MFCCs plus two additional features.
Attained 98.65% accuracy on neutral speech within the S-ADAPT database, using 21 MFCCs.
Reached a maximum mean recognition accuracy of 98.72% on the entire S-ADAPT database, also using 21 MFCCs.

Abstract

The efficiency of the proposed automatic speaker recognizer is evaluated using two speech databases. The feature vector consists of 21 mel-frequency cepstral coefficients (MFCCs), along with up to three additional features derived from the amplitude spectrum. The additional features are calculated based on the logarithm of the energy around the appropriate local maximum in the spectrum, the frequency of that maximum, and the logarithm of the energy of the maximum component in the spectrum across all frames of the observed signal. The speaker identification procedure for a closed set of speakers is tested on the Solo section of the CHAINS database and on a speech database with expressed emotions, developed within the S-ADAPT project. On the CHAINS database, the maximum mean recognition accuracy achieved is 97.11%, using a feature vector of 21 MFCCs and two additional features. On the S-ADAPT database, using a feature vector of 21 MFCCs, it is 98.65% on neutral speech and 98.72% on the entire database.
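The three spectrum-based additional features described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say which local maximum is "appropriate", how wide the region "around" it is, or how the amplitude spectrum is computed, so the sketch assumes the largest bin in each frame, a hypothetical `half_width` of one bin on each side, and a naive DFT that is only practical for short frames. The function names `amplitude_spectrum` and `spectral_extras` are illustrative.

```python
import cmath
import math


def amplitude_spectrum(frame):
    """Magnitudes of DFT bins 0..N/2, computed with a naive O(N^2) DFT."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_extras(frames, sample_rate, half_width=1, eps=1e-12):
    """Sketch of the three additional features (all choices here are assumptions).

    Returns:
      per_frame: list of (log energy around the peak, peak frequency in Hz)
      log_global_max: log of the largest single-bin energy over all frames
    """
    per_frame = []
    global_max_energy = 0.0
    for frame in frames:
        energy = [a * a for a in amplitude_spectrum(frame)]
        # Assumption: the "appropriate local maximum" is the largest bin in the frame.
        peak = max(range(len(energy)), key=energy.__getitem__)
        lo = max(0, peak - half_width)
        hi = min(len(energy), peak + half_width + 1)
        band_energy = sum(energy[lo:hi])
        peak_hz = peak * sample_rate / len(frame)
        # eps guards against log(0) on silent frames.
        per_frame.append((math.log(band_energy + eps), peak_hz))
        global_max_energy = max(global_max_energy, energy[peak])
    return per_frame, math.log(global_max_energy + eps)


# Usage: two synthetic 32-sample frames holding pure tones at DFT bins 4 and 8.
frames = [
    [math.sin(2 * math.pi * 4 * t / 32) for t in range(32)],
    [2 * math.sin(2 * math.pi * 8 * t / 32) for t in range(32)],
]
per_frame, log_global_max = spectral_extras(frames, sample_rate=8000)
print(per_frame, log_global_max)
```

With an 8000 Hz sampling rate and 32-sample frames, the two tones fall at 1000 Hz and 2000 Hz. In a full recognizer, values like these would be appended to the 21 MFCCs of each frame. The MFCC computation and the classifier are outside this sketch.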

Cite this study

Jokić et al. studied this question.

synapsesocial.com/papers/698586238f7c464f2300a1c9 — DOI: https://doi.org/10.2298/fuee2504663j

Authors

Ivan Jokić

University of Novi Sad

S. Jokić

Universitat Autònoma de Barcelona

Vlado Delić

University of Novi Sad

Journals

Facta universitatis - series Electronics and Energetics

Institutions

University of Novi Sad

University of Nis

Telekom Srbija (Serbia)
