What question did this study set out to answer?

The aim is to create a word sense disambiguation method for the low-resource Chechen language to enhance speech synthesis.

June 3, 2026Open Access

Context-Oriented Method for Resolving Lexical Ambiguities in Speech Synthesis for a Low-Resource Language

Key Points

The aim is to create a word sense disambiguation method for the low-resource Chechen language to enhance speech synthesis.
Developed three algorithms: AWEN, AWA, and AWN for word sense disambiguation.
Compiled a corpus of 15,035 sentences from 5 million annotated words reflecting Chechen linguistic features.
Integrated the homonym recognition module into the Chechen speech synthesis system.
AWN achieved an F1-score of 0.78 and an accuracy of 0.80, outperforming AWA (F1: 0.74) and AWEN (F1: 0.40).
F1-scores of AWN for specific parts of speech: 0.82 for nouns, 0.83 for verbs, and 0.85 for adverbs.
AWN ranked second after ViConBERT (F1: 0.87) among existing methods for low-resource languages.

Abstract

Disambiguation resolution in speech synthesis is one of the main challenges in text-to-speech conversion. Machine learning methods and artificial neural networks have been successfully applied to this problem in synthesis systems for English, Spanish, and other common languages. For low-resource languages, the available data are insufficient to train artificial neural networks, so heuristic methods for context analysis and selection of the correct homonym for polysemantic words should be used. The purpose of this study is to develop a word sense disambiguation (WSD) method for the low-resource Chechen language and to introduce it into a speech synthesis system. The study presents the developed method and three algorithms: AWEN (based on Euclidean distance), AWA (weighted average), and AWN (weighted normalized distance) for word sense disambiguation. A corpus of Chechen texts, CheWSData, was compiled, containing 15,035 manually selected sentences derived from 5 million annotated words and reflecting the natural frequency of polysemy across grammatical categories. Experimental results show that the proposed AWN method achieves the best performance, with an F1-score of 0.78 and an accuracy of 0.80, outperforming AWA (F1: 0.74) and AWEN (F1: 0.40). For specific parts of speech, AWN reaches F1-scores of 0.82 for nouns, 0.83 for verbs, and 0.85 for adverbs. Comparative analysis with existing WSD methods for low-resource languages (Kashmiri, Hausa, Assamese, Urdu, and Vietnamese) demonstrates that AWN is competitive, ranking second after ViConBERT (F1: 0.87) and ahead of XLM-R for Hausa (F1: 0.79). The developed software module for homonym recognition was integrated into the Chechen speech synthesis system, contributing to more natural synthesized speech.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper