The Algorithms of Tajik Speech Synthesis by Syllable

Khurshed A. Khudoyberdiev

doi:10.1051/itmconf/20203507003


Issue: ITM Web Conf., Volume 35 (2020), International Forum “IT-Technologies for Engineering Education: New Trends and Implementing Experience” (ITEE-2019)
Article Number: 07003
Number of page(s): 14
Section: Anthropological Dimension of Digital Technologies in Engineering Education
DOI: https://doi.org/10.1051/itmconf/20203507003
Published online: 09 December 2020

ITM Web of Conferences 35, 07003 (2020)


Khurshed A. Khudoyberdiev^*

Khujand Polytechnic Institute of Tajik Technical University named after Academician M.S. Osimi, Khujand, 735700, Tajikistan

^* Corresponding author: tajlingvo@gmail.com

Abstract

This article is devoted to the development of a prototype computer text-to-speech synthesizer for the Tajik language. The need for such a synthesizer arises because its analogues for other languages not only help people with visual and speech impairments but are also increasingly used in communication technology and in information and reference systems. In the future, such programs will take their proper place in broad spoken dialogue between humans and automated machines and robots in various fields of human activity. The article describes the prototype Tajik text-to-speech synthesizer developed by the author. It is built on the principle of a concatenative synthesizer in which the syllable is chosen as the speech unit, which in turn requires the most complete possible description of the variety of Tajik syllables. To study the patterns of the Tajik language associated with the syllable, the concept of the “syllabic structure of the word” is introduced. The statistical distribution of these structures is obtained, i.e. a correspondence is established between the syllabic structures of words and the frequencies of their occurrence in Tajik texts. An algorithm for breaking Tajik words into syllables is proposed and implemented as a computer program. A solution to the problem of Tajik speech synthesis from arbitrary text is proposed. The article describes the computer implementation of the algorithm for synchronization of words, numbers, characters and text. For each syllable, the corresponding sound realization is extracted from the “syllable-sound” database, and the sound of the word is then synthesized from the extracted elements.
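
The two ideas named in the abstract, counting syllabic structures of words and concatenating sounds retrieved from a “syllable-sound” database, can be illustrated with a minimal Python sketch. It is not the author's implementation. The words are assumed to be already split into syllables, because the abstract does not give the splitting algorithm. The vowel set, the C/V notation for a syllabic structure, the function names and the toy sample lists standing in for recorded audio are all hypothetical.

```python
from collections import Counter

# Assumption: a simplified set of Tajik Cyrillic vowel letters. The paper's
# own letter classification (e.g. for iotated letters) may differ.
VOWELS = set("аеёиӣоуӯэюя")


def syllable_pattern(syllable):
    """Map a syllable to a consonant/vowel pattern, e.g. 'тоб' -> 'CVC'."""
    return "".join("V" if ch in VOWELS else "C" for ch in syllable.lower())


def word_structure(syllables):
    """Join the patterns of a word's syllables, e.g. ['ки', 'тоб'] -> 'CV-CVC'."""
    return "-".join(syllable_pattern(s) for s in syllables)


def structure_distribution(words):
    """Count how often each syllabic structure occurs in a list of words."""
    return Counter(word_structure(w) for w in words)


def synthesize_word(syllables, syllable_sounds):
    """Concatenate the stored sound of each syllable into one sample list."""
    samples = []
    for s in syllables:
        if s not in syllable_sounds:
            raise KeyError(f"no sound stored for syllable {s!r}")
        samples.extend(syllable_sounds[s])
    return samples


# Toy input: words already split into syllables (hypothetical data).
WORDS = [["ки", "тоб"], ["мо", "дар"], ["мак", "таб"]]

# Toy stand-in for the "syllable-sound" database: short integer lists
# play the role of recorded audio samples.
SOUNDS = {
    "ки": [1, 2],
    "тоб": [3, 4, 5],
    "мо": [6],
    "дар": [7, 8],
    "мак": [9, 10],
    "таб": [11],
}

if __name__ == "__main__":
    print(structure_distribution(WORDS))
    print(synthesize_word(["ки", "тоб"], SOUNDS))
```

A real concatenative synthesizer would store waveform fragments instead of integer lists and would typically smooth the joins between syllables. The sketch only shows the lookup-and-concatenate step and the frequency count.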

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
