Continuous Speech Recognition of Kazakh Language

Open Access

Issue		ITM Web Conf. Volume 24, 2019 AMCSE 2018 - International Conference on Applied Mathematics, Computational Science and Systems Engineering


Article Number		01012
Number of page(s)		5
Section		Communications-Systems-Signal Processing
DOI		https://doi.org/10.1051/itmconf/20192401012
Published online		01 February 2019

