Abstract
The paper presents a way to advance to a multi-level automatic speech understanding system implementation. Two levels are considered. On the first level a free (or relatively free) grammar phoneme recognition is applied and at the second level an output of the phonemic recognizer is automatically interpreted in a reasonable way. A Generative Model approach based model for phoneme recognizer output decoding is proposed. An experimental system is described.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Vintsiuk, T.: Multi-Level Multi-Decision Model for Automatic Speech Recognition and Understanding // International Summer School Neural Nets. E.R.Caianiello 3rd Course Speech Processing, Recognition and Artificial Neural Networks, Vietri sul Mare (SA) Italy, pp. 341–344 (1998)
Vintsiuk, T.: Generative Phoneme-Threephone Model for ASR. In: Matoušek, V., Mautner, P., Mouček, R., Tauser, K. (eds.) TSD 2001. LNCS (LNAI), vol. 2166, pp. 201–207. Springer, Heidelberg (2001)
Vasylyeva, N.: Training Samples Forming for Automatic Speech Synthesis by Text. – Magister diploma work, Kyjiv, p. 88 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sazhok, M. (2005). Generative Model for Decoding a Phoneme Recognizer Output. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_37
Download citation
DOI: https://doi.org/10.1007/11551874_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)