Abstract
In this paper a new rule-based approach to break assignment for the Russian language is discussed. It is a flexible and robust method of segmentation of texts in Russian in prosodic units. We implemented it in the recent “Orator” text-to-speech (TTS) system. The model was developed to use for the inflective languages as an alternative both for statistic and for strict rule-based algorithms. It is designed in such a way that all potentially tunable context dependencies are brought up to the interface grammar and can be easily modified by linguists. The algorithm we developed performs well on different kinds of texts due to this simple and intuitive grammar built upon an elaborate mechanism of morpho-grammatical analysis. Juncture correct rate varies between more than 98% for simple literary texts and 85% for raw transcripts of spontaneous speech.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Atterer, M.: Assigning Prosodic Structure for Speech Synthesis: A Rule-based Approach. In: Proceedings of Prosody 2002. Aix-en-Provence, pp. 147–150 (2002)
Bachenko, J., Fitzpatrick, E.: A Computational Grammar of Discourse-Neutral Prosodic Phrasing in English. Computational Linguistics 16, 157–167 (1993)
Bondarko, L.V., Volskaya, N.B., Tananaiko, S.O., Vasilieva, L.A.: Phonetic Properties of Russian Spontaneous Speech. In: Proceedings of the 15th ICPhS. Barcelona, Spain, pp. 2973–2976 (2003)
Black, A.W., Taylor, P.: Assigning phrase breaks from part-of-speech sequences. In: Proceedings of Eurospeech 1997. Rhodes, Greece, pp. 995–998 (1997)
Gee, J.P., Grosjean, F.: Performance Structures: A Psycholinguistic and Linguistic Appraisal. Cognitive Psychology 15, 411–458 (1998)
Krivnova, O.F.: Perceptual and semantic meaning of prosodic boundaries in a coherent text. In: Problemy Fonetiki. Moscow, Russia, pp. 228–238 (1995) (in Russian)
Monaghan, A.I.C.: Rhythm and stress shift. Computer Speech and Language 4, 71–78 (1990)
Oparin, I., Talanov, A.: Stem-Based Approach to Pronunciation Vocabulary Construction and Language Modeling of Russian. In: Eurospeech 2005. Lisbon, Portugal (2005) (submitted to)
Oparin, I.: Flexible Rule-Based Breaks Assignment for Russian. In: Eurospeech 2005. Lisbon, Portugal (2005) (submitted to)
Sanders, E.: Using Probabilistic Methods to Predict Phrase Boundaries for a Text-to-Speech System. Phd thesis, University of Nijmegen, the Netherlands (1995)
Traber, C.: Syntactic Processing and Prosody Control in the SVOX TTS System for German. In: Proceedings of Eurospeech 1993. Berlin, Germany, pp. 2099–2102 (1993)
Wang, M., Hirschberg, J.: Automatic Classification of Intonational Phrase Boundaries. Computer Speech and Language 6 (1992)
Zaliznyak, A.A.: Grammatical Dictionary of the Russian Language. Moscow, Russia (1977) (in Russian)
Zharkov, I.V., Slobodanuk, S.L., Svetozarova, N.D.: Automatic Accent-Intonational Transcriber of a Russian Text. Bochum-St.Petersburg (1994) (in Russian)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oparin, I. (2005). Robust Rule-Based Method for Automatic Break Assignment in Russian Texts. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_46
Download citation
DOI: https://doi.org/10.1007/11551874_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)