Abstract
The prosodic specification of an utterance to be spoken by a Text-to-Speech synthesis system can be devised in break indices, pitch accents and boundary tones. In particular, the identification of break indices formulates the intonational phrase breaks that affect all the forthcoming prosody-related procedures. In the present paper we use tree-structured predictors, and specifically the commonly used in similar tasks CART and the introduced C4.5 one, to cope with the task of break placement in the presence of shallow textual features. We have utilized two 500-utterance prosodic corpora offered by two Greek universities in order to compare the machine learning approaches and to argue on the robustness they offer for Greek break modeling. The evaluation of the resulted models revealed that both approaches were positively compared with similar works published for other languages, while the C4.5 method accuracy scaled from 1% to 2,7% better than CART.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bolinger, D.: Intonation and its Uses: Melody in Grammar and Discourse, London, UK, Edward Arnold (1989)
Anderson, M., Pierrehumbert, J., Liberman, M.: Synthesis by rule of English intonation patterns. In: ICASSP, pp. 281–284 (1984)
Prieto, P., Hirschberg, J.: Training Intonational Phrasing Rules Automatically for English and Spanish text-to-speech. Speech Communication 18, 281–290 (1996)
Bachenco, J., Fitzpatrick, E.: A Computational grammar of Discourse-Neutral Prosodic Phrasing in English. Computational Linguistics 16(3), 155–170 (1990)
Ostendorf, M., Veilleux, N.: A hierarchical stochastic model for automatic prediction of prosodic boundary location. Computational Linguistics 20(1) (1989)
Taylor, P., Black, A.W.: Assigning Phrase Breaks from Part-of-Speech Sequences. Computer Speech and Language 12, 99–117 (1998)
Muller, A.F., Zimmermann, H.G., Neuneier, R.: Robust Generation of Symbolic Prosody by a Neural Classifier Based on Autoassociators. In: ICASSP 1996, pp. 1285–1288 (1996)
Xydas, G., Spiliotopoulos, D., Kouroupetroglou, G.: Modeling Prosodic Structures in Linguistically Enriched Environments. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 521–528. Springer, Heidelberg (2004)
Fordyce, C., Osterdorf, S.,, M.: Prosody Prediction for Speech Synthesis Using Transformational Rule-Based Learning. In: ICSLP 1998, pp. 682–685 (1998)
Zervas, P., Maragoudakis, M., Fakotakis, N., Kokkinakis, G.: Bayesian Induction of intonational phrase breaks. In: EUROSPEECH, Geneva, Switzerland, September 1-4, pp. 113–116 (2003)
Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., Hirschberg, J.: ToBI: A standard for labeling English prosody. In: ICSLP, pp. 867–870 (1992)
Stamatatos, E., Fakotakis, N., Kokkinakis, G.: A Practical Chunker for Unrestricted Text. In: 2nd Int. Conf. of Natural Language Processing, pp. 139–150 (2000)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group, Belmont (1984)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Stone, M.: Cross-validation choice and assessment of statistical predictions. Journal of the Royal Statistical Society 36, 111–147 (1974)
Xydas, G., Spiliotopoulos, D., Kouroupetroglou, G.: Modeling Improved Prosody Generation from High-Level Linguistically Annotated Corpora. In: IEICE Transactions of Information and Systems (2005) (to appear)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zervas, P., Xydas, G., Fakotakis, N., Kokkinakis, G., Kouroupetroglou, G. (2005). Experimental Evaluation of Tree-Based Algorithms for Intonational Breaks Representation. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_43
Download citation
DOI: https://doi.org/10.1007/11551874_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)