Abstract
Multilingual access to information and services is a key requirement in any pervasive or ubiquitous computing environment. In this paper we describe our efforts towards multilingual speech recognition with a focus on applications that are designed to run on embedded devices, like e.g. a commercially available PDA. We give an overview on speech recognition techniques suited for the special requirements of the expected phonetic and acoustic environments and explore the ability to create multilingual acoustic models and applications that are able to run on embedded devices in real-time.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bahl, L., de Souza, P., Gopalakrishnan, P., Nahamoo, D., Picheny, M.: Context-dependent Vector Quantization for Continuous Speech Recognition. In: Proc. of the IEEE Int. Conference on Acoustics, Speech, and Signal Processing, Minneapolis (1993)
Beran, T., Bergl, V., Hampl, R., Krbec, P., Šedivý, J., Tydlitát, B., Vopička, J.: Embedded ViaVoice. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 269–274. Springer, Heidelberg (2004)
Fischer, V., Gonzalez, J., Janke, E., Villani, M., Waast-Richard, C.: Towards Multilingual Acoustic Modeling for Large Vocabulary Speech Recognition. In: Proc. of the IEEE Workshop on Multilingual Speech Communications, Kyoto (2000)
Fischer, V., Janke, E., Kunzmann, S.: Likelihood Combination and Recognition Output Voting for the Decoding of Non-Native Speech with Multilingual HMMs. In: Proc. of the 7th Int. Conference on Spoken Language Processing, Denver (2002)
Fischer, V., Kunzmann, S.: Bayesian Information Criterion based Multi-style Training and Likelihood Combination for Robust Hands Free Speech Recognition in the Car. In: Proc. of the IEEE Workshop on Handsfree Speech communication, Kyoto (2001)
Kunzmann, S., Fischer, V., Gonzalez, J., Emam, O., Günther, C., Janke, E.: Multilingual Acoustic Models for Speech Recognition and Synthesis. In Proc. of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Montreal, 2004.
Schultz, T., Waibel, A.: Language Independent and Language Adaptive Acoustic Modeling for Speech Recognition. Speech Communications 35 (2001)
Wells, C.J.: Computer Coded Phonemic Notation of Individual Languages of the European Community. Journal of the International Phonetic Association 19, 32–54 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ivanecký, J., Fischer, V., Kunzmann, S. (2005). French–German Bilingual Acoustic Modeling for Embedded Voice Driven Applications. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_30
Download citation
DOI: https://doi.org/10.1007/11551874_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)