Abstract
The paper presents the design concept of the VoiceTRAN Communicator that integrates speech recognition, machine translation and text-to-speech synthesis using the DARPA Galaxy architecture. The aim of the project is to build a robust speech-to-speech translation communicator able to translate simple domain-specific sentences in the Slovenian-English language pair. The project represents a joint collaboration between several Slovenian research organizations that are active in human language technologies. We provide an overview of the task, describe the system architecture and individual servers. Further we describe the language resources that will be used and developed within the project. We conclude the paper with plans for evaluation of the VoiceTRAN Communicator.
The authors of the paper thank the Slovenian Ministry of Defense and the Slovenian Ministry of Higher Education, Science and Technology for co-funding the project.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lavie, A., Waibel, A., Levin, L., Finke, M., Gates, D., Gavalda, M., Zeppenfeld, T., Zhan, P.: Janus-III: Speech-to-Speech Translation in Multiple Languages. In: Proceedings of the ICASSP,Munich, Germany, pp. 99–102 (1997)
Wahlster, W.: Verbmobil: Foundation of Speech-to-Speech translation. Springer, Heidelberg (2000)
Lavie, A., Metze, F., Cattoni, R., Costantin, E., Burger, S., Gates, D., Langley, C., Laskowski, K., Levin, L., Peterson, K., Schultz, T., Waibel, A., Wallace, D., McDonough, J., Soltau, H., Lazzari, G., Mana, N., Pianesi, F., Pianta, E., Besacier, L., Blanchon, H., Vaufreydaz, D., Taddei, L.: A Multi-Perspective Evaluation of the NESPOLE! Speech-to- Speech Translation System. In: Proceedings of the ACL 2002 workshop on Speech-to-speech Translation: Algorithms and Systems, Philadelphia, PA (2002)
Sarich, A.: Phraselator, one-way speech translation system (2001), Available at http://www.sarich.com/translator/
Waibel, A., Badran, A., Black, A.W., Frederking, R., Gates, D., Lavie, A., Levin, L., Lenzo, K., Mayfield Tomokyo, L., Reichert, J., Schultz, T., Wallace, D., Woscsyna, M., Zhang, J.: Speechalator: Two-Way Speech-to-Speech Translation on a Consumer PDA. In: Proceedings of the Eurospeech, Geneva, Switzerland, pp. 369–372 (2003)
Seneff, S., Hurley, E., Lau, R., Pao, C., Schmid, P., Zue, V.: Galaxy-II: A Reference Architecture for Conversational System Development. In: Proceedings of the ICSLP, Sydney, Australia, pp. 931–934 (1998)
Dobrišek, S.: Analysis and Recognition of Phrases in Speech Signals. PhD Thesis, University of Ljubljana, Slovenia (2001)
Ney, H.: The Statistical Approach to Spoken Language Translation. In: Proceedings of the International Workshop on Spoken Language Translation, Kyoto, Japan, pp. XV-XVI (2004)
Romih, M., Holozan, P.: Slovensko-angleški prevajalni sistem (A Slovene-English Translation System). In: Proceedings of the 3rd Language Technologies Conference, Ljubljana, Slovenia, p. 167 (2002)
Žganec Gros, J., Mihelič, A., Žganec, M., Pavešić, N., Mihelič, F., Cvetko Orešnik, V.: Alp- Synth corpus-driven Slovenian text-to-speech synthesis: designing the speech corpus. In: Proceedings of the joint conferences CTS+CIS, Rijeka, Croatia, pp. 107–110 (2004)
Mihelič, F., Gros, J., Dobrišek, S., Žibert, J., Pavešić, N.: Spoken language resources at LUKS of the University of Ljubljana. International Journal on Speech Technologies 6(3), 221–232 (2003)
Erjavec, T.: The IJS-ELAN Slovene-English parallel corpus. International Journal on Corpus Linguistics 7(1), 1–20 (2002)
Erjavec, T., Džeroski, S.: Machine Learning of Language Structure: Lemmatising Unknown Slovene Words. Applied Artificial Intelligence 18(1), 17–41 (2004)
MT Evaluation Kit. NIST MT Evaluation Kit Version 11a (2002), Available at http://www.nist.gov/speech/tests/mt
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Žganec-Gros, J., Mihelič, F., Erjavec, T., Vintar, Š. (2005). The VoiceTRAN Speech-to-Speech Communicator. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_49
Download citation
DOI: https://doi.org/10.1007/11551874_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)