The VoiceTRAN Speech-to-Speech Communicator

Žganec-Gros, Jerneja; Mihelič, France; Erjavec, Tomaž; Vintar, Špela

doi:10.1007/11551874_49

Jerneja Žganec-Gros¹⁹,
France Mihelič²⁰,
Tomaž Erjavec²¹ &
…
Špela Vintar²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3658))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

692 Accesses
1 Citations

Abstract

The paper presents the design concept of the VoiceTRAN Communicator that integrates speech recognition, machine translation and text-to-speech synthesis using the DARPA Galaxy architecture. The aim of the project is to build a robust speech-to-speech translation communicator able to translate simple domain-specific sentences in the Slovenian-English language pair. The project represents a joint collaboration between several Slovenian research organizations that are active in human language technologies. We provide an overview of the task, describe the system architecture and individual servers. Further we describe the language resources that will be used and developed within the project. We conclude the paper with plans for evaluation of the VoiceTRAN Communicator.

The authors of the paper thank the Slovenian Ministry of Defense and the Slovenian Ministry of Higher Education, Science and Technology for co-funding the project.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Grappling with Web Technologies: The Problems of Remote Speech Recording

A Modular Approach for Romanian-English Speech Translation

French-Fulfulde Textless and Cascading Speech Translation: Towards a Dual Architecture

References

Lavie, A., Waibel, A., Levin, L., Finke, M., Gates, D., Gavalda, M., Zeppenfeld, T., Zhan, P.: Janus-III: Speech-to-Speech Translation in Multiple Languages. In: Proceedings of the ICASSP,Munich, Germany, pp. 99–102 (1997)
Google Scholar
Wahlster, W.: Verbmobil: Foundation of Speech-to-Speech translation. Springer, Heidelberg (2000)
Google Scholar
Lavie, A., Metze, F., Cattoni, R., Costantin, E., Burger, S., Gates, D., Langley, C., Laskowski, K., Levin, L., Peterson, K., Schultz, T., Waibel, A., Wallace, D., McDonough, J., Soltau, H., Lazzari, G., Mana, N., Pianesi, F., Pianta, E., Besacier, L., Blanchon, H., Vaufreydaz, D., Taddei, L.: A Multi-Perspective Evaluation of the NESPOLE! Speech-to- Speech Translation System. In: Proceedings of the ACL 2002 workshop on Speech-to-speech Translation: Algorithms and Systems, Philadelphia, PA (2002)
Google Scholar
Sarich, A.: Phraselator, one-way speech translation system (2001), Available at http://www.sarich.com/translator/
Waibel, A., Badran, A., Black, A.W., Frederking, R., Gates, D., Lavie, A., Levin, L., Lenzo, K., Mayfield Tomokyo, L., Reichert, J., Schultz, T., Wallace, D., Woscsyna, M., Zhang, J.: Speechalator: Two-Way Speech-to-Speech Translation on a Consumer PDA. In: Proceedings of the Eurospeech, Geneva, Switzerland, pp. 369–372 (2003)
Google Scholar
Seneff, S., Hurley, E., Lau, R., Pao, C., Schmid, P., Zue, V.: Galaxy-II: A Reference Architecture for Conversational System Development. In: Proceedings of the ICSLP, Sydney, Australia, pp. 931–934 (1998)
Google Scholar
Dobrišek, S.: Analysis and Recognition of Phrases in Speech Signals. PhD Thesis, University of Ljubljana, Slovenia (2001)
Google Scholar
Ney, H.: The Statistical Approach to Spoken Language Translation. In: Proceedings of the International Workshop on Spoken Language Translation, Kyoto, Japan, pp. XV-XVI (2004)
Google Scholar
Romih, M., Holozan, P.: Slovensko-angleški prevajalni sistem (A Slovene-English Translation System). In: Proceedings of the 3rd Language Technologies Conference, Ljubljana, Slovenia, p. 167 (2002)
Google Scholar
Žganec Gros, J., Mihelič, A., Žganec, M., Pavešić, N., Mihelič, F., Cvetko Orešnik, V.: Alp- Synth corpus-driven Slovenian text-to-speech synthesis: designing the speech corpus. In: Proceedings of the joint conferences CTS+CIS, Rijeka, Croatia, pp. 107–110 (2004)
Google Scholar
Mihelič, F., Gros, J., Dobrišek, S., Žibert, J., Pavešić, N.: Spoken language resources at LUKS of the University of Ljubljana. International Journal on Speech Technologies 6(3), 221–232 (2003)
Article Google Scholar
Erjavec, T.: The IJS-ELAN Slovene-English parallel corpus. International Journal on Corpus Linguistics 7(1), 1–20 (2002)
Article Google Scholar
Erjavec, T., Džeroski, S.: Machine Learning of Language Structure: Lemmatising Unknown Slovene Words. Applied Artificial Intelligence 18(1), 17–41 (2004)
Article Google Scholar
MT Evaluation Kit. NIST MT Evaluation Kit Version 11a (2002), Available at http://www.nist.gov/speech/tests/mt

Download references

Author information

Authors and Affiliations

Alpineon d.o.o., Ulica Iga Grudna 15, SI-1000, Ljubljana, Slovenia
Jerneja Žganec-Gros
University of Ljubljana, SI-1000, Ljubljana, Slovenia
France Mihelič & Špela Vintar
Jožef Stefan Insitute, Jamova 39, SI-1000, Ljubljana, Slovenia
Tomaž Erjavec

Authors

Jerneja Žganec-Gros
View author publications
You can also search for this author in PubMed Google Scholar
France Mihelič
View author publications
You can also search for this author in PubMed Google Scholar
Tomaž Erjavec
View author publications
You can also search for this author in PubMed Google Scholar
Špela Vintar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of West Bohemia in Pilsen, Univerzitni 8, 30614, Plzen, Czech Republic
Václav Matoušek , Pavel Mautner & Tomáš Pavelka , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Žganec-Gros, J., Mihelič, F., Erjavec, T., Vintar, Š. (2005). The VoiceTRAN Speech-to-Speech Communicator. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_49

Download citation

DOI: https://doi.org/10.1007/11551874_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

The VoiceTRAN Speech-to-Speech Communicator

Abstract

Access this chapter

Preview

Similar content being viewed by others

Grappling with Web Technologies: The Problems of Remote Speech Recording

A Modular Approach for Romanian-English Speech Translation

French-Fulfulde Textless and Cascading Speech Translation: Towards a Dual Architecture

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

The VoiceTRAN Speech-to-Speech Communicator

Abstract

Access this chapter

Preview

Similar content being viewed by others

Grappling with Web Technologies: The Problems of Remote Speech Recording

A Modular Approach for Romanian-English Speech Translation

French-Fulfulde Textless and Cascading Speech Translation: Towards a Dual Architecture

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation