Abstract
Multilingual thesauri play a key role in multilingual text retrieval. At present, only a small number of on-line thesauri contain translations of terms in languages other than English. This is the case of the Unified Medical Language System (UMLS) Metathesaurus that includes the same term in different languages (e.g., English and Spanish). However, only a subset of terms in English have a corresponding translation in Spanish. In this work, I present an approach and some experimental results for reusing translated terms to expand the Metathesaurus. The approach includes two main tasks: finding patterns and formulating rules to automate the translation of English terms into Spanish terms. The approach is based on pattern matching, morphological rules, and word order inversion.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Soergel, D.: Multilingual thesauri in cross-language text and speech retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)
Oard, D.W.: Alternative Approaches for Cross-Language Text Retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)
Sheridan, P., Ballerini, J.P. Experiments in Multilingual Information Retrieval using the SPIDER system. Proceedings of SIGIR-96 (1996)
Yang, Y., Brown, R., Frederking, R., Carbonell, J., Geng, Y., Lee, D. Bilingual-corpus Based Approaches to Translingual Information Retrieval. The 2nd Workshop on Multilinguality in Software Industry: The AI Contribution (1997)
Ballesteros, L., Croft, W.B. Phrasal translation and query expansion techniques for cross-language information retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)
Stamatos, E., Michos, S., Pantelodimou, C., Fakotakis, N. TRANSLIB: An Advanced Tool for Supporting Multilingual Access to Library Catalogues. The 2nd Workshop on Multilinguality in Software Industry: The AI Contribution (1997)
Gilarranz, J., Gonzalo, J., Verdejo, F. Language-independent Text Retrieval with the EuroWordNet Multilingual Semantic Database. The 2nd Workshop on Multi-linguality in Software Industry: The AI Contribution (1997)
Nkwenti-Azeh, B. New trends in terminology processing and implications for practical translation. ASLIB (1994) 67–74
National Library of Medicine. Unified Medical Language System. 8th Edition 1997. UMLS Knowledge Sources, Documentation (1997)
Klemettinen, M., Mannila, H., Toivonen, H. A Data Mining Methodology and Its Application to Semi-Automatic Knowledge Acquisition. DEXA’97 Workshop (1997)
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P. From data mining to knowledge discovery: An overview. Advances in Knowledge Discovery and Data Mining (1996) 1–34
Brazma, A., Jonassen, I., Eidhammer, I., Gilbert, D. R. Approaches to automatic discovery of patterns in biosequences. Reports in Informatics 113 (1995)
Jonassen, I., Collins, J. F., Higgins, D. G. Finding flexible patterns in unaligned protein sequences. Protein Science 4(8) (1995) 1587–1595
Jonassen, I. Efficient discovery of conserved patterns using a pattern graph CABIOS (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guillén, R. (1998). Reusing Translated Terms to Expand a Multilingual Thesaurus. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_34
Download citation
DOI: https://doi.org/10.1007/3-540-49478-2_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65259-5
Online ISBN: 978-3-540-49478-2
eBook Packages: Springer Book Archive