Abstract
This paper presents work on document retrieval for Italian carried out at ITC-irst. Two different approaches to information retrieval were investigated, one based on the Okapi weighting formula and one based on a statistical model. Development experiments were carried out using the Italian sample of the TREC-8 CLIR track. Performance evaluation was done on the Cross Language Evaluation Forum (CLEF) 2000 Italian monolingual track. The two methods achieved mean average precisions of 49.0% and 47.5%, respectively, which were the two best scores of their track.
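For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how mean average precision is conventionally computed from ranked result lists and relevance judgments. It is not the evaluation code used by the authors or by CLEF; the function names and the document and query identifiers are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set


def average_precision(ranking: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked list against a set of relevant doc ids.

    Precision is sampled at each rank where a relevant document appears,
    and the sum is divided by the total number of relevant documents.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of the per-query average precision over all judged queries."""
    scores = [average_precision(runs.get(q, []), rel) for q, rel in qrels.items()]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical toy data: two queries with made-up document ids.
example_runs = {"q1": ["d1", "d2", "d3", "d4"], "q2": ["d5", "d6"]}
example_qrels = {"q1": {"d1", "d3"}, "q2": {"d6"}}
example_map = mean_average_precision(example_runs, example_qrels)
```

In this toy data, query q1 finds relevant documents at ranks 1 and 3, and query q2 finds its single relevant document at rank 2; the reported score is the mean of the two per-query values.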
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bertoldi, N., Federico, M. (2001). ITC-irst at CLEF 2000: Italian Monolingual Track. In: Peters, C. (eds) Cross-Language Information Retrieval and Evaluation. CLEF 2000. Lecture Notes in Computer Science, vol 2069. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44645-1_26
DOI: https://doi.org/10.1007/3-540-44645-1_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42446-8
Online ISBN: 978-3-540-44645-3
eBook Packages: Springer Book Archive