Abstract
Information retrieval (IR) has become an important application in today’s computer world because of the great increase in the number of web-based documents and the widespread use of the Internet. However, the classical "bag of words" approach no longer meets user expectations adequately. In this context, natural language processing (NLP) techniques come to mind. In this paper, we investigate whether NLP techniques can improve the effectiveness of information retrieval in Turkish. We implemented and tested a linguistically motivated information retrieval system, which uses knowledge of the morphological, lexico-semantic and syntactic levels of Turkish.
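To make the limitation of the "bag of words" approach concrete, the following is a minimal sketch, not the system described in the paper. It assumes whitespace tokenization, raw term counts and cosine similarity; the function names and the toy Turkish sentences are hypothetical and chosen only for illustration. It shows that an inflected form such as "kitaplar" (books) shares no term with the query "kitap" (book) unless some morphological analysis is applied.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Map a text to raw term counts (illustrative: whitespace tokens only)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy data: the same verb, with the noun in two surface forms.
query = bag_of_words("kitap")
doc_exact = bag_of_words("kitap okudum")         # contains the exact query term
doc_inflected = bag_of_words("kitaplar okudum")  # contains only the plural form

score_exact = cosine(query, doc_exact)
score_inflected = cosine(query, doc_inflected)

print(score_exact > 0.0)       # the exact surface form is matched
print(score_inflected == 0.0)  # the inflected form is missed entirely
```

Under these assumptions the second document receives a score of zero even though it is clearly relevant, which is the kind of mismatch that morphological processing is meant to address in an agglutinative language like Turkish.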
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pembe, F.C., Say, A.C.C. (2004). A Linguistically Motivated Information Retrieval System for Turkish. In: Aykanat, C., Dayar, T., Körpeoğlu, İ. (eds) Computer and Information Sciences - ISCIS 2004. ISCIS 2004. Lecture Notes in Computer Science, vol 3280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30182-0_74
DOI: https://doi.org/10.1007/978-3-540-30182-0_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23526-2
Online ISBN: 978-3-540-30182-0
eBook Packages: Springer Book Archive