A Linguistically Motivated Information Retrieval System for Turkish

  • Conference paper
Computer and Information Sciences - ISCIS 2004 (ISCIS 2004)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3280)

Abstract

Information retrieval (IR) has become an important application in today’s computing world because of the rapid growth in the number of web-based documents and the widespread use of the Internet. However, the classical “bag of words” approach no longer meets user expectations adequately. In this context, natural language processing (NLP) techniques come to mind. In this paper, we investigate whether NLP techniques can improve the effectiveness of information retrieval in Turkish. We implemented and tested a linguistically motivated information retrieval system that uses knowledge of the morphological, lexico-semantic, and syntactic levels of Turkish.
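
As a rough illustration of why plain bag-of-words matching can fall short for Turkish, the following minimal sketch compares exact token matching with a toy suffix stripper. It is not the system described in the paper: the suffix list, the function names (`toy_stem`, `matches`), and the sample sentence are assumptions made for this example, and real Turkish morphology needs a full morphological analyzer.

```python
import re

# Hypothetical toy suffix list (plural and locative only), longest first.
# This is an illustrative assumption, not a real Turkish stemmer.
SUFFIXES = ("lerde", "larda", "ler", "lar", "de", "da")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def toy_stem(token):
    """Strip the first matching suffix, keeping at least two characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 2:
            return token[: -len(suffix)]
    return token


def matches(query, document, normalize=lambda t: t):
    """Return True if every normalized query term occurs in the document."""
    doc_terms = {normalize(t) for t in tokenize(document)}
    return all(normalize(t) in doc_terms for t in tokenize(query))


doc = "Evlerde kitaplar var"  # roughly: "There are books in the houses"
exact = matches("ev kitap", doc)                        # surface forms only
stemmed = matches("ev kitap", doc, normalize=toy_stem)  # after toy stemming
print(exact, stemmed)
```

With exact matching, the query terms "ev" and "kitap" do not match the inflected forms "evlerde" and "kitaplar"; once both sides are normalized by the toy stemmer, they do.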

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pembe, F.C., Say, A.C.C. (2004). A Linguistically Motivated Information Retrieval System for Turkish. In: Aykanat, C., Dayar, T., Körpeoğlu, İ. (eds) Computer and Information Sciences - ISCIS 2004. ISCIS 2004. Lecture Notes in Computer Science, vol 3280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30182-0_74

  • DOI: https://doi.org/10.1007/978-3-540-30182-0_74

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23526-2

  • Online ISBN: 978-3-540-30182-0

  • eBook Packages: Springer Book Archive
