Detecting Gaps in Language Resources and Tools in the Project CESAR | SpringerLink
Skip to main content

Detecting Gaps in Language Resources and Tools in the Project CESAR

  • Conference paper
  • First Online:
Human Language Technology Challenges for Computer Science and Linguistics (LTC 2011)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8387))

Included in the following conference series:

  • 866 Accesses

Abstract

In this paper the first preliminary results of the analysis of marks collected within the tables of META-NET series of Language White Papers of CESAR project languages are demonstrated. Although they are preliminary results, we can consider them useful for showing us where real gaps in language resources and tools can be detected.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    In the final version of all META-NET Language White Papers, the overall methodology of collecting and merging marks was changed. It was decided that the peer-evaluation of the original fine-grained categories would not be practical and feasible to carry out at the META-NET community level. Therefore the categories were merged and the further process of evaluation and the final decisions at the META-NET meeting in Berlin in 2011 were based on the summary categories.

  2. 2.

    Each LRT category was originally marked (on a scale of 0 to 6) for quantity, availability, quality, coverage, maturity, sustainability and adaptability. See the respective tables in Sect. 4.6 of the individual LWP volumes.

References

  1. Váradi, T.: Veni, Vidi, Vici: the language technology infrastructure landscape after CESAR. In: Gajdošová, K., Žáková, A. (eds.) Natural Language Processing, Corpus Linguistics, E-Learning, Bratislava, pp. 261–278. RAM-Verlag (2013)

    Google Scholar 

  2. Blagoeva, D., Koeva, S., Murdarov, V.: – The Bulgarian Language in the Digital Age. META-NET White Paper Series. Springer (2012). http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

  3. Tadić, M., Brozović-Rončević, D., Kapetanović, A.: Hrvatski Jezik u Digitalnom Dobu - The Croatian Language in the Digital Age. META-NET White Paper Series. Springer (2012), http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

  4. Simon, E., Lendvai, P., Németh, G., Olaszy, G., Vicsi, K.: A magyar nyelv a digitális korban - The Hungarian Language in the Digital Age. META-NET White Paper Series. Springer (2012). http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

  5. Miłkowski, M.: Jȩzyk polski w erze cyfrowej - The Polish Language in the Digital Age. META-NET White Paper Series. Springer (2012). http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

  6. Vitas, D., Popović, L., Krstev, C., Obradović, I., Pavlović-Lažetić, G., Stanojević, M.: – The Serbian Language in the Digital Age. META-NET White Paper Series. Springer (2012). http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

  7. Šimková, M., Garabík, R., Gajdošová, K., Laclavík, M., Ondrejović, S., Juhár, J., Genći, J., Furdík, K., Ivoríková, H., Ivanecký, J.: Slovenský jazyk v digitálnom veku - The Slovak Language in the Digital Age. META-NET White Paper Series. Springer (2012). http://www.meta-net.eu/whitepapers, Rehm, G., Uszkoreit, H. (Series eds.)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marko Tadić .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Tadić, M., Váradi, T., Garabík, R., Koeva, S., Ogrodniczuk, M., Vitas, D. (2014). Detecting Gaps in Language Resources and Tools in the Project CESAR. In: Vetulani, Z., Mariani, J. (eds) Human Language Technology Challenges for Computer Science and Linguistics. LTC 2011. Lecture Notes in Computer Science(), vol 8387. Springer, Cham. https://doi.org/10.1007/978-3-319-08958-4_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08958-4_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08957-7

  • Online ISBN: 978-3-319-08958-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics