Abstract
This paper describes DEBORA – a dependency-based approach to the extraction of relations between named entities from Polish open-domain texts. The presented method designed for the purpose of the conducted experiment is adapted to morpho-syntactic properties of Polish. Results show that the method is applicable for Polish, even if there is a room for improvement. The extraction approach may be applied to the problem of graphical entity summarisation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Acedański, S.: A Morphosyntactic Brill Tagger for Inflectional Languages. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds.) IceTAL 2010. LNCS, vol. 6233, pp. 3–14. Springer, Heidelberg (2010)
Banko, M., Cafarella, M., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the web. In: Proceedings of the 20th International Joint Conference on Artifical Intelligence, pp. 2670–2676 (2007)
Brin, S.: Extracting Patterns and Relations from the World Wide Web. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 172–183. Springer, Heidelberg (1999)
Cafarella, M.J., Downey, D., Soderland, S., Etzioni, O.: KnowItNow: Fast, Scalable Information Extraction from the Web. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp. 563–570. Association for Computational Linguistics (2005)
Etzioni, O., Fader, A., Christensen, J., Soderland, S., Mausam: Open Information Extraction: The Second Generation. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, pp. 3–10 (2011)
Korpus Rzeczpospolitej. Corpus of articles published in the Polish newspaper Rzeczpospolita, http://www.cs.put.poznan.pl/dweiss/rzeczpospolita
Savary, A., Waszczuk, J.: Narzędzia do anotacji jednostek nazewniczych. In: Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.) Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warszawa (2012)
Sydow, M., Pikuła, M., Schenkel, R.: To Diversify or Not to Diversify Entity Summaries on RDF Knowledge Graphs? In: Kryszkiewicz, M., Rybinski, H., Skowron, A., Raś, Z.W. (eds.) ISMIS 2011. LNCS (LNAI), vol. 6804, pp. 490–500. Springer, Heidelberg (2011)
Wróblewska, A.: Polish Dependency Bank. Linguistic Issues in Language Technology 7(1) (2012), http://elanguage.net/journals/index.php/lilt/article/view/2684
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wróblewska, A., Sydow, M. (2012). DEBORA: Dependency-Based Method for Extracting Entity-Relationship Triples from Open-Domain Texts in Polish. In: Chen, L., Felfernig, A., Liu, J., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2012. Lecture Notes in Computer Science(), vol 7661. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34624-8_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-34624-8_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34623-1
Online ISBN: 978-3-642-34624-8
eBook Packages: Computer ScienceComputer Science (R0)