Abstract
Every day, a massive amount of information is reported in the form of video, audio, or text through various media such as television, radio, social media, and web blogs. As the number of unstructured documents on those media has grown, finding relevant information has become more difficult. As a result, extracting relevant events from large amounts of unstructured text data is essential. We proposed an event extraction model, which aims to detect, classify and extract various types of events along with their arguments from Amharic text documents. In this paper, the researchers first come up with Amharic language-specific issues and then proposed Bidirectional Long Short Memory (BiLSTM) with a Word2vec model to detect and classify Amharic events from unstructured documents. To achieve this research 9,050 Amharic documents were used for event detection and extraction purpose. In addition to event detection and classification, the model also extracts event arguments that contain additional information about events such as Time and Place. The experimental results showed that the Bidirectional long short-term memory approach with Word2vec word embedding shows a promising result in terms of Amharic event detection and event classification, with 94% and 89% accuracy, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Polakof, A.C.: Why are events, facts, and states of affairs different? Disputatio 9(44), 99–122 (2017). https://doi.org/10.2478/disp-2017-0029
Zhou, D., Chen, L., He, Y.: A simple Bayesian modelling approach to event extraction from Twitter. 52nd Annu. Meet. Assoc. Comput. Linguist. ACL 2014 - Proc. Conf., vol. 2, pp. 700–705 (2014). https://doi.org/10.3115/v1/p14-2114
Sahoo, S.K., Saha, S., Ekbal, A., Bhattacharyya, P.: A platform for event extraction in hindi. Proc. 12th Conf. Lang. Resour. Eval. (LREC 2020), no. May, pp. 11–16 (2020)
Petroni, F., et al.: An extensible event extraction system with cross-media event resolution. Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min., pp. 626–635, (2018). https://doi.org/10.1145/3219819.3219827
Hordofa, B.A.: Event extraction and representation model from news articles. 16 (3, 1–8 (2020)
Tadesse, E., Aga, R.T., Qaqqabaa, K.: Event extraction from unstructured amharic text. no. May, pp. 2103–2109 (2020)
Nguyen, V.Q., Anh, T.N., Yang, H.J.: Real-time event detection using recurrent neural network in social sensors. Int. J. Distrib. Sens. Networks 15, 6 (2019). https://doi.org/10.1177/1550147719856492
Björne, J., Salakoski, T.: Biomedical event extraction using convolutional neural networks and dependency parsing. 98–108 (2019). https://doi.org/10.18653/v1/w18-2311
Huang, L., et al.: Liberal event extraction and event schema induction. 54th Annu. Meet. Assoc. Comput. Linguist. ACL 2016 - Long Pap., vol. 1, pp. 258–268 (2016). https://doi.org/10.18653/v1/p16-1025
Zhang, Y., Liu, Z., Zhou, W.: Event recognition based on deep learning in Chinese texts. PLoS ONE 11(8), 1–18 (2016). https://doi.org/10.1371/journal.pone.0160147
Wang, W., Ning, Y., Rangwala, H., Ramakrishnan, N.: A multiple instance learning framework for identifying key sentences and detecting events. Int. Conf. Inf. Knowl. Manag. Proc., vol. 24–28-Octo, pp. 509–518 (2016). https://doi.org/10.1145/2983323.2983821
Ji, H., Grishman, R.: Refining event extraction through cross-document inference. ACL-08 HLT – 46th Annu. Meet. Assoc. Comput. Linguist. Hum. Lang. Technol. Proc. Conf., no. June, pp. 254–262 (2008)
Sahnoun, S., Elloumi, S., Yahia, S.B.: Event Detection Based on Open Information Extraction and Ontology. In: Nguyen, N.T., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds.) ICCCI 2019. LNCS (LNAI), vol. 11683, pp. 244–255. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28377-3_20
Ribeiro, S., Ferret, O., Tannier, X.:. Unsupervised event clustering and aggregation from newswire and web articles. pp. 62–67 (2018). https://doi.org/10.18653/v1/w17-4211
Zhou, D., Chen, L., He, Y.: An unsupervised framework of exploring events on Twitter: filtering, extraction and categorization. Proc. Natl. Conf. Artif. Intell. 3, 2468–2474 (2015)
Valenzuela-Escárcega, M.A., Hahn-Powell, G., Hicks, T., Surdeanu, M.: A domain-independent rule-based framework for event extraction. ACL-IJCNLP 2015 – 53rd Annu. Meet. Assoc. Comput. Linguist. 7th Int. Jt. Conf. Nat. Lang. Process. Proc. Syst. Demonstr. pp. 127–132 (2015). https://doi.org/10.3115/v1/p15-4022
Miwa, M., Thompson, P., Korkontzelos, I., Ananiadou, S.: Comparable study of event extraction in newswire and biomedical domains. COLING 2014 – 25th Int. Conf. Comput. Linguist. Proc. COLING 2014 Tech. Pap. pp. 2270–2279 (2014)
Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., Gao, J.: Deep learning based text classification: a comprehensive review. arXiv, 1(1), pp. 1–43 (2020)
Nguyen T.H., Grishman, R. Modeling skip-grams for event detection with convolutional neural networks. EMNLP 2016 – Conf. Empir. Methods Nat. Lang. Process. Proc. no. January, pp. 886–891 (2016). https://doi.org/10.18653/v1/d16-1085
Nguyen, T.H., Fu, L., Cho, K., Grishman, R.: A two-stage approach for extending event detection to new types via neural networks. pp. 158–165 (2016) https://doi.org/10.18653/v1/w16-1618
Nguyen, T.H., Cho, K., Grishman, R.: Joint event extraction via recurrent neural networks. 2016 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. NAACL HLT 2016 – Proc. Conf. pp. 300–309 (2016). https://doi.org/10.18653/v1/n16-1034
Ma, J., Wang, S.: Resource–Enhanced Neural Model for Event Argument Extraction (2018)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Andualem, A., Tegegne, T. (2022). Design Event Extraction Model from Amharic Texts Using Deep Learning Approach. In: Berihun, M.L. (eds) Advances of Science and Technology. ICAST 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 411. Springer, Cham. https://doi.org/10.1007/978-3-030-93709-6_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-93709-6_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93708-9
Online ISBN: 978-3-030-93709-6
eBook Packages: Computer ScienceComputer Science (R0)