Improved conditional random fields model with multi-trigger embedding for Chinese event extraction

World Wide Web

Abstract

Event extraction is a challenging task in natural language understanding that aims to recognize the event type, subtype, and roles of relevant entities in unstructured text. Most current approaches rely on highly local models that extract the event type and the arguments independently. However, such a multi-step method cannot make full use of the reciprocal dependency between an event trigger and its arguments, especially for nested event structures. For example, the trigger of a Life/Injure event can be embedded inside an argument, so that the trigger is both the event anchor and a modifier of the argument. Moreover, within a single label space, trigger examples are scarce relative to argument examples, so the data are unbalanced and such a trigger is apt to be labeled as an event argument. To let event type recognition and event argument recognition guide each other, and to resolve the problem of unbalanced data, we treat event extraction as a sequence labeling problem and build a novel improved conditional random fields joint labeling model with multi-trigger embedding. Experimental results on the ACE 2005 Chinese corpus show that this method improves the performance of event extraction.
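To make the joint labeling setting concrete, the following minimal sketch shows how a trigger nested inside an argument might be written into a single BIO label sequence, and how few trigger labels result compared with argument labels. It is an illustration under stated assumptions, not the model described in the abstract: the English example sentence, the `ARG:`/`TRG:` label prefixes, the `Victim` role name, and the helper `to_joint_bio` are all hypothetical, and letting the later span overwrite the earlier one is only one simple way to keep a nested trigger visible.

```python
from collections import Counter


def to_joint_bio(tokens, spans):
    """Assign one BIO label per token from (start, end, label) spans.

    `end` is exclusive. Spans are applied in order, so a nested trigger
    listed after its enclosing argument overwrites the argument label
    on the tokens it covers (an assumption made for this sketch).
    """
    labels = ["O"] * len(tokens)
    for start, end, label in spans:
        for i in range(start, end):
            prefix = "B-" if i == start else "I-"
            labels[i] = prefix + label
    return labels


# Hypothetical example: the trigger "injured" sits inside the argument
# "the injured soldier", so it is both event anchor and modifier.
tokens = ["the", "injured", "soldier", "was", "sent", "to", "hospital"]
spans = [
    (0, 3, "ARG:Victim"),       # enclosing argument span
    (1, 2, "TRG:Life/Injure"),  # nested trigger span
]

labels = to_joint_bio(tokens, spans)

# Count how many tokens carry argument labels versus trigger labels.
kinds = Counter(
    label.split("-", 1)[1].split(":")[0] for label in labels if label != "O"
)

for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
print(dict(kinds))
```

Even in this tiny example the argument labels outnumber the trigger labels, which hints at why a sequence labeler trained on a shared label space may prefer the argument label for a nested trigger.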

Author information

Corresponding author

Correspondence to Ruifang He.

About this article

Cite this article

He, R., Zhang, Y., Li, T. et al. Improved conditional random fields model with multi-trigger embedding for Chinese event extraction. World Wide Web 17, 1029–1049 (2014). https://doi.org/10.1007/s11280-013-0231-7

  • DOI: https://doi.org/10.1007/s11280-013-0231-7