Abstract
Named entity recognition (NER) is a subtask in information extraction which aims to locate atomic element into predefined categories. Various NER techniques and tools have been developed to fit the interest of the applications developed. However, most NER works carried out focus on non-fiction domain. Fiction based domain displays a complex context in locating its NE especially name of person that might range from living things to non-living things. This paper proposes VAHA, automated dominant characters identification in fiction domain, particularly in fairy tales. TreeTagger, Stanford Dependencies and WordNet are the three freely available tools being used to identify verbs that are associated with human activity. The experimental results show that it is viable to use verb in identifying named entity, particularly in people category and it can be applied in a small text size environment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lisa, F.R.: Extracting Company Names from Text. In: Proc. IEEE Conference on Artificial Intelligence Applications, pp. 20–32 (1991)
Laura, C., Rajasekar, K., Li, Y.Y., Frederick, R., Shivakumar, V.: Domain Adaptation of Rule-Based Annotators for Named-Entity Recognition Tasks. In: Empirical Methods in Natural Language Processing, Massachusetts, pp. 1002–1012 (2010)
Andrew, M., Li, W.: Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons. In: 7th Conference on Natural Language Learning, pp. 188–191 (2003)
Dan, K., Joseph, S., Huy, N., Christopher, D.M.: Named Entity Recognition with Character-Level Models. In: 7th Conference on Natural Language Learning, pp. 180–183 (2003)
Casey, W., Alex, K., Nemanja, P., Lyle, U.: Web-Scale Named Entity Recognition. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, Napa Valley, California, USA, pp. 123–132 (2008)
Einat, M., Richard, C.W., William, W.C.: Extracting Personal Names from Email: Applying Named Entity Recognition to Informal Text. In: Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 443–450 (2005)
Charles, S., Andrew, M.: Collective Segmentation and Labeling of Distant Entities in Information Extraction. In: ICML Workshop on Statistical Relational Learning (2004)
Razvan, C.B., Raymond, J.M.: Relational Markov Networks for Collective Information Extraction. In: ICML- Workshop on Statistical Relational Learning (2004)
Le, H.T., Nguyen, T.H.: Name Entity Recognition using Inductive Logic Programming. In: Symposium on Information and Communication Technology, Vietnam, pp. 71–77 (2010)
Marie-Catherine, D.M., Bill, M., Christopher, D.M.: Generating Typed Dependency Parses from Phrase Structure Parses. In: LREC (2006)
Christiane, F.: WordNet:An Electronic Lexical Database. MIT Press, Cambridge
WordNet Statistic, http://wordnet.princeton.edu/wordnet/man/wnstats.7WN.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Goh, HN., Soon, LK., Haw, SC. (2012). VAHA: Verbs Associate with Human Activity – A Study on Fairy Tales. In: Jiang, H., Ding, W., Ali, M., Wu, X. (eds) Advanced Research in Applied Artificial Intelligence. IEA/AIE 2012. Lecture Notes in Computer Science(), vol 7345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31087-4_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-31087-4_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31086-7
Online ISBN: 978-3-642-31087-4
eBook Packages: Computer ScienceComputer Science (R0)