Abstract
Since sentences are the basic propositional units of text, knowing their themes should help various tasks requiring the knowledge about the semantic content of text. In this paper, we examine the notion of sentence theme and propose an automatic scheme where head-driven patterns are used for theme assignment. We tested our scheme with sentences in encyclopedia articles and obtained a promising result of 98.96% in F-score for training data and 88.57% for testing data, which outperform the baseline.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cunningham, J.W., Moore, D.M.: The Confused world of main idea. In: Baumann, J.F. (ed.) Teaching main idea, IRA, N.Y (1986)
Fellbaum, C., et al.: WordNet:An eletroic lexical database. The MIT press, IRA (1998)
Yang, Y.: An evaluation of statistical approaches to text categorization, Information retrieval (1998)
Jeong, T.J., Jang, B.T.: Web Information search using reinforcement learning. In: Proceedings of the Korean information science society conference (KISS) (2001)
Park, K.R., Jang, E.J., Park, S.G.: Improved link analysis algorithm using document feature information. In: KISS 2003 (2003)
Lee, S.E., Hwang, I.J.: Summarizing relevant web pages based on semantic region. In: KISS 2003 (2003)
Seo, H.: A Study on the structure and theme-construction of Korean discourses, Ph.D dissertation, Seoul National University (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, BY., Myaeng, SH. (2005). A Theme Allocation for a Sentence Based on Head Driven Patterns. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_24
Download citation
DOI: https://doi.org/10.1007/11551874_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)