Document-Aware Graph Models for Query-Oriented Multi-document Summarization

Wei, Furu; Li, Wenjie; He, Yanxiang

doi:10.1007/978-3-642-19551-8_24

Furu Wei^7,8,
Wenjie Li⁷ &
Yanxiang He⁹

Part of the book series: Studies in Computational Intelligence ((SCI,volume 346))

1593 Accesses
4 Citations

Abstract

Sentence ranking is the issue of most concern in document summarization. In recent years, graph-based summarization models and sentence ranking algorithms have drawn considerable attention from the extractive summarization community due to their capability of recursively calculating sentence significance from the entire text graph that links sentences together rather than relying on single sentence alone. However, when dealing with multi-document summarization, existing sentence ranking algorithms often assemble a set of documents into one large file. The document dimension is ignored. In this work, we develop two alternative models to integrate the document dimension into existing sentence ranking algorithms. They are the one-layer (i.e. sentence layer) document-sensitive model and the two-layer (i.e. document and sentence layers) mutual reinforcement model. While the former implicitly incorporates the document’s influence in sentence ranking, the latter explicitly formulates the mutual reinforcement among sentence and document during ranking. The effectiveness of the proposed models and algorithms are examined on the DUC query-oriented multi-document summarization data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 37751; Price includes VAT (Japan)

Softcover Book: JPY 47189; Price includes VAT (Japan)

Hardcover Book: JPY 47189; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

GuideRank: A Guided Ranking Graph Model for Multilingual Multi-document Summarization

User Intention-Based Document Summarization on Heterogeneous Sentence Networks

Automatic Multi-Document Summarization Based on Keyword Density and Sentence-Word Graphs

Article 07 June 2018

References

Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. The ACM Press, New York (1999)
Google Scholar
Brin, S., Page, L.: The Anatomy of a Large-scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems 30(1-7), 107–117 (1998)
Article Google Scholar
DUC, http://duc.nist.gov/
DUC Reports, http://www-nlpir.nist.gov/projects/duc/pubs.html
Erkan, G., Radev, D.R.: LexRank: Graph-based Centrality as Salience in Text Summarization. Journal of Artificial Intelligence Research 22, 457–479 (2004)
Google Scholar
Haveliwala, T.H.: Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. IEEE Transactions on Knowledge and Data Engineering 15(4), 784–796 (2003)
Article Google Scholar
Jones, K.S.: Automatic Summarising: The State of the art. Information Processing and Management 43, 1449–1481 (2007)
Article Google Scholar
Langville, A.N., Meyer, C.D.: Deeper Inside PageRank. Journal of Internet Mathematics 1(3), 335–380 (2004)
Article MATH MathSciNet Google Scholar
Leskovec, J., Grobelnik, M., Milic-Frayling, N.: Learning Sub-structures of Document Semantic Graphs for Document Summarization. In: Proceedings of Link KDD Workshop, pp. 133–138 (2004)
Google Scholar
Li, W.J., Wu, M.L., Lu, Q., Xu, W., Yuan, C.F.: Extractive Summarization using Intra- and Inter-Event Relevance. In: Proceedings of ACL/COLING, pp. 369–376 (2006)
Google Scholar
Lin, C.Y., Hovy, E.: The Automated Acquisition of Topic Signature for Text Summarization. In: Proceedings of 18th COLING, pp. 495–501 (2000)
Google Scholar
Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics. In: Proceedings of HLT-NAACL, pp. 71–78 (2003)
Google Scholar
Lin, Z.H., Chua, T.S., Kan, M.Y., Lee, W.S., Qiu, L., Ye, S.R.: NUS at DUC 2007: Using Evolutionary Models for Text. In: Proceedings of Document Understanding Conference (2007)
Google Scholar
Mihalcea, R.: Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization. In: Proceedings of ACL 2004, Article No. 20 (2004)
Google Scholar
Mihalcea, R.: Language Independent Extractive Summarization. In: Proceedings of ACL 2005, pp. 49–52 (2005)
Google Scholar
Kleinberg, J.M.: Authoritative Sources in a Hyperlinked Environment. In: Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 668–677 (1999)
Google Scholar
Mani, I., Maybury, M.T. (eds.): Advances in Automatic Summarization. The MIT Press, Cambridge (1999)
Google Scholar
Otterbacher, J., Erkan, G., Radev, D.R.: Using Random Walks for Question-focused Sentence Retrieval. In: Proceedings of HLT/EMNLP, pp. 915–922 (2005)
Google Scholar
Ouyang, Y., Li, S.Y., Li, W.J.: Developing Learning Strategies for Topic-Based Summarization. In: Proceedings of the 16th ACM Conference on Information and Knowledge Management, pp. 79–86 (2007)
Google Scholar
Over, P., Dang, H., Harman, D.: DUC in Context. Information Processing and Management 43(6), 1506–1520 (2007)
Article Google Scholar
Porter Stemmer, http://www.tartarus.org/~martin/PorterStemmer
Radev, D.R., Jing, H.Y., Stys, M., Tam, D.: Centroid-based Summarization of Multiple Documents. Information Processing and Management 40, 919–938 (2004)
Article MATH Google Scholar
Vanderwende, L., Banko, M., Menezes, A.: Event-Centric Summary Generation. In: Working Notes of DUC 2004 (2004)
Google Scholar
Wan, X.J., Yang, J.W., Xiao, J.G.: Using Cross-document Random Walks for Topic-focused Multi-document Summarization. In: Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 1012–1018 (2006)
Google Scholar
Wan, X.J., Yang, J.W., Xiao, J.G.: Towards Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction. In: Proceedings of ACL, pp. 552–559 (2007)
Google Scholar
Wei, F.R., Li, W.J., Lu, Q., He, Y.X.: A Cluster-Sensitive Graph Model for Query-Oriented Multi-document Summarization. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 446–453. Springer, Heidelberg (2008)
Chapter Google Scholar
Wei, F.R., Li, W.J., Lu, Q., He, Y.X.: Query-Sensitive Mutual Reinforcement Chain with Its Application in Query-Oriented Multi-Document Summarization. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 283–290 (2008)
Google Scholar
Wong, K.F., Wu, M.L., Li, W.J.: Extractive Summarization Using Supervised and Semi-Supervised Learning. In: Proceedings of the 22nd International Conference on Computational Linguistics, pp. 985–992 (2008)
Google Scholar
Yoshioka, M., Haraguchi, M.: Multiple News Articles Summarization based on Event Reference Information. In: Working Notes of NTCIR-4 (2004)
Google Scholar
Zha, H.Y.: Generic Summarization and Key Phrase Extraction using Mutual Reinforcement Principle and Sentence Clustering. In: Proceedings of the 25th ACM SIGIR, pp. 113–120 (2002)
Google Scholar
Padmanabhan, D., Desikan, P., Srivastava, J., Riaz, K.: WICER: A Weighted Inter-Cluster Edge Ranking for Clustered Graphs. In: Proceedings of 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 522–528 (2005)
Google Scholar
Wei, F.R., Li, W.J., Lu, Q., He, Y.X.: Applying Two-Level Mutual Reinforcement Ranking in Query-Oriented Multi-document Summarization. Journal of the American Society for Information Science and Technology (2009) (in press)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, The Hong Kong Polytechnic University, Hong Kong
Furu Wei & Wenjie Li
IBM China Research Laboratory, Beijing, China
Furu Wei
Department of Computer Science and Technology, Wuhan University, Wuhan, China
Yanxiang He

Authors

Furu Wei
View author publications
You can also search for this author in PubMed Google Scholar
Wenjie Li
View author publications
You can also search for this author in PubMed Google Scholar
Yanxiang He
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Engineering , Nanyang Technological University, 639798, Singapore
Weisi Lin & Dacheng Tao &
Intelligent Systems Laboratory Systems Research Institute , Polish Academy of Sciences, Poland
Janusz Kacprzyk
Department of Computing , Hong Kong Polytechnic University, Hung Hom, Hong Kong
Zhu Li
School of Electronic Engineering and Computer Science, Queen Mary, University of London, London, U.K.
Ebroul Izquierdo
TCL-Thomson Electronics , Santa Clara, California
Haohong Wang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wei, F., Li, W., He, Y. (2011). Document-Aware Graph Models for Query-Oriented Multi-document Summarization. In: Lin, W., Tao, D., Kacprzyk, J., Li, Z., Izquierdo, E., Wang, H. (eds) Multimedia Analysis, Processing and Communications. Studies in Computational Intelligence, vol 346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19551-8_24

Download citation

DOI: https://doi.org/10.1007/978-3-642-19551-8_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19550-1
Online ISBN: 978-3-642-19551-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Document-Aware Graph Models for Query-Oriented Multi-document Summarization

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

GuideRank: A Guided Ranking Graph Model for Multilingual Multi-document Summarization

User Intention-Based Document Summarization on Heterogeneous Sentence Networks

Automatic Multi-Document Summarization Based on Keyword Density and Sentence-Word Graphs

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Document-Aware Graph Models for Query-Oriented Multi-document Summarization

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

GuideRank: A Guided Ranking Graph Model for Multilingual Multi-document Summarization

User Intention-Based Document Summarization on Heterogeneous Sentence Networks

Automatic Multi-Document Summarization Based on Keyword Density and Sentence-Word Graphs

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation