Abstract
Extensible Markup Language (XML) is a simple, flexible text format derived from SGML, which is originally designed to support large-scale electronic publishing. Nowadays XML plays a fundamental role in the exchange of a wide variety of data on the Web. As XML allows designers to create their own customized tags, enables the definition, transmission, validation, and interpretation of data between applications, devices and organizations, lots of works in soft computing employ XML to take control and responsibility for the information, such as fuzzy markup language, and accordingly there are lots of XML-based data or documents. However, most of mobile and interactive ubiquitous multimedia devices have restricted hardware such as CPU, memory, and display screen. So, it is essential to compress an XML document/element collection to a brief summary before it is delivered to the user according to his/her information need. Query-oriented XML text summarization aims to provide users a brief and readable substitution of the original retrieved documents/elements according to the user’s query, which can relieve users’ reading burden effectively. We propose a query-oriented XML summarization system QXMLSum, which extracts sentences and combines them as a summary based on three kinds of features: user’s queries, the content of XML documents/elements, and the structure of XML documents/elements. Experiments on the IEEE-CS datasets used in Initiative for the Evaluation of XML Retrieval show that the query-oriented XML summary generated by QXMLSum is competitive.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Acampora G, Loia V (2005) Fuzzy control interoperability and scalability for adaptive domotic framework. IEEE Trans Ind Inf 1(2):97–111
Acampora G, Loia V (2008) A proposal of ubiquitous fuzzy computing for ambient intelligence. Inf Sci 178(3):631–646
Acampora G, Gaeta M, Loia V, Vasilakos AV (2010) Interoperable and adaptive fuzzy services for ambient intelligence applications. ACM Trans Auton Adapt Syst 5(2), art. no. 8
Acampora G, Lee C-S, Vitiello A, Wang M-H (2012) Evaluating cardiac health through semantic soft computing techniques. Soft Comput 16(7):1183–1196
Ali MS, Consens M, Gu X et al (2007) Efficient, effective and flexible XML retrieval using summaries. In: Proceedings of the comparative evaluation of XML information retrieval systems, pp 89–103
Barta A, Consens MP, Mendelzon AO (2005) Benefits of path summaries in an XML query optimizer supporting multiple access methods. In: Proceedings of the international conference on very large data bases (VLDB05), pp 133–144
Bergsma S, Lin D, Goebel R (2009) Web-scale N-gram models for lexical disambiguation. In: Proceedings of the 21st international joint conference on artificial intelligence, pp 1507–1512
Chen D, Tang J, Yao L, Li J, Zhou L (2009) Query-focused summarization by combining topic model and affinity propagation. In: Proceedings of APWeb/WAIM 2009, pp 174–185
Comai S, Marrara S (2004) XML document summarization: using XQuery for synopsis creation. In: Proceedings of the 15th international workshop on database and expert systems applications (DEXA04), pp 928–932
Dalamagas T, Cheng T et al (2004) Clustering XML documents using structural summaries. In: Proceedings of the EDBT workshop on clustering information over the web, pp 547–556
Jinxi X, Bruce Croft W (2000) Improving the effectiveness of informational retrieval with local context analysis. ACM Trans Inf Syst 18(1):79–112
Lee C-S, Wang M-H, Acampora G, Hsu C-Y, Hagras H (2010) Diet assessment based on type-2 fuzzy ontology and fuzzy markup language. Int J Intell Syst 25(12):1187–1216
Lin CY, Hovy E (2000) The automated acquisition of topic signatures for text summarization. In: Proceedings of the 18th COLING Conference, pp 495–501
Lin CY, Hovy E (2002) From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL), pp 457–464
Moreno-Velo FJ, Barros AB, Sánchez-Solano S, Baturone I (2012) XFSML: an XML-based modeling language for fuzzy systems. 2012 IEEE International Conference on Fuzzy Systems, Australia, pp. 1–8
Polyzotis N, Garofalakis M (2002) Statistical synopses for graph structured XML databases. In: Proceedings of the 2002 ACM SIGMOD, pp 358–369
Qin B, Liu T, Li S (2005) Review of multi-document summarization. J Chin Inf Process 19(6):13–20
Szlávik Z, Tombros A, Lalmas M (2007) Feature- and query-based table of contents generation for XML documents. In: Proceedings of ECIR 2007, pp 456–467
Thomas O, Dollmann T (2010) Fuzzy-EPC markup language: XML based interchange formats for fuzzy process models. Soft Comput XML Data Manag Stud Fuzziness Soft Comput 255:227–257
Wei F, He Y, Li W, Huang L (2009) Query-oriented summarization based on neighborhood graph model. In: Proceedings of ICCPOL 2009, pp 156–167
Wenjie L, Furu W, Qin L, Yanxiang H (2008) Ranking sentences with positive and negative reinforcement for query-oriented update summarization. In: Proceedings of the 22nd international conference on computational linguistics (Coling 2008), pp 489–496
World Wide Web Consortium. Extensible Markup Language (XML) 1.0 (Third Edition). W3C Recommendation. 2004, http://www.w3.org/TR/REC-xml/
Acknowledgments
This work is supported by Natural Science Foundation of China (No. 60803105, 61173146) and Science & Technology Project of Department of Education of Jiangxi Province (No. GJJ11731).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by G. Acampora.
Rights and permissions
About this article
Cite this article
Liu, D., Wu, S., Lan, Y. et al. A query-oriented XML text summarization for mobile devices. Soft Comput 17, 1585–1593 (2013). https://doi.org/10.1007/s00500-012-0980-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-012-0980-8