Abstract
The advantages of the eXtensible Markup Language, XML, come at a cost, especially for huge datasets or when used on small mobile devices. Several known XML-conscious compressors used in real time environments compress data during data streaming. This paper presents a study of new real time algorithms that exploit local structural redundancies of pre-order traversals of an XML tree. These algorithms focus on reducing the overhead of streaming data while maintaining load balancing between the sender and receiver. Our algorithms have similar or better performance than existing algorithms, while emphasizing low memory and processing overheads.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
XML: EXtensible Markup Language (XML) 1.0, 5th edn. (2012). http://www.w3.org/TR/REC-xml/. Accessed 20 June 2013
GZIP: The gzip home page (2012). http://www.gzip.org. Accessed 20 June 2013
Hartmut, L., Suciu, D.: XMill: an efficient compressor for XML data. ACM Special Interest Group on Management of Data (SIGMOD) Record 29, 153–164 (2000)
Tolani, P., Haritsa, J.: XGRIND: a query-friendly XML compressor. In: International Conference on Data Engineering (ICDE)’ 02, pp. 225–234 (2002)
Arion, A., Bonifati, A., Manolescu, I., Pugliese, A.: XQueC: a query-conscious compressed XML database. ACM Trans. Internet Technol. 7, 1–35 (2007)
Corbin, T., Müldner, T., Miziołek, J.: Pre-order compression schemes for XML in the real time environment. In: SciTePress Digital Library - WEBIST 2013 - 9th International Conference on Web Information Systems and Technologies, Aachen, Germany, pp. 5–15. SciTePress Digital Library (2013)
Skibiński, P., Swacha, J.: Combining efficient XML compression with query processing. In: Ioannidis, Y., Novikov, B., Rachev, B. (eds.) ADBIS 2007. LNCS, vol. 4690, pp. 330–342. Springer, Heidelberg (2007)
Lin, Y., Zhang, Y., Li, Q., Yang, J.: Supporting efficient query processing on compressed XML files. In: Proceedings of the Symposium on Applied Computing (SAC) ’05, pp. 660–665. ACM, New York (2005)
Ng, W., Lam, W.Y., Wood, P., Levene, N.: XCQ: a queriable XML compression system. Knowl. Inf. Syst. 10, 421–452 (2006)
XPath: XML Path Language (XPath) (2012). http://www.w3.org/TR/xpath/. Accessed on 20 June 2013
XQuery: XQuery 1.0: An XML Query Language, 2nd edn. (2012). http://www.w3.org/TR/xquery/. Accessed 20 June 2013
Müldner, T., Fry, C., Miziołek, J., Durno, S.: SXSAQCT and XSAQCT: XML queryable compressors. In: Böttcher, S., M. Lohrey, S.M., Rytter, W. (eds.) Structure-Based Compression of Complex Massive Data. Dagstuhl Seminar Proceedings, vol. 08261, Dagstuhl, Germany. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany (2008)
Müldner, T., Fry, C., Miziołek, J.: Online Internet communication using an XML compressor. In: The Seventh International Conference on Internet and Web Applications and Services, Stuttgart, Germany, pp. 131–136. International Academy, Research, and Industry Association (IARIA) (2012)
Müldner, T., Fry, C., Miziołek, J., Durno, S.: XSAQCT: XML queryable compressor. In: Balisage: The Markup Conference 2009, Montreal, Canada (2009)
soap: SOAP Version 1.2 Part 1: Messaging Framework, 2nd edn. (2012). http://www.w3.org/TR/soap12-part1/. Accessed 20 June 2013
Leighton, G., Müldner, T., Diamond, J.: TREECHOP: a tree-based query-able compressor for XML. In: The Ninth Canadian Workshop on Information Theory, pp. 115–118 (2005)
Java: Java version 7 (2012). http://www.oracle.com/technetwork/java/javase/7u-relnotes-515228.html. Accessed 20 June 2013
HTTP: HTTP RFC 2616 (2012). http://www.w3.org/Protocols/rfc2616/rfc2616.html. Accessed 20 June 2013
Peintner, D.: EXI: EXIficient (2012). http://exificient.sourceforge.net. Accessed 20 June 2013
enwiki dumps: enwiki-latest.xml (2012). http://dumps.wikimedia.org/enwiki/latest/. Accessed 20 June 2013
xmlgen: The benchmark data generator (2012). http://www.xml-benchmark.org/generator.html. Accessed 20 June 2013
Corpus: Wratislavia XML corpus (2012). http://www.ii.uni.wroc.pl/~inikep/research/Wratislavia/. Accessed 20 June 2013
Baseball: baseball.xml (2012). http://rassyndrome.webs.com/CC/Baseball.xml. Accessed 20 June 2013
Macbeth: macbeth.xml (2012). http://www.ibiblio.org/xml/examples/. Accessed 20 June 2013
Acknowledgements
The work of the first and second authors are partially supported by the NSERC CSG-M (Canada Graduate Scholarship-Masters) and NSERC RG-PIN grant respectively. We would like to thank the anonymous reviewers for their detailed and helpful comments used to improve the final version of our paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Corbin, T., Müldner, T., Miziołek, J.K. (2014). Networked XML Compression by Encoding Pre-order Traversals. In: Krempels, KH., Stocker, A. (eds) Web Information Systems and Technologies. WEBIST 2013. Lecture Notes in Business Information Processing, vol 189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44300-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-662-44300-2_1
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44299-9
Online ISBN: 978-3-662-44300-2
eBook Packages: Computer ScienceComputer Science (R0)