QReduction: Synopsizing XPath Query Set Efficiently under Resource Constraint | SpringerLink
Skip to main content

QReduction: Synopsizing XPath Query Set Efficiently under Resource Constraint

  • Conference paper
Advances in Web-Age Information Management (WAIM 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3129))

Included in the following conference series:

  • 933 Accesses

Abstract

How to evaluate a massive XPath set over XML streams poses great challenges to database researchers. Current work chiefly focuses on evaluating efficiently massive XPath set to obtain precise results. The size of the input query set has a great impact on the resource requirement and the efficiency of evaluation. In this paper, we propose a novel method, QReduction, to obtain the synopsized XPath query set to represent the original query set, while at the same time to minimize the ’precision loss’ caused by query set synopsis. QReduction discovers frequent patterns among the massive input XPath tree patterns first, and select query set synopsis from them based on a dynamic benefit model under resource constraints. Since frequent patterns discovery takes high complexity in QReduction, we propose optimization methods by pushing the constraints of QReduction into the discovery process. We propose 3 criteria, namely recall, precision and intersection to determine a better synopsis. The experimental results demonstrate that our method can produce a query set synopsis with high precision, recall and intersection under given resource constraints.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 11439
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 14299
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Amer-Yahia, S., Cho, S., Srivastava, D.: Tree Pattern Relaxation. In: Proceedings of 8th EDBT conference, pp. 496–513 (2002)

    Google Scholar 

  2. Yang, L.H., Lee, M.-L., Hsu, W.: Efficient Mining of XML Query Patterns for Caching. In: Proceedings of 29th VLDB conference, pp. 69–80 (2003)

    Google Scholar 

  3. Gupta, A.K., Suciu, D.: Stream Processing of XPath Queries with Predicates. In: Proceeding of SIGMOD conference, pp. 419–430 (2003)

    Google Scholar 

  4. Diao, Y., Fischer, P., Franklin, M., Raymond: YFilter: efficient and scalable filtering of XML documents. In: Proceeding of ICDE conference, pp. 341–345 (2002)

    Google Scholar 

  5. Clark, J.: XML Path language(XPath) (1999), available from the W3C, http://www.w3.org/TR/XPath

  6. Garofalakis, M., Chan, C.Y., fan, W., Felber, P., Rastogi, R.: Tree pattern aggregation for scalable XML data dissemination. In: Proceedings of 28th VLDB conference (2002)

    Google Scholar 

  7. Miklau, G., suciu, D.: Containment and equivalence for an XPath fragment. In: Proceedings of 21th PODS, pp. 65–76 (2002)

    Google Scholar 

  8. C.Chan, P.Felber, M.Garofalakis and R.Rastogi. Efficient filtering of XML document with XPath expressions. In proceedings of 18th ICDE conference, 2002, pages 235-244.

    Google Scholar 

  9. Garofalakis, M.N., Gibbon, P.B.: Approximate Query Processing: Taming the TeraBytes. In: Proceedings of 27th VLDB Conference (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gao, J., Ma, X., Yang, D., Wang, T., Tang, S. (2004). QReduction: Synopsizing XPath Query Set Efficiently under Resource Constraint. In: Li, Q., Wang, G., Feng, L. (eds) Advances in Web-Age Information Management. WAIM 2004. Lecture Notes in Computer Science, vol 3129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27772-9_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27772-9_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22418-1

  • Online ISBN: 978-3-540-27772-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics