Abstract
The design of continuous query languages for data streams and the extent to which these should rely on database query languages represent pivotal issues for data stream management systems (DSMSs). The Expressive Stream Language (ESL) of our Stream Mill system is designed to maximize the spectrum of applications a DSMS can support efficiently, while retaining compatibility with the SQL:2003 standards. This approach offers significant advantages, particularly for the many applications that span both data streams and databases. Therefore, ESL supports minimal extensions required to overcome SQL’s expressive power limitations—a critical enhancement since said limitations are quite severe on database applications and are further exacerbated on data stream applications, where, e.g., only nonblocking query operators can be used. Thus, ESL builds on user-defined aggregates and flexible window mechanisms to turn SQL into a powerful and computationally-complete query language, which is capable of supporting applications, such as data stream mining and sequence queries that are beyond the application scope of other DSMSs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
D. Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, S.Z. Aurora, A new model and architecture for data stream management. VLDB J. 12(2), 120–139 (2003)
C. Cortes, K. Fisher, D. Pregibon, A. Rogers, Hancock: a language for extracting signatures from data streams, in SIGKDD (2000), pp. 9–17
P. Felber, P. Eugster, R. Guerraoui, A. Kermarrec, The many faces of publish/subscribe. ACM Comput. Surv. 35(2), 114–131 (2003)
ISO/IEC. Database languages—SQL, ISO/IEC 9075-*:2003 (2003)
B. Babcock, S. Babu, M. Datar, R. Motwani, J. Widom, Models and issues in data stream systems, in PODS (2002), pp. 1–16
Y.-N. Law, H. Wang, C. Zaniolo, Data models and query language for data streams, in VLDB (2004), pp. 492–503
I. Information Technologies, Illustra user’s guide, in 1111 Broadway, Suite 2000, Oakland, CA (1994)
D. Florescu, C. Hillery, D. Kossmann et al., The BEA/XQRL streaming xquery processor. VLDB J. 13(3), 294–315 (2004)
Oracle. Oracle9i application developer’s guide advanced queuing. Oracle, Redwood Shores, CA, USA (2002)
H. Wang, C. Zaniolo, Using SQL to build new aggregates and extenders for object-relational systems, in VLDB (2000), pp. 166–175
H. Wang, C. Zaniolo, Atlas: a native extension of sql for data minining, in Proceedings of Third SIAM Int. Conference on Data Mining (2003), pp. 130–141
J. Li, D. Maier, K. Tufte, V. Papadimos, P.A. Tucker, Semantics and evaluation techniques for window aggregates in data streams, in SIGMOD Conference (2005), pp. 311–322
Stream mill home. http://wis.cs.ucla.edu/stream-mill
A. Arasu, S. Babu, J. Widom, Cql: a language for continuous queries over streams and relations, in DBPL (2003), pp. 1–19
C. Cranor, Y. Gao, T. Johnson, V. Shkapenyuk, O. Spatscheck, Gigascope: high performance network monitoring with an sql interface, in SIGMOD (ACM, New York, 2002), p. 623
R. Sadri, C. Zaniolo, A. Zarkesh, J. Adibi, Optimization of sequence queries in database systems, in PODS (2001)
R. Sadri, C. Zaniolo, A.M. Zarkesh, J. Adibi, Expressing and optimizing sequence queries in database systems. ACM Trans. Database Syst. 29(2), 282–318 (2004)
F. Zemke, A. Witkowski, M. Cherniak, L. Colby, Pattern matching in sequences of rows, in Sql Change Proposal (2007). http://www.sqlsnippets.com/en/topic-12162.html
N. Dindar, B. Güç, P. Lau, A. Ozal, M. Soner, N. Tatbul, Dejavu: declarative pattern matching over live and archived streams of events, in SIGMOD Conference (2009), pp. 1023–1026
E. Wu, Y. Diao, S. Rizvi, High-performance complex event processing over streams, in SIGMOD Conference (2006), pp. 407–418
D. Gyllstrom, J. Agrawal, Y. Diao, N. Immerman, On supporting kleene closure over event streams, in ICDE (2008), pp. 1391–1393
A.J. Demers et al., Cayuga: a high-performance event processing engine, in SIGMOD Conference (2007), pp. 1100–1102
R.S. Barga et al., Consistent streaming through time: a vision for event stream processing, in CIDR (2007), pp. 363–374
B. Mozafari, K. Zeng, C. Zaniolo, K*SQL: a unifying engine for sequence patterns and XML, in SIGMOD Conference–Demo Track (2010), pp. 1143–1146
B. Mozafari, K. Zeng, C. Zaniolo, From regular expressions to nested words: unifying languages and query execution for relational and XML sequences. Proc. VLDB Endow. 3(1), 150–161 (2010)
R. Alur, P. Madhusudan, Adding nesting structure to words, in Developments in Language Theory (2006)
R. Alur, P. Madhusudan, Visibly pushdown languages, in STOC (2004), pp. 202–211
X. Zhou, H. Thakkar, C. Zaniolo, Unifying the processing of XML streams and relational data streams, in ICDE (2006), p. 50
U. Srivastava, J. Widom, Memory-limited execution of windowed stream joins, in VLDB (2004), pp. 324–335
Y. Bai, H. Thakkar, C. Luo, H. Wang, C. Zaniolo, A data stream language and system designed for power and flexibility, in CIKM (2006), pp. 337–346
L. Golab, M. Tamer Özsu, Update-pattern-aware modeling and processing of continuous queries, in ACM SIGMOD Conference (2005), pp. 658–669
B. Mozafari, C. Zaniolo, Optimal load shedding with aggregates and mining queries, in ICDE (2010), pp. 76–88
Y.-N. Law, C. Zaniolo, Improving the accuracy of continuous aggregates and mining queries on data streams under load shedding. Int. J. Bus. Intell. Data Min. 3(1), 99–117 (2008)
M. Datar, A. Gionis, P. Indyk, R. Motwani, Maintaining stream statistics over sliding windows: (extended abstract), in Proceedings of the Thirteenth Annual ACM–SIAM Symposium on Discrete Algorithms (2002), pp. 635–644
C. Aggarwal, Data Streams: Models and Algorithms (Springer, Berlin, 2007)
H. Mousavi, C. Zaniolo, Fast and accurate computation of equi-depth histograms over data streams, in EDBT (2011), pp. 69–80
C. Jin, W. Qian, C. Sha, J.X. Yu, A. Zhou, Dynamically maintaining frequent items over a data stream, in Proceedings of the 12th ACM Conference on Information and Knowledge Management (CIKM) (2003)
M. Charikar, K. Chen, M. Farach-Colton, Finding frequent items in data streams, in International Colloquium on Automata, Languages, and Programming (ICALP) (2000), pp. 508–515
G. Cormode, S. Muthukrishnan, What’s hot and what’s not: tracking most frequent items dynamically, in PODS (2003), pp. 296–306
H. Thakkar, N. Laptev, H. Mousavi, B. Mozafari, V. Russo, S.M.M. Carlo Zaniolo, A data stream management system for knowledge discovery, in ICDE (2011), pp. 757–768
S. Sarawagi, S. Thomas, R. Agrawal, Integrating association rule mining with relational database systems: alternatives and implications, in SIGMOD (1998)
T. Imielinski, H. Mannila, A database perspective on knowledge discovery. Commun. ACM 39(11), 58–64 (1996)
C. Zaniolo, Mining databases and data streams with query languages and rules—invited paper, in KDID 2005: Knowledge Discovery in Inductive Databases, 4th International Workshop. Lecture Notes in Computer Science, vol. 3933 (Springer, Berlin, 2006), pp. 24–37
Z. Tang, J. Maclennan, P.P. Kim, Building data mining solutions with OLE DB for DM and XML for analysis. SIGMOD Rec. 34(2), 80–85 (2005)
H. Thakkar, B. Mozafari, C. Zaniolo, Designing an inductive data stream management system: the stream Mill experience, in SSPS (2008), pp. 79–88
H.-P. Kriegel, M. Ester, J. Sander, X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, in KDD (1996), pp. 226–231
H. Wang Wei Fan, P.S. Yu, J. Han, Mining concept-drifting data streams using ensemble classifiers, in KDD (2003), pp. 226–235
B. Mozafari, H. Thakkar, C. Zaniolo, Verifying and mining frequent patterns from large windows over data streams, in ICDE (2008), pp. 179–188
A. Bifet, G. Holmes, B. Pfahringer, P. Kranen, H. Kremer, T. Jansen, T. Seidl, Moa: massive online analysis, a framework for stream classification and clustering. J. Mach. Learn. Res. 11, 44–50 (2010)
Y. Bai, C. Zaniolo, Minimizing latency and memory in DSMS: a unified approach to quasi-optimal scheduling, in SSPS (2008), pp. 58–67
Y. Bai, H. Thakkar, H. Wang, C. Zaniolo, Optimizing timestamp management in data stream management systems, in ICDE (2007), pp. 1334–1338
L. Golab, M. Tamer Özsu, Issues in data stream management. ACM SIGMOD Rec. 32(2), 5–14 (2003)
D. Barbara, The characterization of continuous queries. Int. J. Coop. Inf. Syst. 8(4), 295–323 (1999)
D.B. Terry, D. Goldberg, D.A. Nichols, B.M. Oki, Continuous queries over append-only databases, in SIGMOD Conference (1992), pp. 321–330
M. Sullivan, Tribeca: a stream database manager for network traffic analysis, in VLDB (1996), p. 594
S. Chandrasekaran et al., TelegraphCQ: continuous dataflow processing for an uncertain world, in CIDR (2003)
L. Liu, C. Pu, W. Tang, Continual queries for Internet scale event-driven information delivery. IEEE Trans. Knowl. Data Eng. 11(4), 583–590 (1999)
J. Chen, D.J. DeWitt, F. Tian, Y. Wang, NiagaraCQ: a scalable continuous query system for Internet databases, in SIGMOD (2000), pp. 379–390
H. Jagadish, I. Mumick, A. Silberschatz, View maintenance issues for the chronicle data model, in PODS (1995), pp. 113–124
A. Kumar Gupta, D. Suciu, Stream processing of xpath queries with predicates, in SIGMOD Conference (2003), pp. 419–430
D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, G. Seidman, M. Stonebraker, N. Tatbul, S. Zdonik, Monitoring streams—a new class of data management applications, in VLDB, Hong Kong, China (2002)
Y.-N. Law, H. Wang, C. Zaniolo, Relational languages and data models for continuous queries on sequences and data streams. ACM Trans. Database Syst. 36, 8 (2011)
C. Luo, H. Thakkar, H. Wang, C. Zaniolo, A native extension of SQL for mining data streams, in ACM SIGMOD Conference 2005 (2005), pp. 873–875
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Laptev, N. et al. (2016). Extending Relational Query Languages for Data Streams. In: Garofalakis, M., Gehrke, J., Rastogi, R. (eds) Data Stream Management. Data-Centric Systems and Applications. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28608-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-28608-0_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28607-3
Online ISBN: 978-3-540-28608-0
eBook Packages: Computer ScienceComputer Science (R0)