Abstract
Implementation of the data stream processing applications requires a method for formal specification of the computations at a dataflow level. The logical models of stream processing hide the lower level implementation details. To solve this problem, we propose a new model of data stream processing based on the concepts of relational data stream, extensible system of elementary operations on relational streams, and data stream processing network integrating the dataflows and elementary operations. Next, we present the transformations of grouped data stream processing applications into data stream processing networks. The transformations proposed in the paper integrate the networks and optimize the implementations through elimination of the redundant elementary operations and dataflows. Finally, the paper introduces a timestamp based synchronization of data flows in our model and discusses its correctness.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Popa, L. (ed.) Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 1–16. ACM Press, New York (2002)
Madden, S., Franklin, M.J.: Fjording the stream: An architecture for queries over streaming sensor data. In: 18th International Conference on Data Engineering, San Jose, California, February 26-March 1, IEEE, Los Alamitos (2002)
Arasu, A., Babcock, B., Babu, S., McAlister, J., Widom, J.: Characterizing memory requirements for queries over continuous data streams. In: Popa, L. (ed.) Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 221–232. ACM Press, New York (2002)
Babu, S., Widom, J.: Continuous queries over data streams. SIGMOD Record 30, 109–120 (2001)
Krishnamurthy, S., Chandrasekaran, S., Cooper, O., Deshpande, A., Franklin, M.J., Hellerstein, J.M., Hong, W., Madden, S.R., Reiss, F., Shah, M.A.: Telegraphcq: An architectural status report. Bulletin of the Technical Committee on Data Engineering 26, 11–18 (2003)
Hellerstein, J.M., Franklin, M.J., Chandrasekaran, S., Deshpande, A., Hildrum, K., Madden, S., Raman, V., Shah, M.A.: Adaptive query processing: Technology in evolution. Bulletin of the Technical Committee on Data Engineering 23, 7–18 (2000)
Cole, R.L.: A decision theoretic cost model for dynamic plans. Bulletin of the Technical Committee on Data Engineering 23, 34–41 (2000)
Bouganim, L., Fabret, F., Mohan, C.: A dynamic query processing architecture for data integration systems. Bulletin of the Technical Committee on Data Engineering 23, 42–48 (2000)
Ives, Z.G., Levy, A.Y., Weld, D.S., Florescu, D., Friedman, M.: Adaptive query processing for internet applications. Bulletin of the Technical Committee on Data Engineering 23, 19–26 (2000)
Urhan, T., Franklin, M.J.: Xjoin: A reactively-scheduled pipelined join operator. IEEE Data Engineering Bulletin 23(2), 27–33 (2000)
Terry, D., Goldberg, D., Nichols, D., Oki, B.: Continuous queries over append-only databases. In: Proceedings of the 1992 ACM SIGMOD International Conference on Management of Data, pp. 321–330 (1992)
Liu, L., Pu, C., Tang, W.: Continual queries for internet scale event-driven information delivery. IEEE Transactions on Knowledge and Data Engineering 11, 610–628 (1999)
Hellerstein, A.R.: Eddies: Continuously adaptive query processing. In: Proc. ACMSIGMOD International Conference on Management of Data, pp. 106–117 (1998)
Fiat, A., Woeginger, G.J.: On Line Algorithms, The State of the Art. Springer, Heidelberg (1998)
Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online aggregation. In: Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data. SIGMOD Record, pp. 171–182 (1997)
Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online aggregation. In: SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, pp. 171–182. ACM Press, New York (1997)
Lee, E.A., Parks, T.M.: Dataflow process networks. Technical report, Department of Electrical Engineering and Computer Science, University of California (1995)
Stephens, R.: A survey of stream processing. Technical Report CSRG95-05, Department of Electronic and Electrical Engineering, University of Surrey (1996)
Madden, S., Shah, M., Hellerstein, J.M., Raman, V.: Continuously adaptive continuous queries over streams. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, Wisconsin, June 4-6, pp. 49–60. ACM Press, New York (2002)
Group, T.S.: Stream: The stanford stream data manager. Bulletin of the Technical Committee on Data Engineering 26, 19–26 (2003)
Avnur, R., Hellerstein, J.M.: Eddies: Continuously adaptive query processing. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 261–272. ACM, New York (2000)
Stonebraker, M., Cherniack, M., Cetintemel, U., Balazinska, M., Balakrishnan, H.: The aurora and medusa projects. Bulletin of the Technical Committee on Data Engineering 26, 3–10 (2003)
Viglas, S.D., Naughton, J.F.: Rate-based query optimization for streaming information sources. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, pp. 37–48. ACM Press, New York (2002)
Das, A., Gehrke, J., Riedewald, M.: Approximate join processing over data streams. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, June 9-12 (2003)
Ganguly, S., Garofalakis, M., Rastogi, R.: Processing set expressions over continuous update streams. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, June 9-12 (2003)
Getta, J., Vossough, E.: Optimization of data stream processing. Submitted for publication in SIGMOD Record (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Getta, J.R., Vossough, E. (2004). Grouped Processing of Relational Algebra Expressions over Data Streams. In: Benczúr, A., Demetrovics, J., Gottlob, G. (eds) Advances in Databases and Information Systems. ADBIS 2004. Lecture Notes in Computer Science, vol 3255. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30204-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-30204-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23243-8
Online ISBN: 978-3-540-30204-9
eBook Packages: Springer Book Archive