Abstract
A technical trend in supporting data intensive applications is to push-down them to database engines by wrapping computations with User Defined Functions (UDFs) callable from the SQL interface. For seamlessly integrating applications into SQL queries, we introduce a special kind of UDFs with both input and output as relations, or row sets, called Relation Valued Functions (RVFs). As a single SQL statement has limited expressive power on complex data flows and control flows, we propose the notion of Correlated Query Process (CQP), for specifying an application at the process level, by several queries and RVFs with interleaved steps.
We further extend this notion to Collaborative CQP with multiple participating parties which have correlated but separate goals. A truly P2P execution model is proposed, under which the logical execution of a collaborative CQP consists of a set of individual peer CQP executions run by the participating parties. These peer executions are based on the same CQP template; but each peer represents a designated role, it actively processes the steps belong to that role, and skip the steps not belonging to that role. They synchronize peer CQP executions by passing the query return messages. This collaborative process execution model represents a shift from coordinator based distributed query processing to truly P2P collaboration.
This work represents an initial step to the synthesis of relational operation, UDF and business process. We further envisage that the proposed P2P CQP execution model will have impact not only on inter-enterprise collaboration but also on the cooperation of multiple share-nothing data servers.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aberer, K., Hauswirth, M.: P2P Information Systems: Concepts and Models, State of the Art, and Future Systems. In: ICDE 2002. 187– Advanced Technology Seminar (2002)
Bernstein, P.A., Giunchiglia, F., Kementsietsidis, A., Mylopoulos, J., Serafini, L., Zaihrayeu, I.: Data Management for Peer-to-Peer Computing: A Vision. In: Proc. of WebDB 2002 (2002)
Chen, Q., Hsu, M.: CPM Revisited – An Architecture Comparison. In: Proc. of 10th Int’l Conf on Cooperative Information Systems (Coopis 2002), USA (2002)
Chen, Q., Hsu, M.: Inter-Enterprise Collaborative Business Process Management. In: Proc. of 17th Int’l Conf on Data Engineering (ICDE 2001), Germany (2001)
Chen, Q., Kambayashi, Y.: Nested Relation Based Database Knowledge Representation. In: Proc. of ACM SIGMOD 1991. ACM SIGMOD Rec., vol. 20(2) (1991)
Chen, Q.: A Rule-based Object/Task Modeling Approach. In: ACM SIGMOD (1986)
Dayal, U., Chen, Q.: From Database Programming to Process Management Programming. In: DBPL-5 (1996)
Jim Gray, D.T., Liu, M., Nieto-Santisteban, M.A., Szalay, A.S., Heber, G., DeWitt, D.: Scientific Data Management in the Coming Decade. SIGMOD Record 34(4) (2005)
Gribble, S.D., Halevy, A.Y., Ives, Z.G., Rodrig, M., Suciu, D.: What Can Database Do for Peer-to-Peer? In: WebDB 2001, pp. 31–36 (2001)
Hsu, M., Xiong, Y.: Building a Scalable Web Query System. In: Bhalla, S. (ed.) DNIS 2007. LNCS, vol. 4777. Springer, Heidelberg (2007)
Kossmann, D.: The State of the Art in Distributed Query Processing. ACM Computing Surveys 32, 422–469 (2000)
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: A Not-So-Foreign Language for Data Processing. In: ACM SIGMOD (2008)
Papadimos, V., Maier, D.: Distributed Queries without Distributed State. In: WebDB (2002)
Tatarinov, I., Ives, Z., Madhavan, J., Halevy, A., Suciu, D., Dalvi, N., Dong, X.L., Kadiyska, Y., Miklau, G., Mork, P.: The Piazza peer data management project. In: ACM SIGMOD (2003)
Workflow Management Coalition, http://www.aiim.org/wfmc/mainframe.htm
Yang, B., Garcia-Molina, H.: Comparing Hybrid Peer-to-Peer Systems. In: VLDB 2001 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Q., Hsu, M. (2008). Correlated Query Process and P2P Execution. In: Hameurlain, A. (eds) Data Management in Grid and Peer-to-Peer Systems. Globe 2008. Lecture Notes in Computer Science, vol 5187. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85176-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-85176-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85175-2
Online ISBN: 978-3-540-85176-9
eBook Packages: Computer ScienceComputer Science (R0)