Abstract
In this paper, we study the safety guarantees of group communication-based database replication techniques. We show that there is a model mismatch between group communication and database, and because of this, classical group communication systems cannot be used to build 2-safe database replication. We propose a new group communication primitive called end-to-end atomic broadcast that solves the problem, i.e., can be used to implement 2-safe database replication. We also introduce a new safety criterion, called group-safety, that has advantages both over 1-safety and 2-safety. Experimental results show the gain of efficiency of group-safety over lazy replication, which ensures only 1-safety.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gray, J.N., Helland, P., O’Neil, P., Shasha, D.: The dangers of replication and a solution. In: Proceedings of the 1996 International Conference on Management of Data, Montreal, Canada, pp. 173–182. ACM-SIGMOD (1996)
Schiper, A., Raynal, M.: From group communication to transactions in distributed systems. Communications of the ACM 39, 84–87 (1996)
Agrawal, D., Alonso, G., El Abbadi, A., Stanoi, I.: Exploiting atomic broadcast in replicated databases. Technical report, Department of Computer Science, University of California, Santa Barbara, California USA (1996)
Kemme, B., Alonso, G.: Don’t be lazy, be consistent: Postgres-R, a new way to implement database replication. In: Proceedings of the 26th International Conference on Very Large Databases (VLDB), Cairo, Egypt (2000)
Wiesmann, M.: Group Communications and Database Replication: Techniques, Issues and Performance. PhD thesis, École Polytechnique Fédérale de Lausanne, Switzerland (2002)
Holliday, J., Agrawal, D., Abbadi, A.E.: Using multicast communication to reduce deadlocks in replicated databases. In: Proceedings of the 19th Symposium on Reliable Distributed Systems SRDS 2000, Nürnberg, Germany, pp. 196–205. IEEE Computer Society, Los Alamitos (2000)
Wiesmann, M., Pedone, F., Schiper, A., Kemme, B., Alonso, G.: Database replication techniques: a three parameter classification. In: Proceedings of 19th Symposium on Reliable Distributed Systems (SRDS 2000), Nürnberg, Germany, pp. 206–215. IEEE Computer Society, Los Alamitos (2000)
Keidar, I.: A highly available paradigm for consistent object replication. Master’s thesis, The Hebrew University of Jerusalem, Jerusalem, Israel (1994); Also technical report CS 1994 (1994)
Stanoi, I., Agrawal, D., Abbadi, A.E.: Using broadcast primitives in replicated databases (abstract). In: Proceedings of the 16th Annual Symposium on Principles of Distributed Computing, Santa Barbara, California USA, p. 283. ACM, New York (1997)
Pedone, F., Guerraoui, R., Schiper, A.: The database state machine approach. Technical Report SSC/1999/008, École Polytechnique Fédérale de Lausanne, Switzerland (1999)
Amir, Y., Tutu, C.: From total order to database replication. Technical Report CNDS-2001-6, Departement of Computer Science John Hopkins University, Baltimore, MD 21218 USA (2001)
Gray, J.N., Reuter, A.: Transaction Processing: concepts and techniques. In: Data Management Systems, Morgan Kaufmann Publishers, Inc., San Mateo (1993)
Frølund, S., Guerraoui, R.: Implementing e-transactions with asynchronous replication. IEEE Transactions on Parallel and Distributed Systems 12 (2001)
Gärtner, F.C.: A gentle introduction to failure detectors and related problems. Technical Report TUD-BS-2001-01, Darmstadt University of Technology, Department of Computer Science (2001)
Birman, K.P., Joseph, T.A.: Exploiting virtual synchrony in distributed systems. In: Proceedings of the 11th ACM Symposium on OS Principles, Austin, TX, USA, pp. 123–138. ACM SIGOPS, ACM (1987)
Malloth, C.P., Felber, P., Schiper, A., Wilhelm, U.: Phoenix: A toolkit for building fault-tolerant distributed applications in large scale. In: Workshop on Parallel and Distributed Platforms in Industrial Products, San Antonio, Texas, USA, IEEE, Los Alamitos (1995)
van Renesse, R., Birman, K.P., Maffeis, S.: Horus: A flexible group communication system. Communications of the ACM 39, 76–83 (1996)
Hiltunen, M.A., Schlichting, R.D.: The cactus approach to building configurable middleware services. In: Proceedings of the Workshop on Dependable System Middleware and Group Communication (DSMGC 2000), Nürnberg, Germany (2000)
Birman, K., Constable, R., Hayden, M., Kreitz, C., Rodeh, O., van Renesse, R., Vogels, W.: The Horus and Ensemble projects: Accomplishments and limitations. In: Proceedings of the DARPA Information Survivability Conference & Exposition (DISCEX 2000), Hilton Head, South Carolina USA (2000)
Miranda, H., Pinto, A., Rodrigues, L.: Appia: A flexible protocol kernel supporting multiple coordinated channels. In: Proceedings of the 21st International Conference on Distributed Computing Systems (ICDCS 2001), Phoenix, Arizona, USA, pp. 707–710. IEEE Computer Society, Los Alamitos (2001)
Aguilera, M.K., Chen, W., Toueg, S.: Failure detection and consensus in the crash recovery model. Distributed Computing 13, 99–125 (2000)
Gafni, E., Lamport, L.: Disk paxos. Technical Report SRC 163, Compaq Systems Research Center, 130, Lytton Avenue Palo Alto, California 94301 USA (2000)
Rodrigues, L., Raynal, M.: Atomic broadcast in asynchronous crash-recovery distributed systems. In: Proceedings of the 20th International Conference on Distributed Systems (ICDCS 2000), Taipei, Taiwan (ROC), pp. 288–295. IEEE Computer Society, Los Alamitos (2000)
Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. Communications of the ACM 43, 225–267 (1996)
Lynch, N.: Distributed Algorithms. Morgan Kaufmann, San Francisco (1996)
Saltzer, J.H., Reed, D.P., Clark, D.D.: End-to-end arguments in system design. ACM Transactions on Computer Systems 2, 277–288 (1984)
Holliday, J.: Replicated database recovery using multicast communications. In: Proceedings of the Symposium on Network Computing and Applications (NCA 2001), Cambridge, MA, USA, pp. 104–107. IEEE, Los Alamitos (2001)
Cheriton, D.R., Skeen, D.: Understanding the limitations of causally and totally ordered communication. In: Liskov, B. (ed.) Proceedings of the 14th Symposium on Operating Systems Principles, Asheville, North Carolina, vol. 27, pp. 44–57. ACM Press, New York (1993)
Keidar, I., Dolev, D.: Totally ordered broadcast in the face of network partitions. In: Avresky, D. (ed.) Dependable Network Computing, Kluwer Academic Publications, Dordrecht (2000)
Davidson, S.B., Garcia-Molina, H., Skeen, D.: Consistency in partitioned networks. ACM Computing Surveys 17, 341–370 (1985)
Fu, A.W., Cheung, D.W.: A transaction replication scheme for a replicated database with node autonomy. In: Proceedings of the International Conference on Very Large Databases, Santiago, Chile (1994)
Kemme, B., Alonso, G.: A suite of database replication protocols based on group communication primitives. In: Proceedings of the 18th International Conference on Distributed Computing Systems (ICDCS 1998), Amsterdam, The Netherlands (1998)
Kemme, B., Pedone, F., Alonso, G., Schiper, A.: Processing transactions over optimistic atomic broadcast protocols. In: Proceedings of the International Conference on Distributed Computing Systems, Austin, Texas (1999)
Holliday, J., Agrawal, D., Abbadi, A.E.: The performance of database replication with group multicast. In: Proceedings of International Symposium on Fault Tolerant Computing (FTCS29), pp. 158–165. IEEE Computer Society, Los Alamitos (1999)
Babao§lu, Ö., Toueg, S.: Understanding non-blocking atomic commitement. Technical Report UBLCS-93-2, Laboratory for Computer Science, University of Bologna, 5 Piazza di Porta S. Donato, 40127 Bologna, Italy (1993)
Keidar, I., Dolev, D.: Increasing the resilience of distributed and replicated database systems. Journal of Computer and System Sciences (JCSS) 57, 309–324 (1998)
Jiménez-Paris, R., Patiño-Martínez, M., Alonso, G., Arévalo, S.: A low-latency non-blocking commit service. In: Welch, J.L. (ed.) DISC 2001. LNCS, vol. 2180, pp. 93–107. Springer, Heidelberg (2001)
Wiesmann, M., Pedone, F., Schiper, A., Kemme, B., Alonso, G.: Understanding replication in databases and distributed systems. In: Proceedings of 20th International Conference on Distributed Computing Systems (ICDCS 2000), Taipei, Taiwan, R.O.C., IEEE Computer Society, Los Alamitos (2000)
Kemme, B., Bartoli, A., Babao§lu, Ö.: Online reconfiguration in replicated databases based on group communication. In: Proceedings of the Internationnal Conference on Dependable Systems and Networks (DSN2001), Göteborg, Sweden (2001)
Amir, Y.: Replication using group communication over a partitioned network. PhD thesis, Hebrew University of Jerusalem, Israel (1995)
Ezhilchelvan, P.D., Shrivastava, S.K.: Enhancing replica management services to cope with group failures. In: Krakowiak, S., Shrivastava, S.K. (eds.) BROADCAST 1999. LNCS, vol. 1752, pp. 79–103. Springer, Heidelberg (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wiesmann, M., Schiper, A. (2004). Beyond 1-Safety and 2-Safety for Replicated Databases: Group-Safety. In: Bertino, E., et al. Advances in Database Technology - EDBT 2004. EDBT 2004. Lecture Notes in Computer Science, vol 2992. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24741-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-24741-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21200-3
Online ISBN: 978-3-540-24741-8
eBook Packages: Springer Book Archive