Abstract
In the past decade, cloud platforms have become a standard across the industry for data storage and operations. Such platforms offer high quality of service in terms of reliability and ease of setup at an effective cost. With exponentially high rates of increase of data storage requirements, data is now increasingly stored in clouds. However, there are limited studies which analyze the processes performing the storage operations. Queueing models offer a very natural way of modeling these storage processes. The data packets waiting for storage form a queue which is served by a storage server. Since data packets are transmitted to the cloud in batches for efficiency, this storage server is modelled as a batch server. The storage server goes into sleep mode in between data transmission periods which are, in turn, modelled as vacations. The storage service is resumed after a vacation if there are enough packets in backlog or enough time has elapsed since last storage. This is modelled as restarting thresholds in our model. Analyzing this model helps us evaluate the quality of service (QoS) of storage processes in terms of measures such as backlog size and probability of a new connection to cloud server. These measures are then used to define a user cost function and QoS constraints, and compute optimal storage parameters.
Similar content being viewed by others
References
Abu-Libdeh, H., Princehouse, L., & Weatherspoon, H. (2010). RACS: A case for cloud storage diversity. In Proceedings of the 1st ACM symposium on cloud computing, SoCC ’10 (pp. 229–240). ACM, New York, NY, USA.
Adan, I. J. B. F., van Leeuwaarden, J. S. H., & Winands, E. M. M. (2006). On the application of Rouche’s theorem in queueing theory. Operations Research Letters, 34(3), 355–360.
Amazon. (2018a). Amazon web service s3. https://aws.amazon.com/s3/. Accessed 1 July 2018.
Amazon. (2018b). Amazon case studies. https://aws.amazon.com/solutions/case-studies/all/. Accessed 1 July 2018.
Boullery, D., Schörgendorfer, A., Van de Ven, P., & Zhang, B. (2016). Balanced distributed backup scheduling. https://www.google.com/patents/US9244777. US Patent 9,244,777.
Bruneel, H., & Kim, B. G. (1992). Discrete-time models for communication systems including ATM. Norwell, MA: Kluwer Academic Publishers.
Chang, V., & Wills, G. (2016). A model to compare cloud and non-cloud storage of big data. Future Generation Computer Systems-the International Journal of Escience, 57, 56–76.
Cheng, C., Li, J., & Wang, Y. (2015). An energy-saving task scheduling strategy based on vacation queuing theory in cloud computing. Tsinghua Science and Technology, 20(1), 28–39.
Claeys, D., Dorsman, J., Saxena, A., Walraevens, J., & Bruneel, H. (2016). A queueing-theoretic analysis of the threshold-based exhaustive data-backup scheduling policy. In International conference on numerical analysis and applied mathematics.
Coughlin, T. (2017). Market trends are driving digital storage choices for consumer devices [the art of storage]. IEEE Consumer Electronics Magazine, 6(4), 133–136.
Dimitriou, I. (2014). A modified vacation queueing model and its application on the discontinuous reception power saving mechanism in unreliable long term evolution networks. Performance Evaluation, 77, 37–56. https://doi.org/10.1016/j.peva.2014.03.003.
Dimitriou, I. (2016). Queueing analysis of the drx power saving mechanism in fault-tolerant 3GPP LTE wireless networks. Annals of Operations Research, 239(2), 521–552. https://doi.org/10.1007/s10479-014-1662-y.
Fiedler, M., Hossfeld, T., & Tran-Gia, P. (2010). A generic quantitative relationship between quality of experience and quality of service. IEEE Network, 24(2), 36–41. https://doi.org/10.1109/MNET.2010.5430142.
Fry, T. (2013). Data processing. Amsterdam: Elsevier Science.
Gautam, A., Choudhury, G., & Dharmaraja, S. (2018). Performance analysis of DRX mechanism using batch arrival vacation queueing system with n-policy in ITE-A networks. Annals of Telecommunications. https://doi.org/10.1007/s12243-018-0659-y.
Host-it. (2018). http://www.host-it.ie/online-backup/. Accessed 1 July 2018.
IBM. (2018). Ibm cloud. https://www.ibm.com/cloud/. Accessed 1 July 2018.
Kumar, S., Manvi, S., & Shyam, G. (2014). Resource management for infrastructure as a service (IAAS) in cloud computing: A survey. Journal of Network and Computer Applications, 41, 424–440.
Li, J., & Li, B. (2013). Erasure coding for cloud storage systems: A survey. Tsinghua Science and Technology, 18(3), 259–272. https://doi.org/10.1109/TST.2013.6522585.
Long, S., & Zhao, Y. (2012). A toolkit for modeling and simulating cloud data storage: An extension to CloudSim. In 2012 International Conference on Control Engineering and Communication Technology (pp. 597–600).
Microsoft. (2018). Microsoft azure storage. https://azure.microsoft.com/. Accessed 1 July 2018.
Netonboard. (2018). https://www.netonboard.com/cloud/cloud-storage.html. Accessed 1 July 2018.
Ng, W. K., Wen, Y., & Zhu, H. (2012). Private data deduplication protocols in cloud storage. In Proceedings of the 27th annual ACM symposium on applied computing, SAC ’12 (pp. 441–446). ACM, New York, NY, USA.
Niu, Z., Guo, X., Zhou, S., & Kumar, P. R. (2015). Characterizing energy-delay tradeoff in hyper-cellular networks with base station sleeping control. IEEE Journal on Selected Areas in Communications, 33(4), 641–650.
Plank, J. S. (2013). Erasure codes for storage systems. https://www.usenix.org/publications/login/december-2013-volume-38-number-6/erasure-codes-storage-systems-brief-primer.
Powell, W. B., & Humblet, P. (1986). The bulk service queue with a general control strategy: Theoretical analysis and a new computational procedure. Operations Research, 34(2), 267–275.
Reinsel, D., Gantz, J., & Rydning, J. (2017). Data age 2025: The evolution of data to life-critical don’t focus on big data. Focus on the data that’s big sponsored by seagate the evolution of data to life-critical don’t focus on big data.
Saxena, A., Claeys, D., Bruneel, H., Zhang, B., & Walraevens, J. (2018). Modeling data backups as a batch-service queue with vacations and exhaustive policy. Computer Communications, 128, 46–59.
Sun, G. Z., Dong, Y., Chen, D. W., & Wei, J. (2010). Data backup and recovery based on data de-duplication. Proceedings of international conference on artificial intelligence and computational intelligence, AICI 2010 (Vol. 2, no. September 2016, pp. 379–382).
Van de Ven, P.M., Zhang, B., & Schörgendorfer, A. (2014). Distributed backup scheduling: Modeling and optimization. In 2014 Proceedings IEEE Infocom (pp. 1644–1652).
Won, Y., Kim, R., Ban, J., Hur, J., Oh, S., & Lee, J. (2008). PRUN: Eliminating information redundancy for large scale data backup system. In Proceedings of the international conference on computational sciences and its applications, ICCSA (Vol. 2008, pp. 139–144).
Xia, R., Machida, F., & Trivedi, K.S. (2014). A Markov decision process approach for optimal data backup scheduling. In 44th annual IEEE/IFIP international conference on dependable systems and networks, DSN 2014, Atlanta, GA, USA, 23–26 June 2014 (pp. 660–665).
Yang, S.-R., & Lin, Y.-B. (2005). Modeling umts discontinuous reception mechanism. IEEE Transactions on Wireless Communications, 4(1), 312–319. https://doi.org/10.1109/TWC.2004.840259.
Zomaya, A., & Sakr, S. (2017). Handbook of big data technologies. Berlin: Springer.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Saxena, A., Claeys, D., Zhang, B. et al. Cloud data storage: a queueing model with thresholds. Ann Oper Res 293, 295–315 (2020). https://doi.org/10.1007/s10479-019-03279-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-019-03279-y