Abstract
Service availability is a key construct in Service Level Agreements (SLA) between a cloud service provider and a client. The provider typically allocates backup resources to mitigate the risk of violating the SLA-specified uptime guarantee. However, initial backups may need to be adjusted in response to real-time failure and recovery events. In this study, we first develop a recurrent intervention at fixed intervals (RIFI) strategy that allows the provider to adjust the allocation of backup resources such that the expected total cost is minimized. Next, we focus on the limit to number of interventions, starting from single intervention strategy, as frequent reallocations may be operationally disruptive. Particularly, we provide a cost minimization approach to guide the service providers in their virtual resources management, and a specific downtime minimization approach for more mission-critical applications as a more aggressive alternative. We present computational results exploring the impact of intervention on the likelihood of SLA violation for the rest of the contract period, and evaluate parameters such as time and quantum of resource level adjustment, penalty levels desired by clients, and their influences on the backup resource provisioning strategies. We also validate our models through the analysis of use cases from Amazon Elastic Compute Cloud. Finally, we summarize this study by providing key practical managerial implications for resource deployment in the availability-aware cloud.
Similar content being viewed by others
References
Bruneo, D. (2014). A stochastic model to investigate data center performance and QoS in IaaS cloud computing systems. IEEE Transactions on Parallel and Distributed Systems, 25(3), 560–569.
Chase, J., & Niyato, D. (2015). Joint optimization of resource provisioning in cloud computing. IEEE Transactions on Services Computing, 10(3), 396–409.
Cisco. (2012). Cisco Global Cloud Networking Survey.
Dean, J. (2009) Designs, lessons and advice from building large distributed systems. https://www.cs.cornell.edu/projects/ladis2009/talks/deankeynote-ladis2009.pdf
Du, A. Y., Das, S., Yang, Z., Qiao, C., & Ramesh, R. (2015). Predicting transient downtime in virtual server systems: An efficient sample path randomization approach. IEEE Transactions on Computers, 64(12), 3541–3554.
Ghosh, R., Longo, F., Frattini, F., Russo, S., & Trivedi, K. S. (2014a). Scalable analytics for IaaS cloud availability. IEEE Transactions on Cloud Computing, 2(1), 57–70.
Ghosh, R., Longo, F., Xia, R., Naik, V. K., & Trivedi, K. S. (2014b). Stochastic model driven capacity planning for an infrastructure-as-a-service cloud. IEEE Transactions on Services Computing, 7(4), 667–680.
Goudarzi, H., Ghasemazar, M., & Pedram, M. (2012). SLA-based optimization of power and migration cost in cloud computing 2012 12th IEEE/ACM international symposium on cluster, Cloud and Grid Computing, Ottawa, ON, Canada.
Guo, Z., Li, J., & Ramesh, R. (2019). Optimal Management of Virtual Infrastructures under flexible cloud service agreements. Information Systems Research, 30(4), 1424–1446.
Gutierrez-Garcia, J. O., & Sim, K. M. (2012). GA-based cloud resource estimation for agent-based execution of bag-of-tasks applications. Information Systems Frontiers, 14(4), 925–951.
Hassan, M. M., Hossain, M. S., Sarkar, A., & Huh, E.-N. (2014). Cooperative game-based distributed resource allocation in horizontal dynamic cloud federation platform. Information Systems Frontiers, 16(4), 523–542.
ITIC (2017) ITIC 2017–2018 global server hardware, server OS reliability report. https://www.ibm.com/downloads/cas/DV0XZV6R#:~:text=ITIC's%202020%20Reliability%20poll%20finds,mission%20critical%20servers%20and%20applications
Jammal, M., Kanso, A., Heidari, P., & Shami, A. (2016). A formal model for the availability analysis of cloud deployed multi-tiered applications 2016 IEEE international conference on cloud engineering workshop Berlin, Germany.
Jammal, M., Hawilo, H., Kanso, A., & Shami, A. (2018). ACE: Availability-aware CloudSim extension. IEEE Transactions on Network and Service Management, 15(4), 1586–1599.
Kauffman, R. J., Ma, D., Shang, R., Huang, J., & Yang, Y. (2015). On the Financification of cloud computing: An agenda for pricing and service delivery mechanism design research. International Journal of Cloud Computing.
Mansouri, Y., Toosi, A. N., & Buyya, R. (2019). Cost optimization for dynamic replication and migration of data in cloud data centers. IEEE Transactions on Cloud Computing, 7(3), 705–718.
Marques, D., Bronevetsky, G., Fernandes, R., Pingali, K., & Stodghill, P. (2005). Optimizing checkpoint sizes in the C3 system 19th IEEE international parallel and distributed processing symposium, Denver, CO, USA.
Martens, B., & Teuteberg, F. (2012). Decision-making in cloud computing environments: A cost and risk based approach. Information Systems Frontiers, 14(4), 871–893.
Mateo-Fornés, J., Solsona-Tehàs, F., Vilaplana-Mayoral, J., Teixidó-Torrelles, I., & Rius-Torrentó, J. (2019). CART, a decision SLA model for SaaS providers to keep QoS regarding availability and performance. IEEE Access, 7, 38195–38204.
Mell, P., & Grance, T. (2011). The NIST definition of cloud computing.
Mistry, S., Bouguettaya, A., Dong, H., & Qin, A. K. (2018). Metaheuristic optimization for long-term IaaS service composition. IEEE Transactions on Services Computing, 11(1), 131–143.
Panda, S. K., Gupta, I., & Jana, P. K. (2019). Task scheduling algorithms for multi-cloud systems: Allocation-aware approach. Information Systems Frontiers, 21(2), 241–259.
Ran, Y., Yang, J., Zhang, S., & Xi, H. (2017). Dynamic IaaS computing resource provisioning strategy with QoS constraint. IEEE Transactions on Services Computing, 10(2), 190–202.
Randal, A. (2020). The ideal versus the real: Revisiting the history of virtual machines and containers. ACM Computing Surveys (CSUR), 53(1), 1–31.
Silic, M., Delac, G., Krka, I., Srbljic, S., & J. I. T. o. S. C. (2014). Scalable and accurate prediction of availability of atomic web services. IEEE Transactions on Services Computing, 7(2), 252–264.
Singh, S., Chana, I., & Buyya, R. (2017). STAR: SLA-aware autonomic Management of Cloud Resources. IEEE Transactions on Cloud Computing.
Smith, J. E., & Nair, R. (2005). The architecture of virtual machines. Computer, 38(5), 32–38.
Toosi, A. N., Vanmechelen, K., Ramamohanarao, K., & Buyya, R. (2015). Revenue maximization with optimal capacity control in infrastructure as a service cloud markets. IEEE Transactions on Cloud Computing, 3(3), 261–274.
Van, H. N., Tran, F. D., & Menaud, J.-M. (2009). SLA-aware virtual resource Management for Cloud Infrastructures 2009 ninth IEEE international conference on computer and information technology, Xiamen, China.
Wang, X., Du, Z., Chen, Y., & Li, S. (2008). Virtualization-based autonomic resource Management for Multi-tier web Applications in shared data center. Journal of Systems and Software, 81(9), 1591–1608.
Wu, L., Garg, S. K., Versteeg, S., & Buyya, R. (2014). SLA-based resource provisioning for hosted software-as-a-service applications in cloud computing environments. IEEE Transactions on Services Computing, 7(3), 465–485.
Yala, L., Frangoudis, P. A., Lucarelli, G., & Ksentini, A. (2018). Cost and availability aware resource allocation and virtual function placement for CDNaaS provision. IEEE Transactions on Network and Service Management, 15(4), 1334–1348.
Yang, J., Liu, C., Shang, Y., Cheng, B., Mao, Z., Liu, C., Niu, L., & Chen, J. (2014). A cost-aware auto-scaling approach using the workload prediction in service clouds. Information Systems Frontiers, 16(1), 7–18.
Yuan, S., Das, S., Ramesh, R., & Qiao, C. (2018). Service agreement trifecta: Backup resources, Price and penalty in the availability-aware cloud. Information Systems Research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yuan, S., Das, S., Ramesh, R. et al. Availability-Aware Virtual Resource Provisioning for Infrastructure Service Agreements in the Cloud. Inf Syst Front 25, 1495–1512 (2023). https://doi.org/10.1007/s10796-022-10302-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10796-022-10302-4