A Novel Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration in Vehicular Edge Computing
Abstract
:1. Introduction
- It presents the VEC system of a fault-tolerant environment, including the service migration model, and edge server fault model for the first time.
- Propose a path selection algorithm that combines the sliding window model to evaluate the time-varying failure rate of ESs and obtain the high-quality migration path.
- Propose a service migration algorithm. To cope with edge node failure in the VEC environment, this paper uses the advantages of replication and resubmission strategies to ensure the reliability of service migration.
- To verify the performance of the DRPS approach, we also conducted simulation experiments on two real edge computing datasets. The experimental results demonstrate that, compared with the traditional method, our proposed DRPS approach has a better task completion rate and average task completion time.
2. Related Work
- The performance fluctuations of ES are not taken into account. In the VEC environment, the processing capability and operational stability of ES are always affected by natural or human factors, but the existing research has not fully considered this issue.
- The fault tolerance mechanism is single. Each fault tolerance mechanism has its applicable scenarios and shortcomings. Most of the existing researches are based on one fault tolerance mechanism for service migration, which is difficult to apply to other scenarios.
3. System Model and Problem Description
3.1. System Model
3.2. Problem Description
4. DRPS: Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration
4.1. Path Selection
Algorithm 1 Path Selection Algorithm |
Input: Task ; source edge server ; target edge server . Output:T or F, where T indicates that the task migration succeeded, and F indicates that the task migration fail.
|
4.2. Service Migration
Algorithm 2 Service Migration Algorithm |
Input: Task ; s feasible migration paths . Output: The task final migration time .
|
Algorithm 3 Service Migration Time |
Input: Task ; The migration path . Output: The migration time of the task on path .
|
5. Performance Evaluation
5.1. Experiment Setting
- (1)
- As illustrated in Figure 3a, 80 base stations in Pudong Central Business District are selected in the Telecom data set for simulation experiment, and 30 of them are randomly chosen as network routers, which are only responsible for forwarding data without calculation.The communication range of each base station and network router is set as 250 meters, and the initial failure rate is evenly distributed between 0.1 and 0.15. All base stations are equipped with the same ES specifications and support parallel computing. The processed Telecom dataset is illustrated in Figure 3b, with circles representing network routers, rectangles representing base stations, and red connection lines representing data links.
- (2)
- In the Taxi dataset, the driving trajectories of 100, 300, 500, 700, and 900 taxis in the Pudong Central Business District in the same period were selected as the movement trajectories of edge users. The taxi drives at a constant speed. The base station closest to the initial position of the taxi is set as the source base station, and the base station closest to the taxi position after the maximum tolerable time is set as the target base station. Each taxi only needs to migrate one task.
5.2. Baseline Algorithms and Metrics
- (1)
- DRPS: It is the algorithm proposed in this paper. A novel fault-tolerant approach for redundant-path-enabled service migration in mobile edge computing.
- (2)
- DRPS/F: It is a simplified version of the DRPS approach, where the failure rate of ESs is constant.
- (3)
- Greedy-SP: It is a traditional greedy algorithm, which first finds the current closest path and performs service migration based on the resubmission strategy.
- (4)
- Greedy-CP: It is a traditional greedy algorithm, which first find ES with the most remaining computing force within the current coverage range and performs service migration based on the resubmission strategy.
- (5)
- PLP [43]: It is based on weak Pareto optimization and selects multiple paths for service migration.
- As illustrated in Figure 4a, for independent tasks, the migration paths of DRPS and DRPS/F approach are the same, because both algorithms choose the migration path according to the failure rate of ES. However, for the task flow that is continuously reached within a period of time, because the DRPS approach adjusts the failure rate of ES dynamically, and the failure rate of the DRPS approach will be lower than that of DRPS/F, which has also been confirmed in subsequent experiments.
- As illustrated in Figure 4b, PLP algorithm always selects the path with the least time and consumption for migration, but it may increase migration time and energy consumption due to the ES failure.
- As illustrated in Figure 4c, the Greedy-SP algorithm selects the closest ES or NR for service migration, but it is easy to fall into the local high-quality solution and cannot find the high-quality migration path.
- As illustrated in Figure 4d, similar to Greedy-SP, Greedy-CP is also based on greedy thinking, but it always prefers the ES with the highest remaining computing power within the scope for service migration.
- (1)
- Average Completion Time (ACT): Time to migrate services from source ES to target ES, including data transfer time and task computation time.
- (2)
- Task Completion Rate (TCR): The success rate of service migration. If the service is migrated to the targeted ES within the user’s maximum tolerable time, the service is successfully migrated.
- (3)
- Average Failure Count (AFC): The number of service failures during migration. The more failures, the higher the cost of the service migration.
5.3. Experimental Results
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Song, C.; Liu, M.; Cao, J.N.; Zheng, Y.; Gong, H.G.; Chen, G.H. Maximizing network lifetime based on transmission range adjustment in wireless sensor networks. Comput. Commun. 2009, 11, 1316–1325. [Google Scholar] [CrossRef]
- Xu, F.L.; Guo, S.; Jeong, J.; Yu, G.; Cao, Q.; Liu, M.; He, T. Utilizing shared vehicle trajectories for data forwarding in vehicular networks. In Proceedings of the 2011 IEEE INFOCOM, Shanghai, China, 10–15 April 2011; pp. 441–445. [Google Scholar]
- Liu, N.B.; Liu, M.; Wei, L.; Chen, G.H.; Cao, J.N. PVA in VANETs: Stopped cars are not silent. In Proceedings of the 2011 IEEE INFOCOM, Shanghai, China, 10–15 April 2011; pp. 431–435. [Google Scholar]
- Liu, M.; Gong, H.G.; Wen, Y.G.; Chen, G.H.; Cao, J.N. The last minute: Efficient data evacuation strategy for sensor networks in post-disaster applications. In Proceedings of the 2011 IEEE INFOCOM, Shanghai, China, 10–15 April 2011; pp. 291–295. [Google Scholar]
- Liu, N.B.; Liu, M.; Chen, G.H.; Cao, J.N. The sharing at roadside: Vehicular content distribution using parked vehicles. In Proceedings of the IEEE INFOCOM, Orlando, FL, USA, 25–30 March 2012; pp. 2641–2645. [Google Scholar]
- Shi, W.S.; Zhang, X.Z.; Wang, Y.F.; Zhang, Q.Y. Edge computing: State-of-the-art and future directions. J. Comput. Res. Dev. 2019, 56, 69. [Google Scholar]
- Hu, Y.C.; Patel, M.; Sabella, D.; Sprecher, N.; Young, V. Mobile edge computing—A key technology towards 5G. ETSI White Paper 2015, 11, 1–16. [Google Scholar]
- Deng, S.G.; Huang, L.T.; Wu, H.Y.; Tan, W.; Taheri, J.; Zomaya, A.Y.; Wu, Z.H. Toward Mobile Service Computing: Opportunities and Challenges. IEEE Cloud Comput. 2016, 3, 32–41. [Google Scholar] [CrossRef]
- Chung, J.M.; Park, Y.S.; Park, J.H.; Cho, H.J. Adaptive cloud offloading of augmented reality applications on smart devices for minimum energy consumption. KSII Trans. Internet Inf. Syst. (TIIS) 2015, 8, 3090–3102. [Google Scholar]
- Xiong, K.; Leng, S.; Hu, J.; Chen, X.S.; Yang, K. Smart network slicing for vehicular fog-RANs. IEEE Trans. Veh. Technol. 2019, 4, 3075–3085. [Google Scholar] [CrossRef]
- Wang, S.Q.; Urgaonkar, R.; He, T.; Chan, K.; Zafer, M.; Leung, K.K. Dynamic service placement for mobile micro-clouds with predicted future costs. In Proceedings of the 2015 IEEE International Conference on Communications (ICC), London, UK, 8–12 June 2015. [Google Scholar]
- Kekki, S.; Featherstone, W.; Fang, Y.G.; Kuure, P.; Li, A.; Ranjan, A.; Purkayastha, D.; Jiangping, F.; Frydman, D.; Verin, G.; et al. MEC in 5G networks. ETSI White Paper 2018, 28, 1–28. [Google Scholar]
- Satyanarayanan, M.; Bahl, P.; Caceres, R.; Davies, N. The Case for VM-based Cloudlets in Mobile Computing. IEEE Pervasive Comput. 2011, 8, 14–23. [Google Scholar] [CrossRef]
- Khayyat, M.; Elgendy, I.A.; Muthanna, A.; Alshahrani, A.S.; Alharbi, S.; Koucheryavy, A. Advanced deep learning-based computational offloading for multilevel vehicular edge-cloud computing networks. IEEE Access 2020, 1, 137052–137062. [Google Scholar] [CrossRef]
- Pezoa, J.E.; Dhakal, S.; Hayat, M.M. Maximizing service reliability in distributed computing systems with random node failures: Theory and implementation. IEEE Trans. Parallel Distrib. Syst. 2010, 21, 1531–1544. [Google Scholar] [CrossRef] [Green Version]
- Chen, C.A.; Won, M.; Xie, G.G. Energy-efficient fault-tolerant data storage & processing in dynamic networks. In Proceedings of the Fourteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, Bangalore, India, 29 July–1 August 2013; pp. 281–286. [Google Scholar]
- Chen, C.A.; Stoleru, R.; Xie, G.G. Energy-efficient and fault-tolerant mobile cloud storage. In Proceedings of the 2016 5th IEEE International Conference on Cloud Networking (Cloudnet), Pisa, Italy, 3–5 October 2016; pp. 51–57. [Google Scholar]
- Satria, D.; Park, D.; Jo, M. Recovery for overloaded mobile edge computing. Future Gener. Comput. Syst. 2017, 1, 138–147. [Google Scholar] [CrossRef]
- Long, T.Y.; Ma, Y.; Xia, Y.N.; Xiao, X.; Peng, Q.L.; Zhao, J.L. A Mobility-Aware and Fault-Tolerant Service Offloading Method in Mobile Edge Computing. In Proceedings of the 2022 IEEE International Conference on Web Services (ICWS), Barcelona, Spain, 10–16 July 2022; pp. 67–72. [Google Scholar]
- Poola, D.; Ramamohanarao, K.; Buyya, R. Enhancing reliability of workflow execution using task replication and spot instances. ACM Trans. Auton. Adapt. Syst. (TAAS) 2016, 10, 1–21. [Google Scholar] [CrossRef]
- Chen, W.; Lee, Y.C.; Fekete, A.; Zomaya, A.Y. Adaptive multiple-workflow scheduling with task rearrangement. J. Supercomput. 2015, 71, 1297–1317. [Google Scholar] [CrossRef]
- Olteanu, A.; Pop, F.; Dobre, C.; Cristea, V. A dynamic rescheduling algorithm for resource management in large scale dependable distributed systems. Comput. Math. Appl. 2012, 63, 1409–1423. [Google Scholar] [CrossRef]
- Cao, Y.; Ro, C.W.; Yin, J.W. Scheduling analysis of failure-aware VM in cloud system. Int. J. Control Autom. 2014, 7, 243–250. [Google Scholar] [CrossRef]
- Jing, W.; Liu, Y. Multiple DAGs reliability model and fault-tolerant scheduling algorithm in cloud computing system. Comput. Model. New Technol. 2014, 18, 22–30. [Google Scholar]
- Jayadivya, S.K.; Nirmala, J.S.; Bhanu, M.S.S. Fault tolerant workflow scheduling based on replication and resubmission of tasks in Cloud Computing. Int. J. Comput. Sci. Eng. 2012, 4, 996. [Google Scholar]
- Patra, P.K.; Singh, H.; Singh, R.; Das, S.; Dey, N.; Victoria, A.D.C. Replication and resubmission based adaptive decision for fault tolerance in real time cloud computing: A new approach. Int. J. Serv. Sci. Manag. Eng. Technol. (IJSSMET) 2016, 7, 46–60. [Google Scholar] [CrossRef] [Green Version]
- Plachy, J.; Becvar, Z.; Mach, P. Path selection enabling user mobility and efficient distribution of data for computation at the edge of mobile network. Comput. Netw. 2016, 1, 357–370. [Google Scholar] [CrossRef]
- Machen, A.; Wang, S.Q.; Leung, K.K.; Ko, B.J.; Salonidis, T. Migrating running applications across mobile edge clouds: Poster. In Proceedings of the 22nd Annual International Conference on Mobile Computing and Networking, New York City, NY, USA, 3–7 October 2016; pp. 435–436. [Google Scholar]
- Ha, K.; Abe, Y.; Chen, Z.; Hu, W.L.; Amos, B.; Pillai, P.; Satyanarayanan, M. Adaptive VM handoff across cloudlets; Technical Report CMU-CS-15-113; CMU School of Computer Science: Pittsburgh, PA, USA, 2015. [Google Scholar]
- Zhao, F.; Zeng, X. Optimization of user and operator cost for large-scale transit network. J. Trans. Eng. 2007, 133, 240–251. [Google Scholar] [CrossRef]
- Qiao, G.H.; Leng, S.P.; Maharjan, S.; Zhang, Y.; Ansari, N. Deep reinforcement learning for cooperative content caching in vehicular edge computing and networks. IEEE Internet Things J. 2019, 7, 247–257. [Google Scholar] [CrossRef]
- Wang, S.Q.; Urgaonkar, R.; Zafer, M.; He, T.; Chan, K.; Leung, K.K. Dynamic service migration in mobile edge computing based on Markov decision process. IEEE/ACM Trans. Netw. 2019, 3, 1272–1288. [Google Scholar] [CrossRef] [Green Version]
- Peng, Q.L.; Xia, Y.N.; Wang, Y.; Wu, C.R.; Luo, X.; Lee, J. A decentralized reactive approach to online task offloading in mobile edge computing environments. In Proceedings of the International Conference on Service-Oriented Computing, Dubai, United Arab Emirates, 14–17 December 2020; pp. 232–247. [Google Scholar]
- Liang, Z.Z.; Liu, Y.; Lok, T.M.; Huang, K.B. Multi-cell mobile edge computing: Joint service migration and resource allocation. IEEE Trans. Wirel. Commun. 2021, 9, 5898–5912. [Google Scholar] [CrossRef]
- Gao, Z.P.; Jiao, Q.D.; Xiao, K.L.; Wang, Q.; Mo, Z.J.; Yang, Y. Deep reinforcement learning based service migration strategy for edge computing. In Proceedings of the 2019 IEEE International Conference on Service-Oriented System Engineering (SOSE), San Francisco, CA, USA, 4–9 April 2019; pp. 116–1165. [Google Scholar]
- Chen, W.; Chen, Y.H.; Wu, J.X.; Tang, Z.B. A multi-user service migration scheme based on deep reinforcement learning and SDN in mobile edge computing. Phys. Commun. 2017, 1, 101397. [Google Scholar] [CrossRef]
- Chen, Y.P.; He, S.S.; Jin, X.M.; Wang, Z.M.; Wang, F.W.; Chen, L. Resource utilization and cost optimization oriented container placement for edge computing in industrial internet. J. Supercomput. 2022, 1–29. [Google Scholar] [CrossRef]
- Li, C.L.; Zhang, Y.; Gao, X.; Luo, Y.L. Energy-latency tradeoffs for edge caching and dynamic service migration based on DQN in mobile edge computing. J. Parallel Distrib. Comput. 2022, 1, 15–31. [Google Scholar] [CrossRef]
- Yuan, Q.; Li, J.L.; Zhou, H.B.; Lin, T.; Luo, G.Y.; Shen, X.M. A joint service migration and mobility optimization approach for vehicular edge computing. IEEE Trans. Veh. Technol. 2020, 8, 9041–9052. [Google Scholar] [CrossRef]
- Long, T.Y.; Chen, P.; Xia, Y.N.; Jiang, N.; Wang, X.; Long, M. A novel fault-tolerant approach to web service composition upon the Edge Computing Environment. In Proceedings of the International Conference on Web Services, Chicago, IL, USA, 5–10 September 2021; pp. 15–31. [Google Scholar]
- Wang, R.F.; Chen, N.J.; Yao, X.Y.; Hu, L.Q. Fasdq: Fault-tolerant adaptive scheduling with dynamic qos-awareness in edge containers for delay-sensitive tasks. Sensors 2021, 9, 2973. [Google Scholar] [CrossRef]
- Tuli, S.; Casale, G.; Jennings, N.R. PreGAN: Preemptive Migration Prediction Network for Proactive Fault-Tolerant Edge Computing. In Proceedings of the IEEE INFOCOM 2022-IEEE Conference on Computer Communications, London, UK, 2–5 May 2022; pp. 670–679. [Google Scholar]
- Xu, J.L.; Ma, X.; Zhou, A.; Duan, Q.; Wang, S.G. Path selection for seamless service migration in vehicular edge computing. IEEE Internet Things J. 2020, 7, 9040–9049. [Google Scholar] [CrossRef]
- Wang, C.; Gill, C.; Lu, C.Y. Adaptive data replication in real-time reliable edge computing for Internet of Things. In Proceedings of the 2020 IEEE/ACM Fifth International Conference on Internet-of-Things Design and Implementation (IoTDI), Sydney, Australia, 21–24 April 2020; pp. 128–134. [Google Scholar]
- Jhawar, R.; Piuri, V.; Santambrogio, M. Fault tolerance management in cloud computing: A system-level perspective. IEEE Syst. J. 2012, 7, 288–297. [Google Scholar] [CrossRef]
- Plankensteiner, K.; Prodan, R. Meeting soft deadlines in scientific workflows using resubmission impact. IEEE Trans. Parallel Distrib. Syst. 2011, 23, 890–901. [Google Scholar] [CrossRef]
- Yao, G.; Ding, Y.; Hao, K. Using imbalance characteristic for fault-tolerant workflow scheduling in cloud systems. IEEE Trans. Parallel Distrib. Syst. 2017, 28, 3671–3683. [Google Scholar] [CrossRef]
- Li, Y.Z.; Zhou, A.; Ma, X.; Wang, S.G. Profit-aware edge server placement. IEEE Internet Things J. 2021, 9, 55–67. [Google Scholar] [CrossRef]
- Guo, Y.; Wang, S.G.; Zhou, A.; Xu, J.L.; Yuan, J.; Hsu, C.H. User allocation-aware edge cloud placement in mobile edge computing. Softw. Pract. Exp. 2020, 50, 489–502. [Google Scholar] [CrossRef]
- Wang, S.G.; Guo, Y.; Zhang, N.; Yang, P.; Zhou, A.; Shen, X.M. Delay-aware microservice coordination in mobile edge computing: A reinforcement learning approach. IEEE Trans. Mob. Comput. 2019, 20, 939–951. [Google Scholar] [CrossRef]
- Liu, S.Y.; Liu, Y.H.; Ni, L.M.; Fan, J.P.; Li, M.L. Towards mobility-based clustering. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 24–28 July 2010; pp. 919–928. [Google Scholar]
Notation | Definition |
---|---|
B | The set of base stations |
N | The set of network routers |
R | The set of tasks |
The set of Migration paths for task | |
The Migration time of task | |
The i-th base station in B | |
The i-th network router in N | |
The i-th task in R | |
The j-th Migration path in | |
The coordinates of | |
The signal coverage radius of | |
The edge server of | |
The residual computing power of | |
The task information performed on the in the recent period | |
The coordinates of | |
The failure rate of | |
The arrival time of | |
The amount of data transmitted in | |
The maximum tolerable time of | |
The coordinates of | |
The amount of calculation of | |
The execution time of task on | |
The normalized data of | |
The number of edge servers in | |
The bandwidth between node x and node y | |
The calculation result of | |
The channel bandwidth of the base station | |
The signal transmission power of | |
The Gaussian noise power of the device |
DRPS/F | Greedy-SP | Greedy-CP | PLP | DRPS/F/OP | DRPS/F/AP | |
---|---|---|---|---|---|---|
ACT | 2.03% | 19.17% | 22.41% | 17.45% | / | / |
TCR | 2.13% | 14.02% | 15.61% | 13.17% | / | / |
AFC | / | 4.18% | 6.01% | 7.22% | 2.97% | 11.6% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhao, J.; Ma, Y.; Xia, Y.; Dai, M.; Chen, P.; Long, T.; Shao, S.; Li, F.; Li, Y.; Zeng, F. A Novel Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration in Vehicular Edge Computing. Appl. Sci. 2022, 12, 9987. https://doi.org/10.3390/app12199987
Zhao J, Ma Y, Xia Y, Dai M, Chen P, Long T, Shao S, Li F, Li Y, Zeng F. A Novel Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration in Vehicular Edge Computing. Applied Sciences. 2022; 12(19):9987. https://doi.org/10.3390/app12199987
Chicago/Turabian StyleZhao, Jiale, Yong Ma, Yunni Xia, Mengxuan Dai, Peng Chen, Tingyan Long, Shiyun Shao, Fan Li, Yin Li, and Feng Zeng. 2022. "A Novel Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration in Vehicular Edge Computing" Applied Sciences 12, no. 19: 9987. https://doi.org/10.3390/app12199987
APA StyleZhao, J., Ma, Y., Xia, Y., Dai, M., Chen, P., Long, T., Shao, S., Li, F., Li, Y., & Zeng, F. (2022). A Novel Fault-Tolerant Approach for Dynamic Redundant Path Selection Service Migration in Vehicular Edge Computing. Applied Sciences, 12(19), 9987. https://doi.org/10.3390/app12199987