Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models

Cloud autoscaling is an important mechanism to dynamically manage cloud system resources with uncertain workload demands. The common goal in developing such a mechanism is to produce correct-by-design scaling policies that contribute to a reliable autoscaling component. A promising approach is forma...

Full description

Bibliographic Details
Published in:Journal of Supercomputing
Main Author: Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
Format: Article
Language:English
Published: Springer 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171969551&doi=10.1007%2fs11227-023-05665-7&partnerID=40&md5=49a8d6792c79839a568415b56997650d
id 2-s2.0-85171969551
spelling 2-s2.0-85171969551
Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
2024
Journal of Supercomputing
80
4
10.1007/s11227-023-05665-7
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171969551&doi=10.1007%2fs11227-023-05665-7&partnerID=40&md5=49a8d6792c79839a568415b56997650d
Cloud autoscaling is an important mechanism to dynamically manage cloud system resources with uncertain workload demands. The common goal in developing such a mechanism is to produce correct-by-design scaling policies that contribute to a reliable autoscaling component. A promising approach is formal verification, specifically probabilistic model checking (PMC), used to model and assess cloud scaling processes and policies. Utilizing the appropriate formal models effectively for carrying out verification tasks is one of the important factors in gaining significant insights. This is a great challenge dealt by the responsible party (e.g., cloud architects, engineers) in maximizing the benefits of PMC for their further decision-making (e.g., deploying the verified policies). In response to this challenge, we propose an approach to formalize and assess the cloud scaling process based on two formal models, namely, Markov Decision Process (MDP) and discrete-time Markov Chain (DTMC), and using PMC. We treat these two models independently for modeling similar cloud scaling contexts and assess the relevant aspects for gaining significant insights. Our approach consists of three main phases: modeling, encoding, and assessment. In our assessment, which includes complexity, performance, and sensitivity analyses, we have identified distinct sensitivities in the DTMC and MDP models. Specifically, DTMC is more sensitive to over-provisioning events, while MDP is more sensitive to under-provisioning events. These sensitivities are mainly influenced by two critical parameters: the initial number of VMs and the minimum number of VMs. Across various sample sizes, DTMC showed an increase in average latency violations from 53 to 209 for over-provisioning events, while MDP demonstrated a decrease in average latency violations from 88 to 23 for under-provisioning events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
Springer
9208542
English
Article

author Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
spellingShingle Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
author_facet Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
author_sort Agos Jawaddi S.N.; Ismail A.; Mohammad Hatta M.N.H.; Kamarulzaman A.F.
title Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
title_short Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
title_full Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
title_fullStr Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
title_full_unstemmed Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
title_sort Insights into cloud autoscaling: a unique perspective through MDP and DTMC formal models
publishDate 2024
container_title Journal of Supercomputing
container_volume 80
container_issue 4
doi_str_mv 10.1007/s11227-023-05665-7
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171969551&doi=10.1007%2fs11227-023-05665-7&partnerID=40&md5=49a8d6792c79839a568415b56997650d
description Cloud autoscaling is an important mechanism to dynamically manage cloud system resources with uncertain workload demands. The common goal in developing such a mechanism is to produce correct-by-design scaling policies that contribute to a reliable autoscaling component. A promising approach is formal verification, specifically probabilistic model checking (PMC), used to model and assess cloud scaling processes and policies. Utilizing the appropriate formal models effectively for carrying out verification tasks is one of the important factors in gaining significant insights. This is a great challenge dealt by the responsible party (e.g., cloud architects, engineers) in maximizing the benefits of PMC for their further decision-making (e.g., deploying the verified policies). In response to this challenge, we propose an approach to formalize and assess the cloud scaling process based on two formal models, namely, Markov Decision Process (MDP) and discrete-time Markov Chain (DTMC), and using PMC. We treat these two models independently for modeling similar cloud scaling contexts and assess the relevant aspects for gaining significant insights. Our approach consists of three main phases: modeling, encoding, and assessment. In our assessment, which includes complexity, performance, and sensitivity analyses, we have identified distinct sensitivities in the DTMC and MDP models. Specifically, DTMC is more sensitive to over-provisioning events, while MDP is more sensitive to under-provisioning events. These sensitivities are mainly influenced by two critical parameters: the initial number of VMs and the minimum number of VMs. Across various sample sizes, DTMC showed an increase in average latency violations from 53 to 209 for over-provisioning events, while MDP demonstrated a decrease in average latency violations from 88 to 23 for under-provisioning events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
publisher Springer
issn 9208542
language English
format Article
accesstype
record_format scopus
collection Scopus
_version_ 1809678010497892352