An Analytic Model of Traffic Surges for Multi-Server Queues in Cloud Environments

Cited by: 3
Authors
Tadakamalla, Venkat [1]
Menasce, Daniel A. [1]
Affiliation
[1] George Mason Univ, Comp Sci Dept, Fairfax, VA 22030 USA
Keywords
cloud computing; traffic surges; trapezoidal workloads; triangular workloads; queuing theory; G/G/c
DOI
10.1109/CLOUD.2018.00092
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Many computer systems, such as cloud computing environments and Internet datacenters, consist of a multitude of servers that process user requests. The performance and scalability of these environments suffer significantly when the workload surges to levels that cause the arrival rate of requests to exceed the system's capacity to process them. This paper models these systems as G/G/c queues and derives equations that estimate the impact of workload surges on these multi-server systems. The existing queuing literature offers approximations and/or bounds for G/G/c systems in equilibrium, but not for systems subject to workload surges. This paper's main contributions are: (1) Generic equations for surges of any shape. (2) A set of equations to estimate the impact of trapezoidal and triangular surges on response time. (3) Extensive validations of the derived equations using a G/G/c simulator developed by the authors (available online) and Google cluster-usage trace workloads. The results show that the equations estimate the impact of surges on response time with great accuracy. The work presented in this paper can be used in the design of autonomic elasticity controllers in the cloud that vary the number of servers to mitigate the impact of workload surges.
Pages: 668-677
Number of pages: 10