SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers

被引:6
|
作者
Ali-Eldin, Ahmed [1 ]
Westin, Jonathan [1 ]
Wang, Bin [1 ]
Sharma, Prateek [2 ]
Shenoy, Prashant [1 ]
机构
[1] UMass Amherst, Amherst, MA 01003 USA
[2] Indiana Univ, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
WORKLOAD;
D O I
10.1145/3307681.3325397
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many cloud providers offer servers with transient availability at a reduced cost. These servers can be unilaterally revoked by the provider, usually after a warning period to the user. Until recently, it has been thought that these servers are not suitable to run latency-sensitive workloads due to their transient availability. In this paper, we introduce SpotWeb, a framework for running latency-sensitive web workloads on transient computing platforms while maintaining the Quality-of-Service (QoS) of the running applications. SpotWeb is based on three novel concepts; using multi-period optimization-a novel approach developed in finance-for server selection; transiency-aware load-balancing; and using intelligent capacity over-provisioning. We implement SpotWeb and evaluate its performance in both simulations and testbed experiments. Our results show that SpotWeb reduces costs by up to 50% compared to state-of-the-art solutions while being scalable to hundreds of cloud server configurations.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] NeiLatS: Neighbor-Aware Latency-Sensitive Application Scheduling in Heterogeneous Cloud-Edge Environment
    Li, Huadong
    Liu, Hui
    Liu, Changyuan
    Chen, Aoqi
    Niu, Zhaocheng
    Du, Junzhao
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 615 - 624
  • [42] Understanding Performance Interference Benchmarking and Application Profiling Techniques for Cloud-hosted Latency-Sensitive Applications
    Shekhar, Shashank
    Barve, Yogesh
    Gokhale, Aniruddha
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC' 17), 2017, : 187 - 188
  • [43] Real-time maintenance of latency-sensitive 5G services through network slicing
    Rafael Montero
    Fernando Agraz
    Albert Pagès
    Salvatore Spadaro
    Photonic Network Communications, 2020, 40 : 221 - 232
  • [44] Real-time maintenance of latency-sensitive 5G services through network slicing
    Montero, Rafael
    Agraz, Fernando
    Pages, Albert
    Spadaro, Salvatore
    PHOTONIC NETWORK COMMUNICATIONS, 2020, 40 (03) : 221 - 232
  • [45] Distributed Join-the-Idle-Queue for Low Latency Cloud Services
    Wang, Chunpu
    Feng, Chen
    Cheng, Julian
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (05) : 2309 - 2319
  • [46] Eliminating OS-caused Large JVM Pauses for Latency-sensitive Java']Java-based Cloud Platforms
    Zhuang, Zhenyun
    Tran, Cuong
    Ramachandra, Haricharan
    Sridharan, Badri
    PROCEEDINGS OF 2016 IEEE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2016, : 694 - 701
  • [47] Fog-Aided Verifiable Privacy Preserving Access Control for Latency-Sensitive Data Sharing in Vehicular Cloud Computing
    Xue, Kaiping
    Hong, Jianan
    Ma, Yongjin
    Wei, David S. L.
    Hong, Peilin
    Yu, Nenghai
    IEEE NETWORK, 2018, 32 (03): : 7 - 13
  • [48] Near-optimal Cloud-Network Integrated Resource Allocation for Latency-Sensitive B5G
    Shokrnezhad, Masoud
    Taleb, Tarik
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 4498 - 4503
  • [49] Latency-Sensitive Edge/Cloud Serverless Dynamic Deployment Over Telemetry-Based Packet-Optical Network
    Pelle, Istvan
    Paolucci, Francesco
    Sonkoly, Balazs
    Cugini, Filippo
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (09) : 2849 - 2863
  • [50] An Energy-Aware Task Offloading and Load Balancing for Latency-Sensitive IoT Applications in the Fog-Cloud Continuum
    Mahapatra, Abhijeet
    Majhi, Santosh K.
    Mishra, Kaushik
    Pradhan, Rosy
    Rao, D. Chandrasekhar
    Panda, Sandeep K.
    IEEE ACCESS, 2024, 12 : 14334 - 14349