SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers

被引：6

作者：

Ali-Eldin, Ahmed ^{[1
]}

Westin, Jonathan ^{[1
]}

Wang, Bin ^{[1
]}

Sharma, Prateek ^{[2
]}

Shenoy, Prashant ^{[1
]}

机构：

[1] UMass Amherst, Amherst, MA 01003 USA

[2] Indiana Univ, Bloomington, IN 47405 USA

来源：

HPDC'19: PROCEEDINGS OF THE 28TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING | 2019年

基金：

美国国家科学基金会;

关键词：

WORKLOAD;

D O I：

10.1145/3307681.3325397

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Many cloud providers offer servers with transient availability at a reduced cost. These servers can be unilaterally revoked by the provider, usually after a warning period to the user. Until recently, it has been thought that these servers are not suitable to run latency-sensitive workloads due to their transient availability. In this paper, we introduce SpotWeb, a framework for running latency-sensitive web workloads on transient computing platforms while maintaining the Quality-of-Service (QoS) of the running applications. SpotWeb is based on three novel concepts; using multi-period optimization-a novel approach developed in finance-for server selection; transiency-aware load-balancing; and using intelligent capacity over-provisioning. We implement SpotWeb and evaluate its performance in both simulations and testbed experiments. Our results show that SpotWeb reduces costs by up to 50% compared to state-of-the-art solutions while being scalable to hundreds of cloud server configurations.

引用

页码：1 / 12

页数：12

共 50 条

[41] NeiLatS: Neighbor-Aware Latency-Sensitive Application Scheduling in Heterogeneous Cloud-Edge Environment
Li, Huadong
Liu, Hui
Liu, Changyuan
Chen, Aoqi
Niu, Zhaocheng
Du, Junzhao
PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 615 - 624
[42] Understanding Performance Interference Benchmarking and Application Profiling Techniques for Cloud-hosted Latency-Sensitive Applications
Shekhar, Shashank
Barve, Yogesh
Gokhale, Aniruddha
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC' 17), 2017, : 187 - 188
[43] Real-time maintenance of latency-sensitive 5G services through network slicing
Rafael Montero
Fernando Agraz
Albert Pagès
Salvatore Spadaro
Photonic Network Communications, 2020, 40 : 221 - 232
[44] Real-time maintenance of latency-sensitive 5G services through network slicing
Montero, Rafael
Agraz, Fernando
Pages, Albert
Spadaro, Salvatore
PHOTONIC NETWORK COMMUNICATIONS, 2020, 40 (03) : 221 - 232
[45] Distributed Join-the-Idle-Queue for Low Latency Cloud Services
Wang, Chunpu
Feng, Chen
Cheng, Julian
IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (05) : 2309 - 2319
[46] Eliminating OS-caused Large JVM Pauses for Latency-sensitive Java']Java-based Cloud Platforms
Zhuang, Zhenyun
Tran, Cuong
Ramachandra, Haricharan
Sridharan, Badri
PROCEEDINGS OF 2016 IEEE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2016, : 694 - 701
[47] Fog-Aided Verifiable Privacy Preserving Access Control for Latency-Sensitive Data Sharing in Vehicular Cloud Computing
Xue, Kaiping
Hong, Jianan
Ma, Yongjin
Wei, David S. L.
Hong, Peilin
Yu, Nenghai
IEEE NETWORK, 2018, 32 (03): : 7 - 13
[48] Near-optimal Cloud-Network Integrated Resource Allocation for Latency-Sensitive B5G
Shokrnezhad, Masoud
Taleb, Tarik
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 4498 - 4503
[49] Latency-Sensitive Edge/Cloud Serverless Dynamic Deployment Over Telemetry-Based Packet-Optical Network
Pelle, Istvan
Paolucci, Francesco
Sonkoly, Balazs
Cugini, Filippo
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (09) : 2849 - 2863
[50] An Energy-Aware Task Offloading and Load Balancing for Latency-Sensitive IoT Applications in the Fog-Cloud Continuum
Mahapatra, Abhijeet
Majhi, Santosh K.
Mishra, Kaushik
Pradhan, Rosy
Rao, D. Chandrasekhar
Panda, Sandeep K.
IEEE ACCESS, 2024, 12 : 14334 - 14349

← 1 2 3 4 5 →