Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management

被引:0
|
作者
Wang, Liang [1 ]
Yang, Jinzhe [2 ]
Zhai, Jidong [1 ]
Yang, Guangwen [1 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Imperial Coll London, TC Technol, London SW7 2BX, England
[3] Zhejiang Lab, Hangzhou 311121, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Interference; Cloud computing; Dynamic scheduling; Production; Task analysis; Processor scheduling; Performance evaluation; Virtualization; cloud computing; vCPU scheduling; I/O performance; interference diagnosis;
D O I
10.1109/TPDS.2023.3329298
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Virtual machines (VMs) heavily rely on virtual CPUs (vCPUs) scheduling to achieve efficient I/O performance. The vCPU scheduling interference can cause inconsistent scheduling latency and degraded I/O performance, potentially compromising the services provided by affected VMs. Existing solutions have limitations, such as inefficiency in diagnosing interference issues or imposing undesired side effects on cloud systems. To address these challenges, we present Otter, a holistic technique for optimizing I/O performance in the presence of vCPU scheduling interference. Otter employs innovative methods to enhance interference diagnosis efficiency. First, we propose lightweight methods to measure the dynamic changes in scheduling latencies for co-running vCPUs, ensuring both flexibility and accuracy. Second, we propose fine-grained quantification methods to timely determine the interference, with low false positive and false negative rates. Third, we identify interference patterns that aid in analyzing the root causes of interference and preventing similar issues from recurring. Otter has been operational for one year in the production cloud at the National Supercomputing Center (Wuxi). It diagnoses and helps fix more than 470 vCPU scheduling interference-related issues, resulting in a 19.6% improvement in cloud service I/O performance with negligible overhead in production.
引用
收藏
页码:2315 / 2330
页数:16
相关论文
共 50 条
  • [31] EDC: An Elastic Data Cache to Optimizing the I/O Performance in Deduplicated SSDs
    Lu, Mengting
    Wang, Fang
    Li, Zongwei
    He, Wenpeng
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (07) : 2250 - 2262
  • [32] Analysis of I/O Performance for Optimizing Software Defined Storage in Cloud Integration
    Cha, Jae-Geun
    Kim, Seongwoon
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION SYSTEMS (ICCIS), 2018, : 222 - 226
  • [33] Who Is Your Neighbor: Net I/O Performance Interference in Virtualized Clouds
    Pu, Xing
    Liu, Ling
    Mei, Yiduo
    Sivathanu, Sankaran
    Koh, Younggyun
    Pu, Calton
    Cao, Yuanda
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2013, 6 (03) : 314 - 329
  • [34] Exploring Virtual Machine Covert Channel via I/O Performance Interference
    Yang, Ziye
    Chen, Ping
    2013 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CLOUDCOM-ASIA), 2013, : 232 - 239
  • [35] Automated Modeling of I/O Performance and Interference Effects in Virtualized Storage Systems
    Noorshams, Qais
    Busch, Axel
    Rentschler, Andreas
    Bruhn, Dominik
    Kounev, Samuel
    Tuma, Petr
    Reussner, Ralf
    2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2014, : 88 - 93
  • [36] An Interference Fair Queueing I/O Scheduler under High Performance Processors
    Sui Y.
    Ye J.
    Yang C.
    Tong D.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2020, 56 (06): : 1005 - 1012
  • [37] Performance analysis of parallel I/O scheduling approaches on cluster computing systems
    Abawajy, JH
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 724 - 729
  • [38] Energy-efficient task scheduling for mobile edge computing with virtual machine I/O interference
    Lu, Baoshan
    Fang, Junli
    Hong, Xuemin
    Shi, Jianghong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 538 - 549
  • [39] Optimizing Performance of Hybrid Electrochemical Energy Storage Systems through Effective Control: A Comprehensive Review
    Clemente, Alejandro
    Arias, Paula
    Gevorkov, Levon
    Trilla, Lluis
    Rey, Sergi Obrador
    Roger, Xavier Sanchez
    Dominguez-Garcia, Jose Luis
    Martinez, Alber Filba
    ELECTRONICS, 2024, 13 (07)
  • [40] Employee Engagement: Through Effective Performance Management - A Practical Guide for Managers
    Richardson, Stan
    ERGONOMICS, 2010, 53 (12) : 1506 - 1507