Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers

Authors
Hu, Cunchen [1, 2]
Wang, Chenxi [1, 2]
Wang, Sa [1, 2]
Sun, Ninghui [1, 2]
Bao, Yungang [1, 2]
Zhao, Jieru [3]
Kashyap, Sanidhya [4]
Zuo, Pengfei [5]
Chen, Xusheng [5]
Xu, Liangliang [5]
Zhang, Qin [5]
Feng, Hao [5]
Shan, Yizhou [5]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Chinese Acad Sci, ICT, State Key Lab Proc, Beijing, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[5] Huawei Cloud, Dublin, Ireland
Source
PROCEEDINGS OF THE 19TH WORKSHOP ON HOT TOPICS IN OPERATING SYSTEMS, HOTOS 2023 | 2023
Funding
National Natural Science Foundation of China
DOI
10.1145/3593856.3595897
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Data-intensive systems are the backbone of today's computing and are responsible for shaping data centers. Over the years, cloud providers have relied on three principles to maintain cost-effective data systems: use disaggregation to decouple scaling, use domain-specific computing to battle waning laws, and use serverless to lower costs. Although they work well individually, they fail to work in harmony: an issue amplified by emerging data system workloads. In this paper, we envision a distributed runtime to mitigate current shortcomings. The distributed runtime has a tiered access layer exposing declarative APIs, underpinned by a stateful serverless runtime with a distributed task execution model. It will be the narrow waist between data systems and hardware. Users are oblivious to data location, concurrency, disaggregation style, or even the hardware to do the computing. The underlying stateful serverless runtime transparently evolves with novel data-center architectures, such as disaggregation and tightly-coupled clusters. We prototype Skadi to showcase that the distributed runtime is practical.
Pages: 94-102
Page count: 9