Exoshuffle: An Extensible Shuffle Architecture

被引:2
|
作者
Luan, Frank Sifei [1 ]
Wang, Stephanie [1 ,2 ]
Yagati, Samyukta [1 ]
Kim, Sean [1 ]
Lien, Kenneth [1 ]
Ong, Isaac [1 ]
Hong, Tony [1 ]
Cho, SangBin [3 ]
Liang, Eric [3 ]
Stoica, Ion [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Anyscale, Berkeley, CA USA
[3] Anyscale, San Francisco, CA USA
关键词
Shuffle; MapReduce; distributed computing; extensibility;
D O I
10.1145/3603269.3604848
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Shuffle is one of the most expensive communication primitives in distributed data processing and is difficult to scale. Prior work addresses the scalability challenges of shuffle by building monolithic shuffle systems. These systems are costly to develop, and they are tightly integrated with batch processing frameworks that offer only high-level APIs such as SQL. New applications, such as ML training, require more flexibility and finer-grained interoperability with shuffle. They are often unable to leverage existing shuffle optimizations. We propose an extensible shuffle architecture. We present Exoshuffle, a library for distributed shuffle that offers competitive performance and scalability as well as greater flexibility than monolithic shuffle systems. We design an architecture that decouples the shuffle control plane from the data plane without sacrificing performance. We build Exoshuffle on Ray, a distributed futures system for data and ML applications, and demonstrate that we can: (1) rewrite previous shuffle optimizations as application-level libraries with an order of magnitude less code, (2) achieve shuffle performance and scalability competitive with monolithic shuffle systems, and break the CloudSort record as the world's most cost-efficient sorting system, and (3) enable new applications such as ML training to easily leverage scalable shuffle.
引用
收藏
页码:564 / 577
页数:14
相关论文
共 50 条
  • [21] An extensible browser architecture for mobile terminals
    Saida, Yoshinori
    Chishima, Hiroshi
    Hieda, Satoshi
    Sato, Naoki
    Nakamoto, Yukikazu
    Proc Int Conf Distrib Comput Syst, 1600, (394-399):
  • [22] An inclusive and extensible architecture for electronic brokerage
    Hands, J
    Bessonov, M
    Blinov, M
    Patel, A
    Smith, R
    DECISION SUPPORT SYSTEMS, 2000, 29 (04) : 305 - 321
  • [23] Extensible communication architecture for Grid nodes
    Mohamed, N
    Al-Jaroodi, J
    Jiang, H
    ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 40 - 44
  • [24] Implemented optical perfect shuffle with a planarized architecture
    Liu, J
    Sun, DG
    Zhao, CH
    Chen, RT
    ALGORITHMS, DEVICES, AND SYSTEMS FOR OPTICAL INFORMATION PROCESSING, 1997, 3159 : 200 - 209
  • [25] Software architecture for flexible and extensible image decoding
    Arakawa, H
    Maeda, T
    Etoh, M
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 1997, 10 (1-3) : 235 - 248
  • [26] ARCHITECTURE FOR AN INTEGRATED, EXTENSIBLE ENTERPRISE MANAGEMENT DIRECTOR
    STRUTT, C
    SHURTLEFF, DG
    INTEGRATED NETWORK MANAGEMENT, 1, 1989, : 61 - 72
  • [27] Designing an extensible architecture for Personalized Ambient Information
    Elsholz, Jan-Patrick
    de Melo, Guido
    Hermann, Marc
    Weber, Michael
    PERVASIVE AND MOBILE COMPUTING, 2009, 5 (05) : 592 - 605
  • [28] The eXtensible Tutor Architecture: A New Foundation for ITS
    Nuzzo-Jones, Goss
    Walonoski, Jason A.
    Heffernan, Neil T.
    Livak, Tom
    ARTIFICIAL INTELLIGENCE IN EDUCATION: SUPPORTING LEARNING THROUGH INTELLIGENT AND SOCIALLY INFORMED TECHNOLOGY, 2005, 125 : 902 - 904
  • [29] An extensible and lightweight architecture for adaptive server applications
    Gorton, Ian
    Liu, Yan
    Trivedi, Nihar
    SOFTWARE-PRACTICE & EXPERIENCE, 2008, 38 (08): : 853 - 883
  • [30] An Extensible Enterprise Architecture Pattern for Turnaround Management
    Ellermann, Katharina
    Steeger, Jonas
    Steffens, Ulrike
    2018 IEEE 22ND INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE WORKSHOPS (EDOCW 2018), 2018, : 170 - 175