Exoshuffle: An Extensible Shuffle Architecture

Cited by: 2
Authors
Luan, Frank Sifei [1]
Wang, Stephanie [1,2]
Yagati, Samyukta [1]
Kim, Sean [1]
Lien, Kenneth [1]
Ong, Isaac [1]
Hong, Tony [1]
Cho, SangBin [3]
Liang, Eric [3]
Stoica, Ion [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Anyscale, Berkeley, CA USA
[3] Anyscale, San Francisco, CA USA
Keywords
Shuffle; MapReduce; distributed computing; extensibility
DOI
10.1145/3603269.3604848
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Shuffle is one of the most expensive communication primitives in distributed data processing and is difficult to scale. Prior work addresses the scalability challenges of shuffle by building monolithic shuffle systems. These systems are costly to develop, and they are tightly integrated with batch processing frameworks that offer only high-level APIs such as SQL. New applications, such as ML training, require more flexibility and finer-grained interoperability with shuffle. They are often unable to leverage existing shuffle optimizations. We propose an extensible shuffle architecture. We present Exoshuffle, a library for distributed shuffle that offers competitive performance and scalability as well as greater flexibility than monolithic shuffle systems. We design an architecture that decouples the shuffle control plane from the data plane without sacrificing performance. We build Exoshuffle on Ray, a distributed futures system for data and ML applications, and demonstrate that we can: (1) rewrite previous shuffle optimizations as application-level libraries with an order of magnitude less code, (2) achieve shuffle performance and scalability competitive with monolithic shuffle systems, and break the CloudSort record as the world's most cost-efficient sorting system, and (3) enable new applications such as ML training to easily leverage scalable shuffle.
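The shuffle pattern the abstract refers to can be illustrated with a minimal sketch. This is not Exoshuffle's API or the authors' implementation: the standard-library thread pool stands in for a distributed futures system such as Ray, and the names `map_task`, `reduce_task`, and `simple_shuffle` are hypothetical. It only shows the all-to-all structure, where each map task partitions its block by key and each reduce task merges one partition from every map output.

```python
from concurrent.futures import ThreadPoolExecutor


def map_task(records, num_reducers):
    """Partition one input block into num_reducers buckets by integer key."""
    buckets = [[] for _ in range(num_reducers)]
    for key, value in records:
        buckets[key % num_reducers].append((key, value))
    return buckets


def reduce_task(parts):
    """Merge the pieces destined for one reducer and sort them by key."""
    merged = [record for part in parts for record in part]
    return sorted(merged)


def simple_shuffle(blocks, num_reducers):
    """Run an all-to-all shuffle; a local thread pool stands in for a cluster."""
    with ThreadPoolExecutor() as pool:
        # Map stage: one future per input block.
        map_futures = [pool.submit(map_task, block, num_reducers) for block in blocks]
        map_outputs = [future.result() for future in map_futures]
        # Reduce stage: reducer r collects bucket r from every map output.
        reduce_futures = [
            pool.submit(reduce_task, [output[r] for output in map_outputs])
            for r in range(num_reducers)
        ]
        return [future.result() for future in reduce_futures]
```

Calling `simple_shuffle(blocks, 2)` on a list of blocks of `(int_key, value)` pairs returns one sorted list per reducer. The sketch waits for all map tasks before starting any reduce task, which is the simplest possible schedule rather than an optimized one.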
Pages: 564-577
Page count: 14