Distributed Data-Parallel Computing Using a High-Level Programming Language

被引:0
|
作者
Isard, Michael
Yu, Yuan
机构
关键词
Distributed programming; cloud computing; concurrency;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and Map Reduce in three ways: by providing a general-purpose distributed execution engine for data-parallel applications; by adopting an expressive data model of strongly typed .NET objects; and by supporting general-purpose imperative and declarative operations on datasets within a traditional high-level programming language. A DryadLINQ program is a sequential program composed of LINQ expressions performing arbitrary side-effect-free operations on datasets, and can be written and debugged using standard .NET development tools. The DryadLINQ system automatically and transparently translates the data-parallel portions of the program into a distributed execution plan which is passed to the Dryad execution platform. Dryad, which has been in continuous operation for several years on production clusters made up of thousands of computers, ensures efficient, reliable execution of this plan on a large compute cluster. This paper describes the programming model, provides a high-level overview of the design and implementation of the Dryad and DryadLINQ systems, and discusses the tradeoffs and connections to parallel and distributed databases.
引用
收藏
页码:987 / 994
页数:8
相关论文
共 50 条
  • [21] A General-purpose Distributed Programming System using Data-parallel Streams
    Huang, Tsung-Wei
    Lin, Chun-Xun
    Guo, Guannan
    Wong, Martin D. F.
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1360 - 1363
  • [22] ParSoDA: high-level parallel programming for social data mining
    Belcastro L.
    Marozzo F.
    Talia D.
    Trunfio P.
    Social Network Analysis and Mining, 2019, 9 (01)
  • [23] JADE - A HIGH-LEVEL, MACHINE-INDEPENDENT LANGUAGE FOR PARALLEL PROGRAMMING
    RINARD, MC
    SCALES, DJ
    LAM, MS
    COMPUTER, 1993, 26 (06) : 28 - 38
  • [24] ORDERED PORTS - A LANGUAGE CONCEPT FOR HIGH-LEVEL DISTRIBUTED-PROGRAMMING
    BASU, J
    PATNAIK, LM
    GOSWAMI, AK
    COMPUTER JOURNAL, 1987, 30 (06): : 487 - 497
  • [25] UC - A SET-BASED LANGUAGE FOR DATA-PARALLEL PROGRAMMING
    BAGRODIA, R
    CHANDY, M
    DHAGAT, M
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 28 (02) : 186 - 201
  • [26] Efficient high-level parallel programming
    Botorog, GH
    Kuchen, H
    THEORETICAL COMPUTER SCIENCE, 1998, 196 (1-2) : 71 - 107
  • [27] THE EXTENDED C-LANGUAGE NCX FOR DATA-PARALLEL PROGRAMMING
    YUASA, T
    KIJIMA, T
    KONISHI, Y
    SYSTEMS AND COMPUTERS IN JAPAN, 1995, 26 (10) : 13 - 23
  • [28] DATA-PARALLEL PROGRAMMING ON MULTICOMPUTERS
    QUINN, MJ
    HATCHER, PJ
    IEEE SOFTWARE, 1990, 7 (05) : 69 - 76
  • [29] Experiences using high-level programming for parallel cellular computation
    Spezzano, G
    Talia, D
    HIGH PERFORMANCE COMPUTING SYSTEMS AND APPLICATIONS, 2000, 541 : 515 - 528
  • [30] Compositional high-level Petri net semantics of a parallel programming language with procedures
    Klaudel, H
    SCIENCE OF COMPUTER PROGRAMMING, 2001, 41 (03) : 195 - 240