Distributed Data-Parallel Computing Using a High-Level Programming Language

被引:0
|
作者
Isard, Michael
Yu, Yuan
机构
关键词
Distributed programming; cloud computing; concurrency;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and Map Reduce in three ways: by providing a general-purpose distributed execution engine for data-parallel applications; by adopting an expressive data model of strongly typed .NET objects; and by supporting general-purpose imperative and declarative operations on datasets within a traditional high-level programming language. A DryadLINQ program is a sequential program composed of LINQ expressions performing arbitrary side-effect-free operations on datasets, and can be written and debugged using standard .NET development tools. The DryadLINQ system automatically and transparently translates the data-parallel portions of the program into a distributed execution plan which is passed to the Dryad execution platform. Dryad, which has been in continuous operation for several years on production clusters made up of thousands of computers, ensures efficient, reliable execution of this plan on a large compute cluster. This paper describes the programming model, provides a high-level overview of the design and implementation of the Dryad and DryadLINQ systems, and discusses the tradeoffs and connections to parallel and distributed databases.
引用
收藏
页码:987 / 994
页数:8
相关论文
共 50 条
  • [31] An integrated synchronization and consistency protocol for the implementation of a high-level parallel programming language
    Rinard, MC
    10TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM - PROCEEDINGS OF IPPS '96, 1996, : 549 - 553
  • [32] Data-parallel programming on a reconfigurable parallel computer
    Sen, RK
    Rajesh, K
    Periswamy, M
    Selvakumar, S
    IETE TECHNICAL REVIEW, 1998, 15 (03) : 181 - 189
  • [33] A GEOMETRICAL DATA-PARALLEL LANGUAGE
    DEKEYSER, JL
    LAZURE, D
    MARQUET, P
    SIGPLAN NOTICES, 1994, 29 (04): : 31 - 40
  • [34] A task- and data-parallel programming language based on shared objects
    Ben Hassen, S
    Bal, HE
    Jacobs, CJH
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1998, 20 (06): : 1131 - 1170
  • [35] VALID, A HIGH-LEVEL FUNCTIONAL PROGRAMMING LANGUAGE FOR DATA FLOW MACHINES
    AMAMIYA, M
    HASEGAWA, R
    ONO, S
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1984, 32 (05): : 793 - 802
  • [36] High-level parallel programming in a heterogeneous world
    Daniel Garcia, J.
    Llanos, Diego R.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (05):
  • [37] MULTIPROCESSOR SYSTEMS PROGRAMMING IN A HIGH-LEVEL DATA-FLOW LANGUAGE
    GAUDIOT, JL
    LEE, LT
    LECTURE NOTES IN COMPUTER SCIENCE, 1987, 258 : 134 - 151
  • [38] The case for high-level parallel programming in ZPL
    Chamberlain, BL
    Choi, SE
    Lewis, EC
    Snyder, L
    Weathersby, WD
    Lin, C
    IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (03): : 76 - 86
  • [39] ClusterGOP: A high-level parallel programming environment
    Cao, JN
    2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2004, : 158 - 158
  • [40] VERY HIGH-LEVEL PROGRAMMING LANGUAGE FOR DATA-PROCESSING APPLICATIONS
    HAMMER, M
    HOWE, WG
    KRUSKAL, VJ
    WLADAWSKY, I
    COMMUNICATIONS OF THE ACM, 1977, 20 (11) : 832 - 840