Next Generation Workload Management System For Big Data on Heterogeneous Distributed Computing

被引:17
|
作者
Klimentov, A. [1 ]
Buncic, P. [2 ]
De, K. [3 ,4 ]
Jha, S.
Maeno, T. [1 ]
Mount, R. [5 ]
Nilsson, P. [1 ]
Oleynik, D. [3 ,4 ]
Panitkin, S. [1 ]
Petrosyan, A. [3 ,4 ]
Porter, R. J. [6 ]
Read, K. F. [7 ]
Vaniachine, A. [8 ]
Wells, J. C. [7 ]
Wenaus, T. [1 ]
机构
[1] Brookhaven Natl Lab, Upton, NY 11973 USA
[2] CERN, Geneva, Switzerland
[3] Univ Texas Arlington, Arlington, TX 76019 USA
[4] Rutgers State Univ, Piscataway, NJ USA
[5] SLAC Natl Accelerator Lab, Menlo Pk, CA USA
[6] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[7] Oak Ridge Natl Lab, Oak Ridge, TN USA
[8] Argonne Natl Lab Lemont, Argonne, IL USA
关键词
D O I
10.1088/1742-6596/608/1/012040
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the discovery of a Higgs boson. ATLAS and ALICE are the largest collaborations ever assembled in the sciences and are at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, both experiments rely on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses PanDA (Production and Data Analysis) Workload Management System (WMS) for managing the workflow for all data processing on hundreds of data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. The scale is demonstrated by the following numbers: PanDA manages O(10(2)) sites, O(10(5)) cores, O(10(8)) jobs per year, O(10(3)) users, and ATLAS data volume is O(10(17)) bytes. In 2013 we started an ambitious program to expand PanDA to all available computing resources, including opportunistic use of commercial and academic clouds and Leadership Computing Facilities (LCF). The project titled 'Next Generation Workload Management and Analysis System for Big Data' (BigPanDA) is funded by DOE ASCR and HEP. Extending PanDA to clouds and LCF presents new challenges in managing heterogeneity and supporting workflow. The BigPanDA project is underway to setup and tailor PanDA at the Oak Ridge Leadership Computing Facility (OLCF) and at the National Research Center "Kurchatov Institute" together with ALICE distributed computing and ORNL computing professionals. Our approach to integration of HPC platforms at the OLCF and elsewhere is to reuse, as much as possible, existing components of the PanDA system. We will present our current accomplishments with running the PanDA WMS at OLCF and other supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facilities infrastructure for High Energy and Nuclear Physics as well as other data-intensive science applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A Distributed Data Management System for Heterogeneous Sensor Networks
    Li, Xudong
    Chen, Lijun
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 599 - 604
  • [22] Next-Generation Intrusion Detection and Prevention System Performance in Distributed Big Data Network Security Architectures
    Hart, Michael
    Dave, Rushit
    Richardson, Eric
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 990 - 998
  • [23] DATA SYSTEM WITH DISTRIBUTED-PROCESSING FOR A NEXT GENERATION SATELLITE
    TSAI, CM
    RAGHAVENDER, M
    JOURNAL OF SPACECRAFT AND ROCKETS, 1989, 26 (02) : 116 - 123
  • [24] Distributed Data Processing Method for Next Generation IoT System
    Furuichi, Tetsuo
    Mineno, Hiroshi
    2016 IEEE 5TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS, 2016,
  • [25] Processing of big heterogeneous genomic datasets for tertiary analysis of Next Generation Sequencing data
    Masseroli, Marco
    Canakoglu, Arif
    Pinoli, Pietro
    Kaitoua, Abdulrahman
    Gulino, Andrea
    Horlova, Olha
    Nanni, Luca
    Bernasconi, Anna
    Perna, Stefano
    Stamoulakatou, Eirini
    Ceri, Stefano
    BIOINFORMATICS, 2019, 35 (05) : 729 - 736
  • [26] Next Generation Computing Systems with Heterogeneous Packaging Integration
    Knickerbocker, John U.
    2017 5TH INTERNATIONAL WORKSHOP ON LOW TEMPERATURE BONDING FOR 3D INTEGRATION (LTB-3D), 2017, : 45 - 45
  • [27] Distributed Big Data Analytics in Service Computing
    Yu, Weider D.
    Gottumukkala, AvinashChander
    Senthailselvi, Deenash Arivazhagan
    Maniraj, Prabhu
    Khonde, Tushar
    2017 IEEE 13TH INTERNATIONAL SYMPOSIUM ON AUTONOMOUS DECENTRALIZED SYSTEMS (ISADS 2017), 2017, : 55 - 60
  • [28] Parallel and distributed computing for Big Data applications
    Senger, Hermes
    Geyer, Claudio
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (08): : 2412 - 2415
  • [29] Distributed computing and big data techniques for efficient fault detection and data management in wireless networks
    Kiran, Ajmeera
    Renjith, P. N.
    Gupta, Sapna
    Ambala, Srinivas
    Raju, Preethi Sambandam
    Sriramsetti, Drakshayani
    OPTICAL AND QUANTUM ELECTRONICS, 2023, 55 (13)
  • [30] Big forensic data management in heterogeneous distributed systems: quick analysis of multimedia forensic data
    Quick, Darren
    Choo, Kim-Kwang Raymond
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (08): : 1095 - 1109