Contributions to High-Performance Big Data Computing

被引:2
|
作者
Fox, Geoffrey [1 ]
Qiu, Judy [1 ]
Crandall, David [1 ]
Von Laszewski, Gregor [1 ]
Beckstein, Oliver [2 ]
Paden, John [3 ]
Paraskevakos, Ioannis [4 ]
Jha, Shantenu [4 ]
Wang, Fusheng [5 ]
Marathe, Madhav [6 ,7 ]
Vullikanti, Anil [6 ,7 ]
Cheatham, Thomas [8 ]
机构
[1] Indiana Univ, Bloomington, IN USA
[2] Arizona State Univ, Tempe, AZ 85287 USA
[3] Kansas Univ, Lawrence, KS USA
[4] Rutgers State Univ, New Brunswick, NJ USA
[5] SUNY Stony Brook, Stony Brook, NY 11794 USA
[6] Virginia Tech, Blacksburg, VA USA
[7] Univ Virginia, Charlottesville, VA 22903 USA
[8] Univ Utah, Salt Lake City, UT 84112 USA
关键词
HPC; Big Data; Clouds; Graph Analytics; Polar Science; Pathology; Biomolecular simulations; Network Science; MIDAS; SPIDAL; IMAGE REGISTRATION; DATA ANALYTICS; SOFTWARE; SYSTEM; SPARK; RECONSTRUCTION; LOCALIZATION; ALGORITHMS; LIBRARY; HADOOP;
D O I
10.3233/APC190005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Our project is at the interface of Big Data and HPC - High-Performance Big Data computing and this paper describes a collaboration between 7 collaborating Universities at Arizona State, Indiana (lead), Kansas, Rutgers, Stony Brook, Virginia Tech, and Utah. It addresses the intersection of High-performance and Big Data computing with several different application areas or communities driving the requirements for software systems and algorithms. We describe the base architecture, including the HPC-ABDS, High-Performance Computing enhanced Apache Big Data Stack, and an application use case study identifying key features that determine software and algorithm requirements. We summarize middleware including Harp-DAAL collective communication layer, Twister2 Big Data toolkit, and pilot jobs. Then we present the SPIDAL Scalable Parallel Interoperable Data Analytics Library and our work for it in core machine-learning, image processing and the application communities, Network science, Polar Science, Biomolecular Simulations, Pathology, and Spatial systems. We describe basic algorithms and their integration in end-to-end use cases.
引用
收藏
页码:34 / 81
页数:48
相关论文
共 50 条
  • [21] EVOLVE: Towards Converging Big-Data, High-Performance and Cloud-Computing Worlds
    Tzenetopoulos, Achilleas
    Masouros, Dimosthenis
    Koliogeorgi, Konstantina
    Xydis, Sotirios
    Soudris, Dimitrios
    Chazapis, Antony
    Kozanitis, Christos
    Bilas, Angelos
    Pinto, Christian
    Huy-Nam Nguyen
    Louloudakis, Stelios
    Gardikis, Georgios
    Vamvakas, George
    Aubrun, Michelle
    Symeonidou, Christy
    Spitadakis, Vassilis
    Xylogiannopoulos, Konstantinos
    Peischl, Bernhard
    Kalayci, Tahir Emre
    Stocker, Alexander
    Acquaviva, Jean-Thomas
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 975 - 980
  • [22] Toward High-Performance Computing and Big Data Analytics Convergence: The Case of Spark-DIY
    Caino-Lores, Silvina
    Carretero, Jesus
    Nicolae, Bogdan
    Yildiz, Orcun
    Peterka, Tom
    IEEE ACCESS, 2019, 7 : 156929 - 156955
  • [23] On Performance Prediction of Big Data Transfer in High-performance Networks
    Liu, Wuji
    Yun, Daqing
    Wu, Chase Q.
    Rao, Nageswara S., V
    Hou, Aiqin
    Shen, Wei
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [24] Transforming medical sciences with high-performance computing, high-performance data analytics and AI
    Lewandowski, Natalie
    Koller, Bastian
    TECHNOLOGY AND HEALTH CARE, 2023, 31 (04) : 1505 - 1507
  • [25] Lightweight distributed computing framework for orchestrating high performance computing and big data
    Ince, Muhammed Numan
    Gunay, Melih
    Ledet, Joseph
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (04) : 1571 - 1585
  • [26] CedCom: A High-Performance Architecture for Big Data Applications
    Raynaud, Tanguy
    Haque, Rafiqul
    Ait-kaci, Hassan
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 621 - 632
  • [27] High-performance modelling and simulation for big data applications
    Kolodziej, Joanna
    Gonzalez-Velez, Horacio
    Karatza, Helen D.
    SIMULATION MODELLING PRACTICE AND THEORY, 2017, 76 : 1 - 2
  • [28] Is High Performance Computing (HPC) Ready to Handle Big Data?
    Ray, Biplob R.
    Chowdhury, Morshed
    Atif, Usman
    FUTURE NETWORK SYSTEMS AND SECURITY, FNSS 2017, 2017, 759 : 97 - 112
  • [29] Big Data Proteogenomics and High Performance Computing: Challenges and Opportunities
    Saeed, Fahad
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 141 - 145
  • [30] 3rd Ieee international workshop on high-performance big data computing (HPBDC 2017)
    Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017, 2017,