Large-Scale Algorithm Design for Parallel FFT-based Simulations on GPUs

Cited by: 0
Authors
Kulkarni, Anuva [1]
Franchetti, Franz [1]
Kovacevic, Jelena [1]
Affiliations
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
Keywords
Irregular domain decomposition; algorithm design; GPU; lossy compression;
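DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract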
We describe and analyze a co-design of algorithm and software for high-performance simulation with a numerical partial differential equation (PDE) solver on large-scale datasets. Large-scale scientific simulations involving parallel Fast Fourier Transforms (FFTs) have extreme memory requirements and high communication costs, which hampers high-resolution analysis on fine grids. Moreover, legacy Fortran scientific codes are difficult to accelerate on modern hardware such as GPUs because of GPU memory constraints. Our proposed solution uses signal processing techniques such as lossy compression and domain-local FFTs to lower iteration cost without adversely impacting the accuracy of the result. In this work, we discuss proof-of-concept results for various aspects of algorithm development.
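
As a rough illustration of the two ideas named in the abstract (domain-local transforms and lossy compression), the sketch below splits a 1D signal into subdomains, transforms each one independently, and stores only the largest-magnitude coefficients. It is not the authors' algorithm: the equal-sized blocks, the top-k truncation rule, the naive DFT, and all function names (`compress_block`, `decompress_block`, `roundtrip`) are assumptions made for this example, and the paper itself targets irregular domain decomposition on GPUs.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) forward DFT (stands in for a real FFT library)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(X):
    """Naive O(n^2) inverse DFT."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def compress_block(block, keep):
    """Transform one subdomain and keep only the `keep` largest coefficients."""
    coeffs = dft(block)
    order = sorted(range(len(coeffs)), key=lambda k: abs(coeffs[k]), reverse=True)
    return {k: coeffs[k] for k in order[:keep]}  # sparse, lossy representation


def decompress_block(sparse, n):
    """Rebuild a subdomain of length n from its retained coefficients."""
    full = [0j] * n
    for k, c in sparse.items():
        full[k] = c
    return [v.real for v in idft(full)]


def roundtrip(signal, block_size, keep):
    """Split into equal blocks (an assumption), compress each locally, reconstruct."""
    out = []
    for start in range(0, len(signal), block_size):
        block = signal[start:start + block_size]
        out.extend(decompress_block(compress_block(block, keep), len(block)))
    return out


def max_error(a, b):
    """Largest pointwise difference between two equal-length sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical test signal: one dominant mode plus a weak high-frequency mode.
signal = [math.sin(2 * math.pi * 2 * t / 16) + 0.01 * math.cos(2 * math.pi * 5 * t / 16)
          for t in range(64)]
approx = roundtrip(signal, block_size=16, keep=2)
print(f"kept 2 of 16 coefficients per block, max error: {max_error(signal, approx):.4f}")
```

Because each block is transformed on its own, no data moves between subdomains during the transform, and the number of retained coefficients trades memory footprint against reconstruction error.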
Pages: 301-305
Page count: 5
Related papers
50 in total
  • [31] Design of FFT-Based TDCC for GNSS Acquisition
    Kim, Binhee
    Kong, Seung-Hyun
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (05) : 2798 - 2808
  • [32] Accurate Parallel Algorithm for Tracking Inertial Particles in Large-Scale Direct Numerical Simulations of Turbulence
    Ishihara, Takashi
    Enohata, Kei
    Morishita, Koji
    Yokokawa, Mitsuo
    Ishii, Katsuya
    PARALLEL COMPUTING TECHNOLOGIES (PACT 2015), 2015, 9251 : 522 - 527
  • [33] An efficient particle tracking algorithm for large-scale parallel pseudo-spectral simulations of turbulence
    Lalescu, Cristian C.
    Bramas, Berenger
    Rampp, Markus
    Wilczek, Michael
    COMPUTER PHYSICS COMMUNICATIONS, 2022, 278
  • [34] Massively Parallel Multilevel Fast Multipole Algorithm for Extremely Large-Scale Electromagnetic Simulations: A Review
    He, Wei-Jia
    Huang, Xiao-Wei
    Yang, Ming-Lin
    Sheng, Xin-Qing
    Progress in Electromagnetics Research, 2022, 173 : 37 - 52
  • [35] A scalable parallel algorithm for large-scale reactive force-field molecular dynamics simulations
    Nomura, Ken-ichi
    Kalia, Rajiv K.
    Nakano, Aiichiro
    Vashishta, Priya
    COMPUTER PHYSICS COMMUNICATIONS, 2008, 178 (02) : 73 - 87
  • [36] Massively Parallel Multilevel Fast Multipole Algorithm for Extremely Large-Scale Electromagnetic Simulations: A Review
    He, Wei-Jia
    Huang, Xiao-Wei
    Yang, Ming-Lin
    Sheng, Xin-Qing
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2022, 173 : 37 - 52
  • [37] Improved FFT-based algorithm for GPS signal acquisition
    Wei, Wang
    Pei, Chen
    Chao, Han
    SEVENTH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND CONTROL TECHNOLOGY: OPTOELECTRONIC TECHNOLOGY AND INSTRUMENTS, CONTROL THEORY AND AUTOMATION, AND SPACE EXPLORATION, 2008, 7129
  • [38] FFT-Based Algorithm Improvements for Detecting Leakage in Pipelines
    Lay-Ekuakille, Aime
    Trotta, Amerigo
    Vendramin, Giuseppe
    Vanderbemdem, Philippe
    2009 6TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2009, : 906 - +
  • [39] Measurement of Harmonics and Interharmonics using FFT-based Algorithm
    Lin, Hsiung-Cheng
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 1650 - 1654
  • [40] A Low Complexity Algorithm for Code Doppler Compensation Using FFT-Based Parallel Acquisition Architecture
    Tang, Ping
    Li, Xiangming
    Wang, Shuai
    Wang, Ke
    CHINA SATELLITE NAVIGATION CONFERENCE (CSNC) 2018 PROCEEDINGS, VOL III, 2018, 499 : 355 - 364