A Fast Parallel Matrix Inversion Algorithm based on Heterogeneous Multicore Architectures

被引:0
|
作者
Yu, Denggao [1 ]
He, Shiwen [1 ,2 ]
Huang, Yongming [1 ]
Yu, Guangshi [1 ]
Yang, Luxi [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Dept Radio Engn, State Key Lab Millimeter Waves, Nanjing 210096, Jiangsu, Peoples R China
关键词
matrix inversion; high performance computing; software-defined radio; GPU; CUDA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large matrix inversion is usually a basic step in a wide range of signal processing or numerical problems, such as digital filtering, equalization detection, and etc. It is essential to figure out an algorithm to invert large matrix quickly and accurately. On the other hand, the Graphics Processor Unit (GPU) is able to provide a low-cost and flexible multicore architecture for high performance computing, which has attracted many researchers' attention for the building of GPU-based software-defined radio (SDR). In this paper, we propose a fast parallel algorithm for matrix inversion on heterogeneous multicore architectures to utilize the computational power of GPU. Our implementation is based on a modified Squared Givens Rotations (SGR) algorithm, which could adapt to the GPU architecture effectively. The result implemented on Compute Unified Device Architecture (CUDA) obtains a speedup ratio more than 20x versus the CPU-based-only algorithm when the matrix become large, and runs at up to 12.14 gigaflops/s on a graphics processor Geforce GT620 in our implementation.
引用
收藏
页码:903 / 907
页数:5
相关论文
共 50 条
  • [21] Parallel Graph Partitioning on Multicore Architectures
    Sui, Xin
    Donald Nguyen
    Burtscher, Martin
    Pingali, Keshav
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2011, 6548 : 246 - +
  • [22] Parallel nonlinear preconditioners on multicore architectures
    Galiano, Vicente
    Migallon, Hector
    Migallon, Violeta
    Penades, Jose
    JOURNAL OF SUPERCOMPUTING, 2011, 58 (02): : 160 - 167
  • [23] Parallel Subgraph Counting for Multicore Architectures
    Aparicio, David
    Ribeiro, Pedro
    Silva, Fernando
    2014 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA), 2014, : 34 - 41
  • [24] Parallel nonlinear preconditioners on multicore architectures
    Vicente Galiano
    Héctor Migallón
    Violeta Migallón
    José Penadés
    The Journal of Supercomputing, 2011, 58 : 160 - 167
  • [25] Parallel skyline computation on multicore architectures
    Im, Hyeonseung
    Park, Jonghyun
    Park, Sungwoo
    INFORMATION SYSTEMS, 2011, 36 (04) : 808 - 823
  • [26] Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures
    Agullo, Emmanuel
    Bouwmeester, Henricus
    Dongarra, Jack
    Kurzak, Jakub
    Langou, Julien
    Rosenberg, Lee
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010, 2011, 6449 : 129 - +
  • [27] A parallel algorithm for matrix inversion based on the Sherman-Morrison formula
    Martínez, Ángeles
    Mas, José
    Recent Advances in Applied and Theoretical Mathematics, 2000, : 45 - 49
  • [28] Parallel Skyline Computation on Multicore Architectures
    Park, Sungwoo
    Kim, Taekyung
    Park, Jonghyun
    Kim, Jinha
    Im, Hyeonseung
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 760 - 771
  • [29] A massively parallel adaptive fast multipole method on heterogeneous architectures
    Institute for Scientific Computing Research, Lawrence Livermore National Laboratory, Livermore, CA, United States
    不详
    不详
    不详
    不详
    不详
    Commun ACM, 5 (101-109):
  • [30] A Massively Parallel Adaptive Fast Multipole Method on Heterogeneous Architectures
    Lashuk, Ilya
    Chandramowlishwaran, Aparna
    Langston, Harper
    Tuan-Anh Nguyen
    Sampath, Rahul
    Shringarpure, Aashay
    Vuduc, Richard
    Ying, Lexing
    Zorin, Denis
    Biros, George
    COMMUNICATIONS OF THE ACM, 2012, 55 (05) : 101 - 109