A novel algorithm of optimal matrix partitioning for parallel dense factorization on heterogeneous processors

被引:0
|
作者
Lastovetsky, Alexey [1 ]
Reddy, Ravi [1 ]
机构
[1] Univ Coll Dublin, Sch Informat & Comp Sci, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel algorithm of optimal matrix partitioning for parallel dense matrix factorization on heterogeneous processors based on their constant performance model. We prove the correctness of the algorithm and estimate its complexity. We demonstrate that this algorithm better suits extensions to more complicated, non-constant, performance models of heterogeneous processors than traditional algorithms.
引用
收藏
页码:261 / +
页数:2
相关论文
共 50 条
  • [31] Efficient parallel algorithm for dense matrix LU decomposition with pivoting on hypercubes
    Liu, ZY
    Cheung, DW
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1997, 33 (08) : 39 - 50
  • [32] Efficient parallel algorithm for dense matrix LU decomposition with pivoting on hypercubes
    Liu, Zhiyong
    Cheung, D.W.
    Computers and Mathematics with Applications, 1997, 33 (08): : 39 - 50
  • [33] Simplicial Nonnegative Matrix Tri-factorization: Fast Guaranteed Parallel Algorithm
    Nguyen, Duy-Khuong
    Quoc Tran-Dinh
    Ho, Tu-Bao
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 117 - 125
  • [35] Optimizing parallel matrix transpose algorithm on multi-core digital signal processors
    Pei X.
    Wang Q.
    Liao L.
    Li R.
    Mei S.
    Liu J.
    Pang Z.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2023, 45 (01): : 57 - 66
  • [36] Large Matrix Multiplication on a Novel Heterogeneous Parallel DSP Architecture
    Sohl, Joar
    Wang, Jian
    Liu, Dake
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2009, 5737 : 408 - 419
  • [37] A Novel Nonnegative Matrix Factorization Algorithm for Multi-manifold Learning
    Wang, Qian
    Chen, Wen-Sheng
    Pan, Binbin
    Li, Yugao
    Biometric Recognition, 2016, 9967 : 575 - 582
  • [38] A Fast Parallel Matrix Inversion Algorithm based on Heterogeneous Multicore Architectures
    Yu, Denggao
    He, Shiwen
    Huang, Yongming
    Yu, Guangshi
    Yang, Luxi
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 903 - 907
  • [39] Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization
    Duy Khuong Nguyen
    Tu Bao Ho
    Journal of Global Optimization, 2017, 68 : 307 - 328
  • [40] Accelerated parallel and distributed algorithm using limited internal memory for nonnegative matrix factorization
    Duy Khuong Nguyen
    Tu Bao Ho
    JOURNAL OF GLOBAL OPTIMIZATION, 2017, 68 (02) : 307 - 328