Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime

被引:2
|
作者
Yamazaki, Ichitaro [1 ]
Kurzak, Jakub [1 ]
Luszczek, Piotr [1 ]
Dongarra, Jack [1 ,2 ,3 ]
机构
[1] Univ Tennessee, Knoxville, TN 37996 USA
[2] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
[3] Univ Manchester, Manchester M13 9PL, Lancs, England
基金
美国国家科学基金会;
关键词
systolic array; QR decomposition; multithreading; message-passing; dataflow; runtime;
D O I
10.1109/IPDPSW.2014.167
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A systolic array provides an alternative computing paradigm to the von Neuman architecture. Though its hardware implementation has failed as a paradigm to design integrated circuits in the past, we are now discovering that the systolic array as a software virtualization layer can lead to an extremely scalable execution paradigm. To demonstrate this scalability, in this paper, we design and implement a 3D virtual systolic array to compute a tile QR decomposition of a tall-and-skinny dense matrix. Our implementation is based on a state-of-the-art algorithm that factorizes a panel based on a tree-reduction. Using a runtime developed as a part of the Parallel Ultra Light Systolic Array Runtime (PULSAR) project, we demonstrate on a Cray-XT5 machine how our virtual systolic array can be mapped to a large-scale machine and obtain excellent parallel performance. This is an important contribution since such a QR decomposition is used, for example, to compute a least squares solution of an overdetermined system, which arises in many scientific and engineering problems.
引用
收藏
页码:1495 / 1504
页数:10
相关论文
共 50 条
  • [41] Hierarchical Clustering-Aligning Framework Based Fast Large-Scale 3D Reconstruction Using Aerial Imagery
    Xie, Xiuchuan
    Yang, Tao
    Li, Dongdong
    Li, Zhi
    Zhang, Yanning
    REMOTE SENSING, 2019, 11 (03)
  • [42] USING HYBRID HEURISTIC EVALUATION METHOD TO UNCOVER THE CONCEPTUAL DESIGN TASKS SUPPORTED BY A HOLOGRAPHIC DISPLAY BASED TRULY 3D VIRTUAL DESIGN ENVIRONMENT
    Opiyo, Eliab Z.
    Horvath, Imre
    DETC 2008: PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATIONAL IN ENGINEERING CONFERENCE, VOL 3, PTS A AND B: 28TH COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2009, : 1649 - 1659
  • [43] Sparse decomposition-based 3D ultrasound imaging and its application in pipeline defect testing using a multi-transducer composite array
    Song, Shoupeng
    Li, Yingxue
    NONDESTRUCTIVE TESTING AND EVALUATION, 2018, 33 (03) : 237 - 252
  • [44] Large-scale 3D point-cloud semantic segmentation of urban and rural scenes using data volume decomposition coupled with pipeline parallelism
    Chew, Alvin Wei Ze
    Ji, Ankang
    Zhang, Limao
    AUTOMATION IN CONSTRUCTION, 2022, 133
  • [45] Web-based real-time visualization of large-scale weather radar data using 3D tiles
    Lu, Mingyue
    Wang, Xinhao
    Liu, Xintao
    Chen, Min
    Bi, Shuoben
    Zhang, Yadong
    Lao, Tengfei
    TRANSACTIONS IN GIS, 2021, 25 (01) : 25 - 43
  • [46] Large-Scale Fabrication of 3D Scaffold-Based Patterns of Microparticles and Breast Cancer Cells using Reusable Acoustofluidic Device
    Nguyen, Tan Dai
    Tran, Van-Thai
    Pudasaini, Sanam
    Gautam, Archana
    Lee, Jia Min
    Fu, Yong Qing
    Du, Hejun
    ADVANCED ENGINEERING MATERIALS, 2021, 23 (06)
  • [47] Large scale fabrication of asymmetric 2D and 3D micro/nano array pattern structures using multi-beam interference lithography technique for Solar cell texturing application
    Kirubaraj, A. Alfred
    Moni, D. Jackuline
    Devaprakasam, D.
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2018, 24 (06): : 2569 - 2575
  • [48] Large scale fabrication of asymmetric 2D and 3D micro/nano array pattern structures using multi-beam interference lithography technique for Solar cell texturing application
    A. Alfred Kirubaraj
    D. Jackuline Moni
    D. Devaprakasam
    Microsystem Technologies, 2018, 24 : 2569 - 2575
  • [49] A compact yet flexible design space for large-scale nonperiodic 3D woven composites based on a weighted game for generating candidate tow architectures
    Wang, Zhen-Pei
    Cox, Brian N.
    Kuehsamy, Shemuel Joash
    Jhon, Mark Hyunpong
    Sudre, Olivier
    Sridhar, N.
    Conduit, Gareth J.
    COMPUTER-AIDED DESIGN, 2024, 167
  • [50] CLUSTER-WISE REMOVAL OF REFLECTION ARTIFACTS IN LARGE-SCALE 3D POINT CLOUDS USING SUPERPIXEL-BASED GLASS REGION ESTIMATION
    Yun, Jae-Seong
    Sim, Jae-Young
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1780 - 1784