Is the multigrid method fault tolerant? The multilevel case

被引:0
|
作者
Ainsworth M. [1 ,2 ,3 ]
Glusa C. [1 ,3 ]
机构
[1] Division of Applied Mathematics, Brown University, Providence, 02912, RI
[2] Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN
[3] Division of Applied Mathematics, Brown University, Providence, 02912, RI
来源
Ainsworth, Mark (MarkAinsworth@Brown.edu) | 1600年 / Society for Industrial and Applied Mathematics Publications卷 / 39期
关键词
Convergence analysis; Fault tolerance; Multigrid; Random matrices; Resilience;
D O I
10.1137/16m1097274
中图分类号
学科分类号
摘要
Computing at the exascale level is expected to be affected by a significantly higher rate of faults, due to increased component counts as well as power considerations. Therefore, current day numerical algorithms need to be re-examined to determine if they are fault resilient and to determine which critical operations need to be safeguarded in order to obtain performance that is close to the ideal fault-free method. In a previous paper, a framework for the analysis of random stationary linear iterations was presented and applied to the two grid method. The present work is concerned with the multigrid algorithm for the solution of linear systems of equations, which is widely used on high performance computing systems. It is shown that the Fault-Prone Multigrid Method is not resilient, unless the prolongation operation is protected. Strategies for fault detection and mitigation as well as protection of the prolongation operation are presented and tested, and a guideline for an optimal choice of parameters is devised. © 2017 Society for Industrial and Applied Mathematics.
引用
收藏
页码:C393 / C416
页数:23
相关论文
共 50 条
  • [1] IS THE MULTIGRID METHOD FAULT TOLERANT? THE MULTILEVEL CASE
    Ainsworth, Mark
    Glusa, Christian
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (06): : C393 - C416
  • [2] IS THE MULTIGRID METHOD FAULT TOLERANT? THE TWO-GRID CASE
    Ainsworth, Mark
    Glusa, Christian
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (02): : C116 - C143
  • [3] FAULT-TOLERANT PARALLEL MULTIGRID METHOD ON UNSTRUCTURED ADAPTIVE MESH
    Fung, Frederick
    Stals, Linda
    Deng, Quanling
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (05): : S145 - S169
  • [4] A Fault Tolerant Selective Harmonic Elimination Method for Modular Multilevel Converters
    Mohammadhassani, Ardavan
    Mehrizi-Sani, Ali
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [5] A Novel Fault-Tolerant Control Method for Modular Multilevel Converters
    Yang, Lixia
    Jia, Lixin
    Luo, Longfei
    Zhang, Yanbin
    Zhao, Linlin
    Zhang, Zhuxiang
    2019 IEEE 4TH INTERNATIONAL FUTURE ENERGY ELECTRONICS CONFERENCE (IFEEC), 2019,
  • [6] A New Fault-Tolerant Control Method for Cascaded Multilevel Inverter
    Wang, Baocheng
    Guo, Xiaoling
    Wang, Liqiao
    Li, Xin
    Sun, Xiaofeng
    2009 IEEE 6TH INTERNATIONAL POWER ELECTRONICS AND MOTION CONTROL CONFERENCE, VOLS 1-4, 2009, : 608 - 611
  • [7] A multilevel converter topology with fault tolerant ability
    Chen, AL
    Hu, L
    Chen, LF
    Deng, Y
    Yao, G
    He, XN
    APEC 2004: NINETEENTH ANNUAL IEEE APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITION, VOLS 1-3, 2004, : 1610 - 1616
  • [8] A Simple Fault Tolerant Multilevel Inverter Topology
    Soni, Nayana
    Borghate, Vijay B.
    Maddugari, Santosh Kumar
    Ambhore, Datta
    Sabyasachi, Sidharth
    2018 8TH IEEE INDIA INTERNATIONAL CONFERENCE ON POWER ELECTRONICS (IICPE), 2018,
  • [9] Fault-tolerant multilevel converter topology
    Ceballos, Salvador
    Pou, Josep
    Gabiola, Igor
    Luis Villate, Jose
    Zaragoza, Jordi
    Boroyevich, Dushan
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOLS 1-7, 2006, : 1577 - 1582
  • [10] Comparison of fault-tolerant multilevel inverters
    Gleissner, Michael
    Maier, Robert
    Bakran, Mark-M.
    2017 19TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS (EPE'17 ECCE EUROPE), 2017,