A Novel Approach for Handling Soft Error in Conjugate Gradients

Cited by: 0
Authors
Ozturk, Muhammed Emin [1]
Renardy, Marissa [2]
Li, Yukun [2]
Agrawal, Gagan [1]
Chou, Ching-Shan [2]
Affiliations
[1] Ohio State Univ, Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Math, Columbus, OH 43210 USA
Keywords
Fault-tolerance; Soft errors; Iterative Solvers; Conjugate Gradients
DOI
10.1109/HiPC.2018.00030
Chinese Library Classification number
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
Soft errors, or bit flips, have recently become an important challenge in high-performance computing. In this paper, we focus on soft errors in a particular algorithm: conjugate gradients (CG). We present a series of techniques to detect soft errors in CG. We first derive a mathematical quantity that is monotonically decreasing. Next, we add a set of heuristics and combine our approach with previously established methods. We have extensively evaluated our method along three distinct dimensions. First, we show that the F-score of our detection is significantly better than that of two other methods. Second, we show that for soft errors that are not detected by our method, the resulting inaccuracy in the final results is small, and smaller than with other methods. Finally, we show that the runtime overheads of our method are lower than those of other methods.
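The abstract does not say which monotonically decreasing quantity the authors derive. As an illustration only, the sketch below monitors a textbook quantity with that property: the quadratic functional phi(x) = 0.5 x^T A x - b^T x, which CG does not increase from one iterate to the next in exact arithmetic when A is symmetric positive definite. The helper names (`tridiag`, `cg_with_monitor`), the test matrix, the slack threshold, and the injected perturbation (a large additive error standing in for a bit flip) are all assumptions and are not taken from the paper.

```python
def matvec(A, x):
    """Dense matrix-vector product for a list-of-rows matrix."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def energy(A, b, x):
    """phi(x) = 0.5 * x^T A x - b^T x, non-increasing along fault-free CG iterates."""
    return 0.5 * dot(x, matvec(A, x)) - dot(b, x)

def tridiag(n):
    """Hypothetical SPD test matrix: 4 on the diagonal, -1 next to it."""
    return [[4.0 if i == j else -1.0 if abs(i - j) == 1 else 0.0
             for j in range(n)] for i in range(n)]

def cg_with_monitor(A, b, iters, flip_at=None, tol=1e-20):
    """Run CG from x = 0 and return the iterations where phi increased."""
    x = [0.0] * len(b)
    r = list(b)            # residual b - A x for x = 0
    p = list(r)            # first search direction
    rs = dot(r, r)
    prev = energy(A, b, x)
    flagged = []
    for k in range(iters):
        if rs < tol:       # converged; avoid dividing by a vanishing residual
            break
        Ap = matvec(A, p)
        alpha = rs / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        if k == flip_at:
            x[0] += 1e3    # assumed stand-in for a bit flip in the iterate
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]
        rs = rs_new
        cur = energy(A, b, x)
        # Small relative slack so rounding noise is not reported as an error.
        if cur > prev + 1e-9 * (1.0 + abs(prev)):
            flagged.append(k)
        prev = cur
    return flagged

A = tridiag(6)
b = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
print("fault-free run, flagged iterations:", cg_with_monitor(A, b, iters=6))
print("corrupted run, flagged iterations:", cg_with_monitor(A, b, iters=6, flip_at=2))
```

This naive check costs an extra matrix-vector product per iteration and only catches errors large enough to raise phi, so it says nothing about the detection accuracy or runtime overheads reported in the paper.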
Pages: 193-202
Page count: 10