Self-Recovering Parallel Applications in Multi-Core Systems

被引:6
|
作者
Bizot, Gilles [1 ]
Avresky, Dimiter [2 ]
Chaix, Fabien [1 ]
Zergainoh, Nacer-Eddine [1 ]
Nicolaidis, Michael [1 ]
机构
[1] TIMA Lab CNRS UJF INP, Grenoble, France
[2] IRIANC, Boston, MA USA
关键词
Self-Recovering; Multi-Core Chip; Seamless Execution; Parallel Streaming Application; Adaptive Fault-Tolerant Routing;
D O I
10.1109/NCA.2011.14
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a Self-Recovering strategy, which is able to "re-map" dynamically application tasks on a multi-core system, is presented. Based on run-time failure aware techniques, this Self-Recovering strategy guarantees seamlessly termination and delivering the expected results despite multiple node and link failures in a 2D mesh topology. It has been demonstrated, based on a statistical analysis, that the proposed technique is able to re-map the tasks of faulty nodes in a bounded number of steps. The theoretical results have been validated by simulations. The proposed technique is allowing to bypass multiple nodes, routers and links failures with a predictable number of hops. It has been demonstrated that the Motion JPEG-2000 application can be parallelized and formally represented as a Directed Acyclic Graph (DAG). It is worth noting that the proposed technique has been validated by the simulation of a 1000 cores system, in the presence of nodes and links failures up to 10%. Therefore, the proposed technique has been shown to be efficient for seamless execution of parallel streaming applications and to provide the Execution Time Reduction Ratio close to ideal.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] High-Performance Parallel Fault Simulation for Multi-Core Systems
    Karami, Masoomeh
    Haghbayan, Mohammad-hashem
    Ebrahimi, Masoumeh
    Nejatollahi, Hamid
    Tenhunen, Hannu
    Plosila, Juha
    2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 207 - 211
  • [32] METHOD OF SELF-RECOVERING EQUALIZATION FOR MULTILEVEL AMPLITUDE-MODULATION SYSTEMS
    SATO, Y
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1975, CO23 (06) : 679 - 682
  • [33] Parallel Discrete Event Simulation for Multi-Core Systems: Analysis and Optimization
    Wang, Jingjing
    Jagtap, Deepak
    Abu-Ghazaleh, Nael
    Ponomarev, Dmitry
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (06) : 1574 - 1584
  • [34] Efficient and Scalable Parallel Algorithm for Sorting Multisets on Multi-core Systems
    Zhong, Cheng
    Qu, Zeng-Yan
    Yang, Feng
    Yin, Meng-Xiao
    Li, Xia
    JOURNAL OF COMPUTERS, 2012, 7 (01) : 30 - 41
  • [35] ParSA: Parallel Simulated Annealing Placement Algorithm for Multi-core Systems
    Sanjabi, Mercedeh
    Jahanian, Ali
    Amanollahi, Saba
    Miralaei, Negar
    2012 16TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SYSTEMS (CADS), 2012, : 19 - 24
  • [36] Generation of Schedule Tables on Multi-core Systems for AUTOSAR Applications
    Wang, Wenhao
    Camut, Fabrice
    Miramond, Benoit
    PROCEEDINGS OF THE 2016 CONFERENCE ON DESIGN AND ARCHITECTURES FOR SIGNAL & IMAGE PROCESSING, 2016, : 191 - 198
  • [37] Efficient self-recovering ASIC design
    Hamilton, SN
    Orailoglu, A
    IEEE DESIGN & TEST OF COMPUTERS, 1998, 15 (04): : 25 - 35
  • [38] Efficient self-recovering ASIC design
    Hamilton, Samuel N.
    Orailoglu, Alex
    IEEE Design and Test of Computers, 1998, 15 (04): : 25 - 35
  • [39] A self-recovering mechanochromic chiral π-gelator
    Kartha, Kalathil K.
    Nair, Vishnu S.
    Praveen, Vakayil K.
    Takeuchi, Masayuki
    Ajayaghosh, Ayyappanpillai
    JOURNAL OF MATERIALS CHEMISTRY C, 2019, 7 (05) : 1292 - 1297
  • [40] Using Multi-core Processor to support Network Parallel Image Processing Applications
    Ramraj, E.
    Rajan, A. Senthil
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2009, : 232 - +