Self-Recovering Parallel Applications in Multi-Core Systems

Cited by: 6
Authors
Bizot, Gilles [1]
Avresky, Dimiter [2]
Chaix, Fabien [1]
Zergainoh, Nacer-Eddine [1]
Nicolaidis, Michael [1]
Affiliations
[1] TIMA Lab CNRS UJF INP, Grenoble, France
[2] IRIANC, Boston, MA USA
Keywords
Self-Recovering; Multi-Core Chip; Seamless Execution; Parallel Streaming Application; Adaptive Fault-Tolerant Routing;
DOI
10.1109/NCA.2011.14
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Subject classification code
0812;
Abstract
This paper presents a Self-Recovering strategy that dynamically "re-maps" application tasks on a multi-core system. Based on run-time failure-aware techniques, the strategy guarantees seamless termination and delivery of the expected results despite multiple node and link failures in a 2D mesh topology. A statistical analysis demonstrates that the proposed technique re-maps the tasks of faulty nodes in a bounded number of steps, and the theoretical results have been validated by simulations. The technique bypasses multiple node, router, and link failures with a predictable number of hops. The paper also demonstrates that the Motion JPEG-2000 application can be parallelized and formally represented as a Directed Acyclic Graph (DAG). The technique has been validated by simulating a 1000-core system in the presence of node and link failures of up to 10%. The proposed technique is therefore shown to be efficient for the seamless execution of parallel streaming applications and to provide an Execution Time Reduction Ratio close to ideal.
Pages: 8
Related papers
50 in total
  • [1] A parallel programming assessment for stream processing applications on multi-core systems
    Andrade, Gabriella
    Griebler, Dalvan
    Santos, Rodrigo
    Fernandes, Luiz Gustavo
    COMPUTER STANDARDS & INTERFACES, 2023, 84
  • [2] Automatic synthesis of self-recovering VLSI systems
    Orailoglu, A
    Karri, R
    IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (02) : 131 - 142
  • [3] Parallel Skyline Queries on Multi-Core Systems
    Liou, Meng-Zong
    Shu, Yi-Teng
    Chen, Wei-Mei
    2013 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2013, : 287 - 292
  • [4] Self-recovering filter
    Baranov, D.A.
    Danilov, N.V.
    Khimicheskoe I Neftegazovoe Mashinostroenie, 2001, (07) : 3 - 5
  • [5] On the maturity of parallel applications for asymmetric multi-core processors
    Chronaki, Kallia
    Moreto, Miguel
    Casas, Marc
    Rico, Alejandro
    Badia, Rosa M.
    Ayguade, Eduard
    Valero, Mateo
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 127 : 105 - 115
  • [6] Parallel and Distributed Simulation of networked Multi-Core Systems
    Wehner, Philipp
    Goehringer, Diana
    2014 INTERNATIONAL SYMPOSIUM ON SYSTEM-ON-CHIP (SOC), 2014
  • [7] A Parallel Packet Processing Method On Multi-Core Systems
    Li, Yunchun
    Qiao, Xinxin
    2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 78 - 81
  • [8] Parallel Loop Self-Scheduling for Heterogeneous Cluster Systems with Multi-Core Computers
    Wu, Chao-Chin
    Lai, Lien-Fu
    Chiu, Po-Hsun
    2008 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, VOLS 1-3, PROCEEDINGS, 2008, : 251 - 256
  • [9] Enhanced Parallel Loop Self-Scheduling for Heterogeneous Multi-Core Cluster Systems
    Wu, Chao-Chin
    Huang, Liang-Tsung
    Lai, Lien-Fu
    Chen, Ming-Lung
    2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 568 - +
  • [10] A self-recovering rake receiver for asynchronous CDMA systems
    Torlak, M
    Evans, BL
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2869 - 2872