共 50 条
- [21] MPI/FT™:: Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing FIRST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2001, : 26 - 33
- [22] FAIL-MPI: How fault-tolerant is fault-tolerant MPI? 2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2, 2006, : 133 - +
- [23] Supporting Task-level Fault-Tolerance in HPC Workflows by Launching MPI Jobs inside MPI Jobs PROCEEDINGS OF WORKS 2017: 12TH WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, 2017,
- [24] NR-MPI: a Non-stop and Fault Resilient MPI 2013 19TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2013), 2013, : 190 - 199
- [26] Portable distributed priority queues with MPI CONCURRENCY-PRACTICE AND EXPERIENCE, 1998, 10 (03): : 175 - 198
- [27] Run-Through Stabilization: An MPI Proposal for Process Fault Tolerance RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2011, 6960 : 329 - +
- [28] Evaluating and extending user-level fault tolerance in MPI applications INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (03): : 305 - 319
- [29] Supporting User-directed Fault Tolerance Over Standard MPI PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 696 - 697