Conference proceeding
POSTER: Hardening Selective Protection across Multiple Program Inputs for HPC Applications
PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, pp.437-438
01/01/2022
DOI: 10.1145/3503221.3508414
Abstract
With the ever-shrinking size of transistors and increasing scale of applications, silent data corruptions (SDCs) have become a common yet serious issue in HPC applications. Selective instruction duplication (SID) is a popular fault-tolerance technique that can obtain a high SDC coverage with low-performance overhead, as it selects the most vulnerable parts of a program for protection with priority. However, existing studies of SID are confined to single program input in the evaluation, assuming that the error resilience of the program remains similar across inputs, leading to a drastic loss of SDC coverage from SID when the protected program runs different inputs. Hence, we proposed Sentinel, an automated compiler-based framework to mitigate the loss of SDC coverage. Evaluation results show that Sentinel can effectively mitigate the loss of SDC coverage (up to 97.00%) across multiple inputs, which significantly hardens existing SID techniques.
Details
- Title: Subtitle
- POSTER: Hardening Selective Protection across Multiple Program Inputs for HPC Applications
- Creators
- Yafan Huang - University of IowaShengjian Guo - BaiduSheng Di - Argonne National LaboratoryGuanpeng Li - University of IowaFranck Cappello - Argonne National Laboratory
- Resource Type
- Conference proceeding
- Publication Details
- PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, pp.437-438
- Publisher
- Assoc Computing Machinery
- DOI
- 10.1145/3503221.3508414
- Number of pages
- 2
- Grant note
- DE-AC02-06CH11357 / U.S. Department of Energy, Office of Science; United States Department of Energy (DOE)
- Language
- English
- Date published
- 01/01/2022
- Academic Unit
- Computer Science
- Record Identifier
- 9984411097802771
Metrics
5 Record Views