
Program Schedule:
Workshop Date: June 9
Program:
09:00 - 10:00 Welcome Address
10:00 - 10:30 Towards Resilient High Performance Applications Through Real Time
Reliability Metric Generation and Autonomous Failure Correction
(Clayton F. Chandler, Nathan DeBardeleben, Box Leangsuksun)
10:30 - 11:00 Break
11:00 - 11:30 Methodologies for Advance Warning of Compute Cluster Problems via
Statistical Analysis: A Case Study
(Ann Gentile, Jackson Mayo, Philippe Pebay, Diana Roe, David
Thompson, Matthew Wong)
11:30 - 12:00 Towards a Hardware Fault-Injection Testbed to Support Reproducible
Resiliency Experiments
(Ron Sass, Rahul R. Sharma, Nathan DeBardeleben)
12:00 - 12:30 Fault Injection Framework for System Resilience Evaluation
(Thomas Naughton, Wesley Bland, Geoffroy Vallee, Christian
Engelmann, Stephen L. Scott)
12:30 - 13:00 Closing Discussion; Funding Opportunities