An Evaluation of Fault Mitigation Method Using Spare Nodes
-
- Kazumi Yoshinaga
- RIKEN AICS
-
- Toyohisa Kameyama
- RIKEN AICS
-
- Atsushi Hori
- RIKEN AICS
-
- Yutaka Ishikawa
- RIKEN AICS
Bibliographic Information
- Other Title
-
- 予備ノードを利用した故障後の実行継続手法の検討と評価
Search this article
Abstract
In the upcoming Exa-scale era, faults could happen more frequently than ever, and thus, fault tolerance (FT) is getting more important. Although many FT mechanisms to survive failures has been proposed so far, there is no discussion how a job should survive from failures. In this paper, we explore and discuss three fault mitigation methods how to survive from a failure using spare nodes without loosing execution efficiency. Finally, it is discussed to apply those proposed method to real applications.
Journal
-
- IPSJ SIG Notes
-
IPSJ SIG Notes 2014 (21), 1-9, 2014-12-02
Information Processing Society of Japan (IPSJ)
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1572261552813164032
-
- NII Article ID
- 110009850815
-
- NII Book ID
- AN10463942
-
- Text Lang
- ja
-
- Data Source
-
- CiNii Articles