An Evaluation of Fault Mitigation Method Using Spare Nodes

Bibliographic Information

Other Title
  • 予備ノードを利用した故障後の実行継続手法の検討と評価

Search this article

Abstract

In the upcoming Exa-scale era, faults could happen more frequently than ever, and thus, fault tolerance (FT) is getting more important. Although many FT mechanisms to survive failures has been proposed so far, there is no discussion how a job should survive from failures. In this paper, we explore and discuss three fault mitigation methods how to survive from a failure using spare nodes without loosing execution efficiency. Finally, it is discussed to apply those proposed method to real applications.

Journal

  • IPSJ SIG Notes

    IPSJ SIG Notes 2014 (21), 1-9, 2014-12-02

    Information Processing Society of Japan (IPSJ)

Details 詳細情報について

  • CRID
    1572261552813164032
  • NII Article ID
    110009850815
  • NII Book ID
    AN10463942
  • Text Lang
    ja
  • Data Source
    • CiNii Articles

Report a problem

Back to top