Standard Article

Communication-Induced Checkpointing Protocols and Rollback-Dependency Trackability: A Survey

  1. Jichiang Tsai1,
  2. Yi-Min Wang2

Published Online: 15 JAN 2008

DOI: 10.1002/9780470050118.ecse059

Wiley Encyclopedia of Computer Science and Engineering

Wiley Encyclopedia of Computer Science and Engineering

How to Cite

Tsai, J. and Wang, Y.-M. 2008. Communication-Induced Checkpointing Protocols and Rollback-Dependency Trackability: A Survey. Wiley Encyclopedia of Computer Science and Engineering. .

Author Information

  1. 1

    National Chung Hsing University, Taichung, Taiwan

  2. 2

    Microsoft Corporation, Redmond, Washington

Publication History

  1. Published Online: 15 JAN 2008

Abstract

Communication-induced checkpointing (CIC) can prevent a potential domino effect resulting from cascading rollbacks in a distributed computation. Moreover, it can be enhanced to achieve a stronger property, called rollback-dependency trackability (RDT), which allows all hidden rollback dependencies to be online trackable. In this article, we survey some important CIC and RDT protocols by introducing their basic concepts as well as by discussing their practical impacts.

Keywords:

  • distributed computations;
  • rollback recovery;
  • domino effect;
  • communication-induced checkpointing;
  • rollback-dependency trackability