reviewACM Computing SurveysSep 1, 2002Closed access

A survey of rollback-recovery protocols in message-passing systems

IBM Research - Austin · The University of Texas at Austin · +2 more institutions

Indexed incrossref

Abstract

This survey covers rollback-recovery techniques that do not require special language constructs. In the first part of the survey we classify rollback-recovery protocols into checkpoint-based and log-based. Checkpoint-based protocols rely solely on checkpointing for system state restoration. Checkpointing can be coordinated, uncoordinated, or communication-induced. Log-based protocols combine checkpointing with logging of nondeterministic events, encoded in tuples called determinants . Depending on how determinants are logged, log-based protocols can be pessimistic, optimistic, or causal. Throughout the survey, we highlight the research issues that are at the core of rollback-recovery and present the solutions…

Citation impact

1,793
total citations
FWCI
75.31
Percentile
100%
References
329
Citations per year

Authors

4

Topics & keywords

Keywords
  • Rollback
  • Computer science
  • Nondeterministic algorithm
  • Implementation
  • Protocol (science)
  • Distributed computing
  • Theoretical computer science
  • Database
No related works found for this paper.