articleJan 1, 2024GREEN OA

Why Do Computers Stop and What Can Be Done About It?

Indexed indatacite

Abstract

An analysis of the failure statistics of a commercially available fault-tolerant system shows that administration and software are the major contributors to failure. Various approaches to software fault tolerance are then discussed notably process-pairs, transactions and reliable storage. It is pointed out that faults in production software are often soft (transient) and that a transaction mechanism combined with persistent process-pairs provides fault-tolerant execution – the key to software fault-tolerance.

Citation impact

674
total citations
FWCI
Percentile
References
9
Citations per year

Authors

1

Topics & keywords

Keywords
  • Software fault tolerance
  • Computer science
  • Fault tolerance
  • Software
  • Database transaction
  • Reliability engineering
  • Key (lock)
  • Software engineering
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.