Treffer: A survey of fault tolerance techniques in distributed systems.
Weitere Informationen
This paper presents a thorough survey of fault tolerance mechanisms in distributed systems. It examines potential failure factors, available mechanisms, and their foundations, focusing on mechanisms explicitly developed for distributed systems. This paper summarizes how fault-tolerance techniques can be combined to provide various dependability characteristics. The primary goal of this paper is to serve as a guide to the extensive research and development activity in the domain of distributed systems, examining the current fault tolerance mechanisms and highlighting future avenues for research aiming to help in identifying areas for further exploration and innovation, providing a roadmap for their future work and emphasizing the significance of their contributions to the research community. [ABSTRACT FROM AUTHOR]