The failure detector abstraction
Similar resources
The Impact Failure Detector
This work proposes a new, flexible unreliable failure detector whose output reflects the trust level of a set of processes. By expressing the relevance of each process in the set as an impact factor value, our approach allows the detector output to be tuned, enabling softer or stricter monitoring. The idea behind our proposal is that, according to an acceptable margin of fail...
The Φ Accrual Failure Detector
Detecting failures is a fundamental issue for fault tolerance in distributed systems. Recently, many people have come to realize that failure detection ought to be provided as some form of generic service, similar to IP address lookup or time synchronization. However, this has not been successful so far. One of the reasons is the difficulty of satisfying several application requirements simultaneo...
Two-ways Adaptive Failure Detection with the φ-Failure Detector
It is widely recognized that distributed systems would greatly benefit from the availability of a generic failure detection service. Such a service can however prove useful only if it can adapt simultaneously to changing network conditions and conflicting application requirements. This paper presents a novel approach to adaptive failure detectors, called φ-failure detectors, which dynamically a...
The Weakest Failure Detector for Solving Consensus
We determine what information about failures is necessary and sufficient to solve Consensus in asynchronous distributed systems subject to crash failures. In [CT91], we proved that ◇W, a failure detector that provides surprisingly little information about which processes have crashed, is sufficient to solve Consensus in asynchronous systems with a majority of correct processes. In this paper,...
Low-Overhead Accrual Failure Detector
Failure detectors are one of the fundamental components for building a distributed system with high availability. In order to maintain the efficiency and scalability of failure detection in a complicated large-scale distributed system, accrual failure detectors that can adapt to multiple applications have been studied extensively. In this paper, a new accrual failure detector, LA-FD, with low s...
Journal
Journal title: ACM Computing Surveys
Year: 2011
ISSN: 0360-0300,1557-7341
DOI: 10.1145/1883612.1883616