Optimistic Message Logging for Independent Checkpointing in Message-Passing Systems
نویسندگان
چکیده
Message-passing systems with communication protocol transparent to the applications typically require message logging to ensure consistency between checkpoints. This paper describes a periodic independent checkpointing scheme with optimistic logging to reduce performance degradation during normal execution while keeping the recovery cost acceptable. Both time and space overhead for message logging can be reduced by detecting messages that need not be logged. A checkpoint space reclamation algorithm is presented to reclaim all checkpoints which are not useful for any possible future recovery. Communication trace-driven simulation for several hypercube programs is used to evaluate the techniques.
منابع مشابه
An Efficient Optimistic Message Logging Scheme for the Recoverable Mobile Computing Systems
This paper presents an efficient scheme to implement the optimistic message logging and the asynchronous recovery for the mobile computing environment. Most of the coordinated checkpointing schemes may not be suitable for the mobile environment, since the unreliable mobile hosts and the fragile network connection may hinder any kind of coordination for checkpointing and recovery. In this paper,...
متن کاملAn Asynchronous Recovery Scheme based on Optimistic Message Logging for the Mobile Computing Systems
To provide the fault-tolerance for the mobile computing systems, many checkpointing-based recovery schemes have been proposed. However, considering the nature of the mobile environment in which some mobile hosts (MHs) are often disconnected from the network and the probability of concurrent failures on MHs is high, any kind of coordination during the checkpointing and even during the recovery m...
متن کاملUsing Message Semantics for Fast-Output Commit in Checkpointing-and-Rollback Recovery
Checkpointing is a very effective technique to ensure the continuity of long-running applications in the occurrence of failures. However, one of the handicaps of coordinated checkpointing is the high latency for committing output from the application to the external world. Enhancing the checkpointing scheme with a message logging protocol is a good solution to reduce the output latency. The ide...
متن کاملWhy Optimistic Message Logging Has Not Been UsedIn
Much of the literature on message logging and checkpointing in the past decade has been based on a so-called optimistic approach 1] that places more emphasis on failure-free overhead than recovery ee-ciency. Our experience has shown that most telecommunications systems use a pessimistic approach because the main purpose of using message logging and checkpointing is to achieve fast and localized...
متن کاملImproving Message Logging Protocols Scalability through Distributed Event Logging
Message logging is an attractive solution to provide fault tolerance for message passing applications because it is more scalable than coordinated checkpointing. Sender-based message logging is a well known optimization that allows to save messages payload in the sender memory and so only the events corresponding to message receptions have to be logged reliably using an event logger. In existin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992