نتایج جستجو برای: checkpointing

تعداد نتایج: 2665  

2007
Gustavo Maciel Dias Vieira

Distributed checkpointing algorithms play an important role in the majority of the fault tolerant software components existent today. Unfortunately, there is a lack of comprehensive and uniform performance testing of those algorithms. Our research focuses on the provision of a toolkit, Metapromela, that helps with the implementation and testing of distributed checkpointing algorithms. This pape...

Journal: :DEStech Transactions on Computer Science and Engineering 2017

1997
Youngbae Kim James S. Plank Jack J. Dongarra

Recently, an algorithm-based approach using diskless checkpointing has been developed to provide fault tolerance for high-performance matrix operations. With this approach, since fault tolerance is incorporated into the matrix operations, the matrix operations become resilient to any single processor failure or change with low overhead. In this paper, we present a technique called multiple chec...

2006
Bidyut Gupta Namdar Mogharreban Shahram Rahimi A. Vemuri

In this paper, we have proposed a new checkpointing / recovery algorithm for ring network architecture. The checkpointing algorithm produces a consistent set of checkpoints in a uni-directional network with the help of few control messages and also avoids the overhead of taking temporary checkpoints unlike most other existing checkpointing algorithms. The number of interrupts to the processes i...

Journal: :The SIJ Transactions on Computer Networks & Communication Engineering 2013

Journal: :CoRR 2015
Nitinder Mohan Pushpendra Singh

We consider the problem of checkpointing a distributed application efficiently in Content Centric Networks so that it can withstand transient failures. We present CCNCheck, a system which enables a sender optimized way of checkpointing distributed applications in CCN’s and provides an efficient mechanism for failure recovery in such applications. CCNCheck’s checkpointing mechanism is a fork of ...

Journal: :IEEE Trans. Parallel Distrib. Syst. 2003
Francesco Quaglia Andrea Santoro

This paper describes a non-blocking checkpointing mode in support of optimistic parallel discrete event simulation. This mode allows real concurrency in the execution of state saving and other simulation specific operations (e.g. event list update, event execution), with the aim at removing the cost of recording state information from the completion time of the parallel simulation application. ...

1997
James S. Plank John G. Webster

Checkpointing is the act of saving the state of a running program so that it may be reconstructed later in time. It is an important basic functionality in computing systems that paves the way for powerful tools in many elds of computer science. This article provides a comprehensive overview of checkpointing in uniprocessor and parallel processing systems, including deenitions, uses of checkpoin...

Journal: :Scalable Computing: Practice and Experience 2016
Eszter Kail Krisztián Karóczkai Péter Kacsuk Miklós Kozlovszky

Smart systems in telemedicine frequently use intelligent sensor devices at large scale. Practitioners can monitor non-stop the vital parameters of hundreds of patients in real-time. The most important pillars of remote patient monitoring services are communication and data processing. Large scale data processing is done mainly using workflows. Some workflows are working in real-time, more compl...

1998
Kuo-Feng Ssu Bin Yao Nuno Ferreira Neves

Conclusions ~~ The limited stable storage available in mobile-computing environments can make traditional checkpointing and message logging umuitable. Since storage on a mobile liost is not considered stable, most protocols designed for these environments save the checkpoints on base stations. Previous approaches have assumed that the base station always has sufficient disk space for storing ch...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید