Search results for: fault recovery

Number of results: 262091

2002
P. Narasimhan C. Reverte S. Ratanotayanon

The Middleware for Embedded Adaptive Dependability (MEAD) infrastructure enhances large-scale distributed real-time embedded middleware applications with novel capabilities, including (i) transparent, yet tunable, fault tolerance in real time, (ii) proactive dependability, (iii) resource-aware system adaptation to crash, communication, partitioning and timing faults with (iv) scalable and fast ...

2008
Claudia Rusu Cristian Grecu Lorena Anghel

In this paper we propose a dynamically reconfigurable failure recovery scheme developed for Network-on-Chip (NoC) based systems. The recovery scheme is based on a checkpointing and rollback protocol and permits enhancing the system fault tolerance capabilities by exploiting information on traffic load and failure rate. The increased performance of the fault tolerance mechanism is achieved by si...
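
As a rough illustration of the checkpointing-and-rollback idea named in this abstract, the sketch below saves state periodically and restores the last saved state after one injected failure. It is not the paper's NoC protocol: the class `CheckpointedTask`, the function `run`, and the parameters `checkpoint_every` and `fail_at` are hypothetical names chosen for this example, and the checkpoint interval is fixed here rather than tuned from traffic load or failure rate.

```python
import copy


class CheckpointedTask:
    """Holds mutable state plus one saved copy of it (hypothetical helper)."""

    def __init__(self, state):
        self.state = state
        self._saved = copy.deepcopy(state)

    def checkpoint(self):
        # Save a deep copy so later mutations do not touch the checkpoint.
        self._saved = copy.deepcopy(self.state)

    def rollback(self):
        # Restore the most recent checkpoint.
        self.state = copy.deepcopy(self._saved)


def run(steps, checkpoint_every, fail_at):
    """Sum `steps`, checkpointing every `checkpoint_every` steps.

    A single failure is injected just before step index `fail_at`;
    the task then rolls back and re-executes the lost steps.
    Returns (final_sum, number_of_re_executed_steps).
    """
    task = CheckpointedTask({"sum": 0, "next": 0})
    failed_once = False
    redone = 0
    while task.state["next"] < len(steps):
        i = task.state["next"]
        if i == fail_at and not failed_once:
            failed_once = True
            task.rollback()
            redone += i - task.state["next"]
            continue
        task.state["sum"] += steps[i]
        task.state["next"] = i + 1
        if task.state["next"] % checkpoint_every == 0:
            task.checkpoint()
    return task.state["sum"], redone


total, redone = run([1, 2, 3, 4, 5], checkpoint_every=2, fail_at=3)
```

A shorter interval reduces the work repeated after a failure but increases the number of checkpoints taken during failure-free execution, which is the trade-off an adaptive scheme would try to balance.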

2008
Ch. D. V. Subba Rao M. M. Naidu

Checkpointing schemes facilitate fault recovery in distributed systems. The two-level fault recovery scheme for distributed systems inherits the merits of both disk-based and diskless checkpointing schemes. The present work extends James S. Plank's diskless checkpointing scheme (N+1 Parity) by introducing a 'Timeout' to checkpoint programs with high locality of reference. This mechanism enables ap...
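
The N+1 parity idea mentioned in this abstract can be sketched as follows: N processes each hold an in-memory checkpoint, one extra node holds their bytewise XOR, and any single lost checkpoint can be rebuilt from the survivors plus the parity. This is a minimal sketch assuming equal-length checkpoints; the function names `xor_bytes`, `parity_of`, and `recover` are hypothetical, and the 'Timeout' extension described in the abstract is not modeled.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length buffers."""
    if len(a) != len(b):
        raise ValueError("checkpoints must have equal length")
    return bytes(x ^ y for x, y in zip(a, b))


def parity_of(checkpoints: list[bytes]) -> bytes:
    """Parity checkpoint held by the extra (+1) node."""
    return reduce(xor_bytes, checkpoints)


def recover(surviving: list[bytes], parity: bytes) -> bytes:
    """Rebuild the single missing checkpoint from survivors and parity."""
    return reduce(xor_bytes, surviving, parity)


# Three application processes (N = 3) with equal-length checkpoints.
checkpoints = [b"state-A0", b"state-B1", b"state-C2"]
parity = parity_of(checkpoints)

# Simulate losing the process at index 1 and rebuilding its checkpoint.
lost = 1
surviving = [c for i, c in enumerate(checkpoints) if i != lost]
restored = recover(surviving, parity)
```

Because XOR is its own inverse, the scheme tolerates exactly one lost checkpoint per parity group; losing two at once cannot be repaired from a single parity buffer.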

2003
Sandeep S. Kulkarni Ali Ebnenasir

In this paper, we focus on automated techniques to enhance the fault-tolerance of a nonmasking fault-tolerant program to masking. A masking program continually satisfies its specification even if faults occur. By contrast, a nonmasking program merely guarantees that after faults stop occurring, the program recovers to states from where it continually satisfies its specification. Until the recov...

2016
Aiman Fang Ignacio Laguna Kento Sato Tanzima Islam Kathryn Mohror

Future high-performance computing systems may face frequent failures with their rapid increase in scale and complexity. Resilience to faults has become a major challenge for large-scale applications running on supercomputers, which demands fault tolerance support for prevalent MPI applications. Among failure scenarios, process failures are one of the most severe issues as they usually lead to t...

2007
Lawrence R. Klos Golden G. Richard Zhidong Xu

Julep is an object-oriented testbed designed for analysis and comparison of temporal diversity fault tolerance mechanisms. It is written in Java, and runs as a layer underneath a distributed application. Julep can run on any standard COTS platform with a JVM, in homogeneous or heterogeneous environments. Julep is designed to quickly and easily incorporate new process recovery mechanisms, allowi...

1997
Edgar Nett Michael Mock

A central problem in the design of fault-tolerant real-time systems is that desirable fault-tolerance properties are usually realized by mechanisms that counteract real-time guarantees. A prominent example is the All-or-Nothing property (also known as failure atomicity) known from transactions. This property is normally realized by means of isolation and roll-back recovery. However, isolation ...

Journal: IJAPUC 2015
Zhenpeng Xu Hairong Chen Weini Zeng

Mobile computing systems introduce many new characteristics, such as mobility, disconnections, finite power sources, vulnerability to physical damage, and lack of stable storage. Since the related log-based rollback recovery fault-tolerant schemes may still lead to dramatic performance loss in failure-free or inconsistent recovery caused by the fault, a hybrid log-based fault-tolerant scheme...

2002
Myron Hecht Herbert Hecht

© 2000 IEEE. This paper describes the design and implementation of a software infrastructure for real-time fault tolerance for applications on long duration deep space missions. The infrastructure has advanced capabilities for Adaptive Fault Tolerance (AFT), i.e., the ability to change the recovery strategy based on the failure history, available resources, an...
