Search results for: fault recovery

Number of results: 262091

2009
G. Bertoni P. Castaldi N. Bertozzi M. Bonfè S. Simani

This paper addresses the development of a novel Active Fault Tolerant Control Scheme (AFTCS) which, when used with an independently designed guidance system, turns out to give an overall fault tolerant guidance and control system. This AFTCS methodology avoids a logic-based switching controller by exploiting an adaptive fault estimator whose design is based on the Non Linear Geometric Approach ...

Journal: Softw., Pract. Exper. 1999
Anthony Egan David Kutz Dmitry Mikulin Rami G. Melhem Daniel Mossé

Even though real-time systems have the stringent constraint of completing tasks before their deadlines, many existing real-time operating systems do not implement fault tolerance capabilities. In this paper we summarize a fault-tolerant real-time scheduling policy for dynamic tasks with ready times and deadlines. Our focus in this paper is the implementation, which includes fault-tolerant schedul...

2013
Lanyue Lu Andrea C. Arpaci-Dusseau Remzi H. Arpaci-Dusseau

Lanyue Lu presented isolation file systems, providing fault isolation and quick recovery within a single file system. Because file systems are important data access interfaces in many environments, high availability is critical; however, a single fault can trigger a large-scale impact for the whole file system, such as remounting as read-only and a system crash. Lanyue explained how a metadata ...

2012
K. C. Joshi

Fault tolerance is the ability of a system to perform its function correctly even in the presence of internal faults. We should accept that relying on software techniques to obtain dependability means accepting some overhead in terms of increased code size and reduced performance (slower execution). N-version programming achieves redundancy through the use of multiple versions. Failu...

2013
Rinku Gupta Kamil Iskra Kazutomo Yoshii Pavan Balaji Pete Beckman

Faults and errors are an unavoidable aspect of high performance computing systems. Emerging exascale systems will contain billions of hardware components and complex software stacks. In addition, higher fabrication density and power challenges will further compound fault detection, management and recovery. Efficient fault tolerance and resiliency frameworks are thus of immense importance in the...

Journal: IEICE Transactions 2011
Lihong Shang Mi Zhou Yu Hu Erfu Yang

Field programmable gate arrays (FPGAs) are widely used in reliability-critical systems due to their reconfiguration ability. However, with the shrinking device feature size and increasing die area, nowadays FPGAs can be deeply affected by the errors induced by electromigration and radiation. To improve the reliability of FPGA-based reconfigurable systems, a permanent fault recovery approach usi...

1998
Samuel Norman Hamilton Alex Orailoglu

Increasing chip density combined with heightened reliability expectations has spawned greater interest in fault tolerant design. In recent years, research into rollback and retry techniques has established them as an effective approach to recovery from transient and intermittent faults. For applications with strict timing requirements, however, the high error latency inherent in retry approaches...

2002
Zhang Youhui

This paper presents a Checkpoint-based Rollback Recovery and Migration System for Message Passing Interface, ChaRM4MPI, for Linux clusters. Several important fault-tolerance mechanisms are designed and implemented in this system, including a coordinated checkpointing protocol, synchronized rollback recovery, and process migration. Owing to ChaRM4MPI, transient node faults can be recove...

1995
Mohamed F. Younis Grace Tsai Thomas J. Marlowe Alexander D. Stoyen

Achieving fault-tolerance using a primary-backup approach involves overhead of recovery such as activating the backup and propagating execution states, which may affect the timeliness properties of real-time systems. We propose a semi-passive architecture for fault-tolerance and show that speculative execution can enhance overall performance and hence shorten the recovery time in the presence of...

2004
Naruemon Wattanapongsakorn

Reliability enhancement in software systems is a crucial and challenging issue. Applying an efficient fault-tolerant mechanism can fulfill the system reliability requirement. This paper proposes reliability models for hierarchical and hybrid fault-tolerant software systems considering failure dependencies or related faults in software components/versions. Our system models are based on the classica...
