نتایج جستجو برای: fault tolerance

تعداد نتایج: 176292  

2014
Jasbir Kaur

Fault tolerance in cloud computing platforms and applications is a crucial issue. This issue is especially difficult since cloud computing relies by nature on a complex splitting into many layers. This paper analyses the implementation of fault tolerance in such a complex cloud computing environment with a focus on FCFS and SJF along with MPIL method with fault tolerance property. The proposed ...

Journal: :Parallel Processing Letters 2011
Blesson Varghese Gerard T. McKee Vassil Alexandrov

The work reported in this paper is motivated towards validating an alternative approach for fault tolerance over traditional methods like checkpointing that constrain efficacious fault tolerance. Can agent intelligence be used to achieve fault tolerant parallel computing systems? If so, “What agent capabilities are required for fault tolerance?”, “What parallel computational tasks can benefit f...

Journal: :CoRR 2016
Vincenzo De Florio

The central topic of this book is application-level fault-tolerance, that is the methods, architectures, and tools that allow to express a fault-tolerant system in the application software of our computers. Application-level fault-tolerance is a sub-class of software fault-tolerance that focuses on the problems of expressing the problems and solutions of fault-tolerance in the top layer of the ...

1995
Kevin Kwiat Salim Hariri

EEcient use of redundant computing resources is desirable in fault tolerance. To meet its fault tolerance requirements, a task may need all the computing modules; but when a lesser amount of fault tolerance is suucient, then multiple tasks should be allowed to execute concurrently. Systems supporting this type of operation usually have their fault tolerance controlled by software. Fault toleran...

2015

This paper describes the software approach of fault tolerance for shared memory multi core system using PLR.PLR uses a software-centric approach transient fault tolerance which ensuring a correct software execution. This scheme is used at user space level which does not necessitate changes to the original application.PLR create a set of redundant process per application process. In this scheme ...

2009
Rui Borralho Pedro Fontes Ana Antunes Fernando Morgado Dias

FTSET is a software tool that deals with fault tolerance of Artificial Neural Networks. This tool is capable of evaluating the fault tolerance degree of a previously trained Artificial Neural Network given its inputs ranges, the weights and the architecture. The FTSET is also capable of improving the fault tolerance by applying a technique of splitting the connections of the network that are mo...

2017
Alexander Romanovsky Alessandro Garcia

Software maintenance is a crucial phase of the software development life cycle. It is important to facilitate this stage, complying with both functional and non-functional requirements. However, very often the main focus is made on the functional features of the application, whereas fault tolerance mechanisms are neglected and as a result do not provide sufficient maintainability and reusabilit...

1992
Walter L. Heimerdinger Charles B. Weinstock

A major problem in transitioning fault tolerance practices to the practitioner community is a lack of a common view of what fault tolerance is, and how it can help in the design of reliable computer systems. This document takes a step towards making fault tolerance more understandable by proposing a conceptual framework. The framework provides a consistent vocabulary for fault tolerance concept...

2011
Joshua Hursey Thomas Naughton Geoffroy Vallée Richard L. Graham

The lack of fault tolerance is becoming a limiting factor for application scalability in HPC systems. The MPI does not provide standardized fault tolerance interfaces and semantics. The MPI Forum’s Fault Tolerance Working Group is proposing a collective fault tolerant agreement algorithm for the next MPI standard. Such algorithms play a central role in many fault tolerant applications. This pap...

Journal: :Theor. Comput. Sci. 2012
Ali Ebnenasir Reza Hajisheykhi Sandeep S. Kulkarni

Due to their increasing complexity, today’s SoC (System on Chip) systems are subject to a variety of faults (e.g., soft errors, component crash, etc.), thereby making fault tolerance a highly important property of such systems. However, designing fault tolerance is a complex task in part due to the large scale of integration of SoC systems and different levels of abstraction provided by modern ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید