Search results for: parallel fault

Number of results: 285709

1994
Jörn Altmann Frank Balbach Axel Hein

The primary focus in the analysis of massively parallel supercomputers has traditionally been on their performance. However, their complex network topologies, large number of processors, and sophisticated system software can make them very unreliable. If every failure of one of the many components of a massively parallel computer could shut down the machine, the machine would be useless. Theref...

2010
Guozhang Wang

MapReduce [10] gives us an appropriate model for distributed parallel computing. Several of its features have proved useful: 1) centralized job distribution; 2) a fault tolerance mechanism for both masters and workers. Although there are controversies about MapReduce's capability to replace standard RDBMS [12, 13], it is reasonable that existing proposals to use MapReduce in relational data ...

2013
Minesh B. Amin

SPM.Python is a scalable, parallel fault-tolerant version of the serial Python language, and can be deployed to create parallel capabilities to solve problems in domains spanning finance, life sciences, electronic design, IT, visualization, and research. Software developers may use SPM.Python to augment new or existing (Python) serial scripts for scalability across parallel hardware. Alternativ...

Journal: :Future Generation Comp. Syst. 2008
Darius Buntinas Camille Coti Thomas Hérault Pierre Lemarinier Laurence Pilard Ala Rezmerita Eric Rodriguez Franck Cappello

A long-term trend in high-performance computing is the increasing number of nodes in parallel computing platforms, which entails a higher failure probability. Fault tolerant programming environments should be used to guarantee the safe execution of critical applications. Research in fault tolerant MPIs has led to the development of several fault tolerant MPI environments. Different approaches a...

2009
George Bosilca Camille Coti Thomas Hérault Pierre Lemarinier Jack J. Dongarra

Next-generation HPC platforms are expected to feature millions of cores distributed over hundreds of thousands of nodes, leading to scalability and fault-tolerance issues for both applications and the runtime environments dedicated to running on such machines. Most parallel applications are developed using a communication API such as MPI, implemented in a library that runs on top of a dedicated runtime...

1997
Masato Suzuki Takuya Katayama

We present a novel replication technique for parallel applications where instances of the replicated application are active on different groups of processors called replicas. The replication technique is based on the FTAG (Fault Tolerant Attribute Grammar) computation model. FTAG is a functional and attribute-based model [4]. The developed replication technique implements "active parallel replica...

Journal: :Software Engineering Journal 1991
Narain H. Gehani

Concurrent C is an upward-compatible parallel extension of C which runs on a variety of uniprocessors and multiprocessors. A Concurrent C program consists of a set of processes which execute in parallel and interact with each other by sending messages. Fault-Tolerant (FT) Concurrent C, an extension of Concurrent C, is a tool for writing fault-tolerant distributed programs, based on the replicat...

1998
Stergios I. Roumeliotis Gaurav S. Sukhatme George A. Bekey

This paper introduces a method to detect and identify faults in wheeled mobile robots. The idea behind the method is to use adaptive estimation to predict (in parallel) the outcome of several faults. Models of the system behavior under each type of fault are embedded in the various parallel estimators (each of which is a Kalman filter). Each filter is thus tuned to a particular fault. Using its e...

2015
Jing Zhao Xu Gao Bin Li Xiangdong Liu Xing Guan

Multi-phase motors are gaining more attention due to advantages such as good fault tolerance capability and high power density. By applying dual-rotor technology to multi-phase machines, a five-phase dual-rotor permanent magnet synchronous motor (DRPMSM) is researched in this paper to further improve their torque density and fault tolerance capability. It has two rotors and two se...

2000
Martin Kutrib Jan-Thomas Lowe

The general capabilities of fault-tolerant computations in one-way and two-way linear cellular arrays are investigated in terms of pattern recognition. The defective processing elements (cells) that cause the misoperations are assumed to behave as follows. Depending on the result of a self-diagnosis, they store their working state locally such that it becomes visible to the neighbors. A non-work...
