Evaluating Scalability of the 2-D FFT on Parallel Computers
Authors
Abstract
Parallel computers have demonstrated a remarkable potential for achieving high performance at a reasonable cost for many computer vision and image processing (CVIP) applications. A major obstacle to the use of parallel computers is the lack of a universally accepted metric to study the scalability of parallel algorithms and architectures. In this paper, we apply different scalability measures to various 2-D FFT algorithms and target architectures and compare the expected performance to the measured results. A number of algorithms in computer vision and image processing exhibit regular communication patterns similar to the 2-D FFT. We can therefore extrapolate our observations to determine which aspects of these measures are relevant to the scalability analysis of other similar image processing algorithms.
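As a rough illustration of the kind of scalability measure the abstract refers to, the sketch below computes the two textbook quantities, speedup (serial time divided by parallel time) and efficiency (speedup divided by processor count). The abstract does not say which measures the paper actually uses, so this is an assumption for illustration only; the function names and the timing values are hypothetical and are not results from the paper.

```python
def speedup(t_serial, t_parallel):
    """Speedup: serial run time divided by parallel run time."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, num_procs):
    """Efficiency: speedup divided by the number of processors used."""
    return speedup(t_serial, t_parallel) / num_procs


# Hypothetical run times in seconds for one fixed image size,
# keyed by processor count. These are made-up illustration values.
timings = {1: 8.0, 2: 4.4, 4: 2.5, 8: 1.6}

t1 = timings[1]
table = {
    p: (speedup(t1, tp), efficiency(t1, tp, p))
    for p, tp in timings.items()
}

for p in sorted(table):
    s, e = table[p]
    print(f"p={p:2d}  speedup={s:5.2f}  efficiency={e:4.2f}")
```

With timings like these, efficiency falls as processors are added at a fixed problem size, which is the behavior that scalability metrics are meant to quantify.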
Similar resources
Parallel implementation and scalability analysis of 3D Fast Fourier Transform using 2D domain decomposition
3D FFT is computationally intensive and at the same time requires global or collective communication patterns. The efficient implementation of FFT on extreme scale computers is one of the grand challenges in scientific computing. On parallel computers with a distributed memory, different domain decompositions are possible to scale 3D FFT computation. In this paper, we argue that 2D domain decom...
The Scalability of FFT on Parallel Computers
In this paper, we present the scalability analysis of the parallel Fast Fourier Transform algorithm on mesh and hypercube connected multicomputers using the isoefficiency metric. The isoefficiency function of an algorithm-architecture combination is defined as the rate at which the problem size should grow with the number of processors to maintain a fixed efficiency. On the hypercube architecture, ...
Scalability of Parallel Spatial Direct Numerical Simulations on Intel Hypercube and IBM SP1 and SP2
The implementation and performance of a parallel spatial direct numerical simulation (PSDNS) approach on the Intel iPSC/860 hypercube and IBM SP1 and SP2 parallel computers are documented. Spatially evolving disturbances associated with the laminar-to-turbulent transition in boundary-layer flows are computed with the PSDNS code. The feasibility of using the PSDNS to perform transition studies on t...
A High-Performance FFT Algorithm for Vector Supercomputers
Many traditional algorithms for computing the fast Fourier transform (FFT) on conventional computers are unacceptable for advanced vector and parallel computers because they involve nonunit, power-of-two memory strides. This paper presents a practical technique for computing the fast Fourier transform that completely avoids all such strides and appears to be near-optimal for a variety of curren...
Parallel scaling of Teter’s minimization for Ab Initio calculations
We propose a parallelization scheme for the conjugate gradient method by Teter et al. and report a detailed analysis of its scalability. We use MPI collective operations exclusively to take advantage of optimized collective implementations with possible hardware support. Our parallel conjugate gradient calculation can be applied in addition to the already implemented parallelism in the applica...
Publication date: 1993