GPGPU Performance Tuning – An illustrated example
نویسنده
چکیده
This tutorial describes some common techniques to improve performance of GPU-based implementations in linear algebra applications. The example presented here is a Jacobi iteration (commonly used as a smoother in multigrid scenarios) on a sparse matrix arising from Finite Element discretizations of standard operators. However, none of that advanced background is neccessary to understand the GPU-specific examples given here.
منابع مشابه
Automatic Performance Tuning of SpMV on GPGPU
Sparse Matrix-Vector Multiplication (SpMV) is an important computational kernel in scientific applications that tends to perform poorly on modern processors because of irregular memory accesses. GPU have evolved into a very attractive hardware platform for general purpose computations due to their high floating-point computation performance, which results in that GPGPU becomes the hot and popul...
متن کاملPerformance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU)
General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread...
متن کاملOn the Limits of GPU Acceleration
This paper throws a small “wet blanket” on the hot topic of GPGPU acceleration, based on experience analyzing and tuning both multithreaded CPU and GPU implementations of three computations in scientific computing. These computations—(a) iterative sparse linear solvers; (b) sparse Cholesky factorization; and (c) the fast multipole method—exhibit complex behavior and vary in computational intens...
متن کاملAdvanced Optimizations of An Implicit Navier-Stokes Solver on GPGPU
General-purpose computing on graphics processing units (GPGPU) is a massive fine-grain parallel computation platform, which is is particularly attractive for CFD tasks due to its potential of one or two magnitudes of performance improvement with relatively low capital investment. Many successful attempts have been reported in recent years (see, for example [1, 2, 3, 4, 5, 6]). Although early at...
متن کاملImplementing Cfd (computational Fluid Dynamics) in Opencl for Building Simulation
Though researchers in computer graphics have started to use the GPGPU (General Purposed Graphics Processing Unit) method to speed up their procedural programs, these techniques are seldom used in the building simulation field. It is possible to apply the GPGPU method to many simulation scenarios (i.e. human evacuation, shadow simulation) to speed up performance. In this paper, CFD is used as an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005