Characterization and transformation of unstructured control flow in bulk synchronous GPU applications

نویسندگان

  • Haicheng Wu
  • Gregory Frederick Diamos
  • Jin Wang
  • Si Li
  • Sudhakar Yalamanchili
چکیده

In this paper we identify important classes of program control flows in applications targeted to commercially available graphics processing units (GPUs) and characterize their presence in real workloads such as those that occur in CUDA and OpenCL. Broadly, control flow can be characterized as structured or unstructured. It is shown that most existing techniques for handling divergent control in bulk synchronous GPU applications handle structured control flow efficiently, some are incapable of executing unstructured control flow directly, and none handles unstructured control flow efficiently. An approach to reduce the impact of this problem is provided. An unstructured-to-structured control flow transformation for CUDA kernels is implemented and its performance impact on a large class of GPU applications is assessed. The results quantify the importance of improving support for programs with unstructured control flow on GPUs. The transformation can also be used in a JIT compiler pass to execute programs with unstructured control flow on the GPU devices that do not support unstructured control flow. This is an important capability for execution portability of applications using GPU accelerators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Characterization and Transformation of Unstructured Control Flow in GPU Applications

Hardware and compiler techniques for mapping data-parallel programs with divergent control flow to SIMD architectures have recently enabled the emergence of new GPGPU programming models such as CUDA and OpenCL. Although this technology is widely used, commodity GPUs use different schemes to implement it, and the performance limitations of these different schemes under real workloads are not wel...

متن کامل

Pressure-Velocity Coupled Finite Volume Solution of Steady Incompressible Invscid Flow Using Artificial Compressibility Technique

Application of the computer simulation for solving the incompressible flow problems motivates developing efficient and accurate numerical models. The set of Inviscid Incompressible Euler equations can be applied for wide range of engineering applications. For the steady state problems, the equation of continuity can be simultaneously solved with the equations of motion in a coupled manner using...

متن کامل

Effective Resource-Driven Loop Splitting for Large Unstructured Mesh Applications on GPUs

Unstructured mesh applications are widely used in science and industry for simulating phenomena as diverse as turbomachinery components of jet engines and blood flow in arteries. These are examples of irregular applications that are difficult to optimize for accelerator targets such as GPUs. Splitting loops is a standard technique used for optimizing GPU applications. It breaks down large compl...

متن کامل

Fuzzy Logic-Based Vector Control of Permanent Magnet Synchronous Motor Using Stacked Matrix Converter for Railway Traction Applications

Recently, Permanent Magnet Synchronous Motors (PMSMs) have been widely accepted and employed in traction and railway transportation applications due to their various advantages such as small inertia, high efficiency and high torque density. In this paper, in order to use these motors as traction drives in an effective way, the vector control scheme is employed. In this strategy, the stator curr...

متن کامل

Two-Dimensional Boundary-Conforming Orthogonal Grids for External and Internal Flows Using Schwarz-Christoffel Transformation

In this paper, a Schwarz-Christoffel method for generating two-dimensional grids for a variety of complex internal and external flow configurations based on the numerical integration procedure of the Schwarz-Christoffel transformation has been developed by using Mathematica, which is a general purpose symbolic-numerical-graphical mathematics software. This method is highly accurate (fifth order...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJHPCA

دوره 26  شماره 

صفحات  -

تاریخ انتشار 2012