High order accurate simulation of compressible flows on GPU clusters over Software Distributed Shared Memory
نویسندگان
چکیده
The advent of multicore processors during the past decade and especially the recent introduction of many-core Graphics Processing Units (GPUs) open new horizons to large-scale, high-resolution simulations for a broad range of scientific fields. Residing at the forefront of advancements in multiprocessor technology, GPUs are often chosen as co-processors when intensive parts of applications need to be computed. Among the various domains, the scientific area of Computational Fluid Dynamics (CFD) is a potential candidate that could significantly benefit from the utilization of many-core GPUs. In order to investigate this possibility, we herein evaluate the performance of a high order accurate method for the simulation of compressible flows. Targeting computer systems with multiple GPUs, the current implementation and the respective performance evaluation are taking place on a GPU cluster. With respect to using these GPUs, this paper offers an alternative to the mainstream approach of message passing by considering shared memory abstraction. In the implementations presented in this paper, the updates on shared data are not explicitly coded by the programmer across the simulation phases, but are propagated through Software Distributed Shared Memory (SDSM). This way, we intend to preserve a unified memory view that extends the memory hierarchy from the node level to the cluster level. Such an extension could significantly facilitate the porting of multithreaded codes at GPU clusters. Our results indicate that the presented approach is competitive with the message passing paradigm and they lay grounds for further research on the use of shared memory abstraction for future GPU clusters. 2014 Elsevier Ltd. All rights reserved.
منابع مشابه
An OpenMP Programming Toolkit for Hybrid CPU/GPU Clusters Based on Software Unified Memory
Recently, hybrid CPU/GPU cluster has drawn much attention from the researchers of high performance computing because of amazing energy efficiency and adaptable resource exploitation. However, the programming of hybrid CPU/GPU clusters is very complex because it requires users to learn new programming interfaces such as CUDA and OpenCL, and combine them with MPI and OpenMP. To address this probl...
متن کاملEfficient Parallel Algorithm for Direct Numerical Simulation of Turbulent Flows
A distributed algorithm for a high-order-accurate finite-difference approach to the direct numerical simulation (DNS) of transition and turbulence in compressible flows is described. This work has two major objectives. The first objective is to demonstrate that parallel and distributed-memory machines can be successfully and efficiently used to solve computationally intensive and input/output i...
متن کاملAccelerating high-order WENO schemes using two heterogeneous GPUs
A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...
متن کاملGPU computing of compressible flow problems by a meshless method with space-filling curves
A graphic processing unit (GPU) implementation of a meshless method for solving compressible flow problems is presented in this paper. Least-square fit is used to discretise the spatial derivatives of Euler equations and an upwind scheme is applied to estimate the flux terms. The compute unified device architecture (CUDA) C programming model is employed to efficiently and flexibly port the mesh...
متن کاملA Second Order Accurate Method in Simulation of Underwater Explosion
In this paper, a numerical scheme is proposed for the multi-fluid compressible flows. This method is applied to the problem of underwater explosion. The proposed scheme is basically the extension of Godunov method in gas dynamic problems to the multifluid environments and is second-order accurate in space. In this method, also, the problem of artificial mixing of two different phases on Euleria...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014