On optimal message vector length for block single parallel partition algorithm in a three-dimensional ADI solver
نویسندگان
چکیده
It has long been recognized that many direct parallel tridiagonal solvers are only efficient for solving a single tridiagonal equation of large sizes, and they become inefficient when naively used in a three-dimensional ADI solver. In order to improve the parallel efficiency of an ADI solver using a direct parallel solver, we implement the single parallel partition (SPP) algorithm in conjunction with message vectorization, which aggregates several communication messages into one to reduce the communication costs. The measured performances show that the longest allowable message vector length (MVL) is not necessarily the best choice. To understand this observation and optimize the performance, we propose an improved model that takes the cache effect into consideration. The optimal MVL for achieving the best performance is shown to depend on number of processors and grid sizes. Similar dependence of the optimal MVL is also found for the popular block pipelined method. 2009 Elsevier Inc. All rights reserved.
منابع مشابه
A Message-Passing Distributed Memory Parallel Algorithm for a Dual-Code Thin Layer, Parabolized Navier-Stokes Solver
In this study, the results of parallelization of a 3-D dual code (Thin Layer, Parabolized Navier-Stokes solver) for solving supersonic turbulent flow around body and wing-body combinations are presented. As a serial code, TLNS solver is very time consuming and takes a large part of memory due to the iterative and lengthy computations. Also for complicated geometries, an exceeding number of grid...
متن کاملNear-Minimum-Time Motion Planning of Manipulators along Specified Path
The large amount of computation necessary for obtaining time optimal solution for moving a manipulator on specified path has made it impossible to introduce an on line time optimal control algorithm. Most of this computational burden is due to calculation of switching points. In this paper a learning algorithm is proposed for finding the switching points. The method, which can be used for both ...
متن کاملThree-dimensional simulations of spiral waves in reacting and diffusing media on DSM computers
Parallel ADI and CG methods based on the time-linearization of the three-dimensional , reaction-diffusion equations which govern the propagation of spiral waves in excitable media, have been implemented on both shared-and distributed-memory computers. ADI has been implemented by means of a dynamic block cartesian decomposition and its efficiency tends to that of the CG technique as the mesh is ...
متن کاملA Parallel Three-Dimensional Incompressible Navier-Stokes Solver with a Parallel Multigrid Kernel
The development and applications of a parallel, time-dependent, three-dimensional incompressible Navier-Stokes flow solver and a parallel multigrid elliptic kernel are described. The flow solver is based on a second-order projection method applied to a staggered finite-difference grid. The multigrid algorithms implemented in the parallel elliptic kernel, which is used by the flow solver, are V-...
متن کاملAnalysis of Transient Flow in the Case of Secondary Injection for Transient Vector Control (RESEARCH NOTE)
The purpose of this paper is to analyze the flow field structure in transient state and performance of secondary injection system for thrust vectoring in divergent section of a two-dimensional nozzle. Secondary injection for thrust vectoring in a two-dimensional nozzle is studied by solving three-dimensional Reynolds-averaged equations by means of fluent solver. Spalart-allmaras model was used ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Applied Mathematics and Computation
دوره 215 شماره
صفحات -
تاریخ انتشار 2009