Accelerating advanced preconditioning methods on hybrid architectures

نویسندگان

چکیده

Many problems, in diverse areas of science and engineering, involve the solution largescale sparse systems linear equations. In most these scenarios, they are also a computational bottleneck, therefore their efficient on parallel architectureshas motivated tremendous volume research.This dissertation targets use GPUs to enhance performance using iterative methods complemented with state-of-the-art preconditioned techniques. particular, we study ILUPACK, package for via Krylov subspace that relies modern inverse-based multilevel ILU (incomplete LU) preconditioning technique.We present new data-parallel versions preconditioner important solvers contained significantly improve its without affecting accuracy. Additionally existing task-parallel ILUPACK shared- distributed-memory inclusion GPU acceleration. The results obtained show sensible reduction runtime methods, as well possibility addressing large-scale problems efficiently.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accelerating Particle Image Velocimetry Using Hybrid Architectures

High Performance Computing (HPC) applications are mapped to a cluster of multi-core processors communicating using high speed interconnects. More computational power is harnessed with the addition of hardware accelerators such as Graphics Processing Unit (GPU) cards and Field Programmable Gate Arrays (FPGAs). Particle Image Velocimetry (PIV) is an embarrassingly parallel application that can be...

متن کامل

Accelerating Direction-Optimized Breadth First Search on Hybrid Architectures

Large scale-free graphs are famously difficult to process efficiently: the skewed vertex degree distribution makes it difficult to obtain balanced partitioning. Our research instead aims to turn this into an advantage by partitioning the workload to match the strength of the individual computing elements in a Hybrid, GPU-accelerated architecture. As a proof of concept we focus on the direction-...

متن کامل

Accelerating the computation of FLAPW methods on heterogeneous architectures

Legacy codes in computational science and engineering have been very successful in providing essential functionality to researchers. However, they are not capable of exploiting the massive parallelism provided by emerging heterogeneous architectures. The lack of portable performance and scalability puts them at high risk: either they evolve or they are doomed to disappear. One example of legacy...

متن کامل

Accelerating the Hybrid Monte Carlo algorithm with ILU preconditioning

The pseudofermion action of the Hybrid Monte Carlo (HMC) algorithm for dynamical fermions is modified to directly incorporate Incomplete LU (ILU) factorisation. This reduces the stochastic noise and allows a larger molecular dynamics step-size to be taken, cutting the computational cost. Numerical tests using the two-flavour Schwinger model are presented, where a two-step ILU preconditioning of...

متن کامل

Accelerating Discrete Wavelet Transforms on Parallel Architectures

The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorithms. Until recently, several studies have compared the performance of such transform on various shared-memory parallel architectures, especially on graphics processing units (GPUs). All these studies, however, considered only separable calculation schemes. We show that corresponding separable part...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Clei Electronic Journal

سال: 2021

ISSN: ['0717-5000']

DOI: https://doi.org/10.19153/cleiej.24.1.6