Accelerating advanced preconditioning methods on hybrid architectures
نویسندگان
چکیده
Many problems, in diverse areas of science and engineering, involve the solution largescale sparse systems linear equations. In most these scenarios, they are also a computational bottleneck, therefore their efficient on parallel architectureshas motivated tremendous volume research.This dissertation targets use GPUs to enhance performance using iterative methods complemented with state-of-the-art preconditioned techniques. particular, we study ILUPACK, package for via Krylov subspace that relies modern inverse-based multilevel ILU (incomplete LU) preconditioning technique.We present new data-parallel versions preconditioner important solvers contained significantly improve its without affecting accuracy. Additionally existing task-parallel ILUPACK shared- distributed-memory inclusion GPU acceleration. The results obtained show sensible reduction runtime methods, as well possibility addressing large-scale problems efficiently.
منابع مشابه
Accelerating Particle Image Velocimetry Using Hybrid Architectures
High Performance Computing (HPC) applications are mapped to a cluster of multi-core processors communicating using high speed interconnects. More computational power is harnessed with the addition of hardware accelerators such as Graphics Processing Unit (GPU) cards and Field Programmable Gate Arrays (FPGAs). Particle Image Velocimetry (PIV) is an embarrassingly parallel application that can be...
متن کاملAccelerating Direction-Optimized Breadth First Search on Hybrid Architectures
Large scale-free graphs are famously difficult to process efficiently: the skewed vertex degree distribution makes it difficult to obtain balanced partitioning. Our research instead aims to turn this into an advantage by partitioning the workload to match the strength of the individual computing elements in a Hybrid, GPU-accelerated architecture. As a proof of concept we focus on the direction-...
متن کاملAccelerating the computation of FLAPW methods on heterogeneous architectures
Legacy codes in computational science and engineering have been very successful in providing essential functionality to researchers. However, they are not capable of exploiting the massive parallelism provided by emerging heterogeneous architectures. The lack of portable performance and scalability puts them at high risk: either they evolve or they are doomed to disappear. One example of legacy...
متن کاملAccelerating the Hybrid Monte Carlo algorithm with ILU preconditioning
The pseudofermion action of the Hybrid Monte Carlo (HMC) algorithm for dynamical fermions is modified to directly incorporate Incomplete LU (ILU) factorisation. This reduces the stochastic noise and allows a larger molecular dynamics step-size to be taken, cutting the computational cost. Numerical tests using the two-flavour Schwinger model are presented, where a two-step ILU preconditioning of...
متن کاملAccelerating Discrete Wavelet Transforms on Parallel Architectures
The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorithms. Until recently, several studies have compared the performance of such transform on various shared-memory parallel architectures, especially on graphics processing units (GPUs). All these studies, however, considered only separable calculation schemes. We show that corresponding separable part...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Clei Electronic Journal
سال: 2021
ISSN: ['0717-5000']
DOI: https://doi.org/10.19153/cleiej.24.1.6