gpu parallel computation

GPU-accelerated voxelwise hepatic perfusion quantification.

Journal: :Physics in medicine and biology 2012

H Wang Y Cao

Voxelwise quantification of hepatic perfusion parameters from dynamic contrast enhanced (DCE) imaging greatly contributes to assessment of liver function in response to radiation therapy. However, the efficiency of the estimation of hepatic perfusion parameters voxel-by-voxel in the whole liver using a dual-input single-compartment model requires substantial improvement for routine clinical app...

متن کامل

Parallelizing Apriori Algorithm on GPU

2016

K. Spandana

Parallel computing is a form of computation in which many calculations are carried out simultaneously, operating on the principle that large problems can often be divided into smaller ones, which are then solved concurrently. Now Graphics Processing Unit (GPU) has taken a major role in high performance computing for generic applications. Compute Unified Device Architecture (CUDA) programming mo...

متن کامل

Synkhronos: a Multi-GPU Theano Extension for Data Parallelism

Journal: :CoRR 2017

Adam Stooke Pieter Abbeel

We present Synkhronos, an extension to Theano for multi-GPU computations leveraging data parallelism. Our framework provides automated execution and synchronization across devices, allowing users to continue to write serial programs without risk of race conditions. The NVIDIA Collective Communication Library is used for high-bandwidth inter-GPU communication. Further enhancements to the Theano ...

متن کامل

NVIDA CUDA Architecture-Based Parallel SAT Solver

2010

Yandong Wang Alan Kaminsky James Heliotis

The SAT problem is the first NP-complete problem. So far there is no algorithm that can solve it in polynomial time. Over the past decade, the development of efficient and scalable algorithms has dramatically leveraged the ability of solving SAT problem instances involving tens of thousands of variables and millions of constraints. But as industry demand is increasing, a faster SAT solver is ne...

متن کامل

An Application-Oriented Approach for Accelerating Data-Parallel Computation with Graphics Processing Unit

2008

S. Ponce J. Huang S. I. Park C. Khoury Y. Cao F. Quek W. Feng

This paper presents a novel parallelization and quantitative characterization of various optimization strategies for dataparallel computation on a graphics processing unit (GPU) using NVIDIA’s new GPU programming framework, Compute Unified Device Architecture (CUDA). CUDA is an easy-to-use development framework that has drawn the attention of many different application areas looking for dramati...

متن کامل

Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology

2013

Jitendra Jain Jagdamb Behari Srivastava R. B. Singh

In this paper, we have proposed sequential and parallel Discrete Cosine Transform (DCT) in compute unified device architecture (CUDA) libraries. The introduction of programmable pipeline in the graphics processing units (GPU) has enabled configurability. GPU which is available in every computer has a tremendous feat of highly parallel SIMD processing, but its capability is often under-utilized....

متن کامل

Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement

2014

Han Xiao Yu-Pu Song Qing-Lei Zhou

With the development of satellite remote sensing technology, satellite remote sensing data obtained by the amount will increase rapidly. Consequently, the process of Wallis transformation is faced with such challenges as large data size, high intensity, high computational complexity and large computational quantity, and so on. A fast algorithm and efficient implementation of Wallis filtering ba...

متن کامل

Data Partitioning Strategy of GPU Heterogeneous Clusters Based on Learning

2016

Jianjiang Li Wei Chen Hongyan Zheng Peng Zhang Yajun Liu

With the rapid progress of computational science and computer simulation ability, a lot of properties can be predicted by the powerful ability of parallel computation before the actual research and development. With the development of high performance computer architecture, GPU is more and more widely used in high performance computation field as an emerging architecture, and a growing number o...

متن کامل

A Comparative Evaluation of the Gpu vs. the Cpu for Parallelization of Evolutionary Algorithms through Multiple Independent Runs

2017

Anna Syberfeldt Tom Ekblom

Multiple independent runs of an evolutionary algorithm in parallel are often used to increase the efficiency of parameter tuning or to speed up optimizations involving inexpensive fitness functions. A GPU platform is commonly adopted in the research community to implement parallelization, and this platform has been shown to be superior to the traditional CPU platform in many previous studies. H...

متن کامل

Accelerating high-order WENO schemes using two heterogeneous GPUs

Journal: Journal of Computational Applied Mechanics 2017

Hossein Mahmoodi Darian,

A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...

متن کامل