gpu parallel computation

Analysis of GPU Computation of Parabolic, Bessel, Wright and Riemann Zeta Functions

Journal: :ITM web of conferences 2021

This paper deals with GPU computing of special mathematical functions that are used in Fractional Calculus. The graphics processing unit (GPU) has grown to be an integral part nowadays’s mainstream structures. a novel parallel approach for NVIDIA’s hardware is speed up the algorithm. A comparison sequential code, vectorized code and performed. We have successfully reduced computation time using...

متن کامل

Title : The Application of Genetic Programming to General Purpose GPU Computation

2006

Eamon Phelan

Paper Abstract: The recent advances in GPU hardware have resulted in the wide availability of programmable commodity hardware suited to massively parallel computation. This paper examines the application of the evolutionary computation approach of Genetic Programming to explore the space of general purpose GPU programs.

متن کامل

Parallelized computation for computer simulation of electrocardiograms using personal computers with multi-core CPU and general-purpose GPU

Journal: :Computer methods and programs in biomedicine 2010

Wenfeng Shen Daming Wei Weimin Xu Xin Zhu Shizhong Yuan

Biological computations like electrocardiological modelling and simulation usually require high-performance computing environments. This paper introduces an implementation of parallel computation for computer simulation of electrocardiograms (ECGs) in a personal computer environment with an Intel CPU of Core (TM) 2 Quad Q6600 and a GPU of Geforce 8800GT, with software support by OpenMP and CUDA...

متن کامل

Analysis of GPU Parallel Computing based on Matlab

Journal: :CoRR 2015

Mingzhe Wang Bo Wang Qiu He Xiuxiu Liu Kunshuai Zhu

Mingzhe Wang, Bo Wang, Qiu He, Xiuxiu Liu, Kunshuai Zhu (School of Computer and Control Engineering, University of Chinese Academy of Sciences, Huairou, Beijing 101408, China) Abstract: Matlab is very widely used in scientific computing, but Matlab computational efficiency is lower than C language program. In order to improve the computing speed, some toolbox can use GPU to accelerate the compu...

متن کامل

A novel parallel algorithm for near-field computation in N-body problem on GPU

2011

Jianchen Shan Yongmei Lei

A novel efficient parallel algorithm for the near-field computation in N-body problem on the Graphics Processing Unit (GPU) architecture is proposed in this paper. This algorithm evolved from the BPB algorithm [1] which is proposed in the author’s previous work. This novel algorithm is based on the Newton’s third law and Z-order Space Filling Curve (Z-SFC). Half of the computations are reduced ...

متن کامل

APUNet: Revitalizing GPU as Packet Processing Accelerator

2017

Younghwan Go Muhammad Asim Jamshed YoungGyoun Moon Changho Hwang KyoungSoo Park

Many research works have recently experimented with GPU to accelerate packet processing in network applications. Most works have shown that GPU brings a significant performance boost when it is compared to the CPUonly approach, thanks to its highly-parallel computation capacity and large memory bandwidth. However, a recent work argues that for many applications, the key enabler for high perform...

متن کامل

Development for Parallel Hopfield Neural Network Implemented for GPU

2006

He Zheng

This paper presents some new implementations of parallel Hopfield neural network model -Cauchy machineon Graphic Processor Units (GPU). The main operators in the parallel Hopfield neural work are loaded into RGBA textures so that input calculation, output update and terminal condition check are implemented by fragment processors on GPU. In addition, the matrix-vector multiplication is realized ...

متن کامل

Comparison of Parallel CUDA and OpenMP Implementations of Particle Swarm Optimization

2014

O. Tolga Altinoz A. Egemen Yılmaz

Since the physical constraints on micro computing devices have forced the researchers to design next generation chips, the significance of the parallelization and distributed computing grow in importance. In this study, a sequential implementation of the Particle Swarm Optimization algorithm is converted into a concurrent version, which is executed on the cores of both CPU and GPU. For this rea...

متن کامل

LightScan: Faster Scan Primitive on CUDA Compatible Manycore Processors

Journal: :CoRR 2016

Yongchao Liu Srinivas Aluru

Scan (or prefix sum) is a fundamental and widely used primitive in parallel computing. In this paper, we present LightScan, a faster parallel scan primitive for CUDA-enabled GPUs, which investigates a hybrid model combining intrablock computation and inter-block communication to perform a scan. Our algorithm employs warp shuffle functions to implement fast intra-block computation and takes adva...

متن کامل

CC_deg_time_per_s_comparison.eps

2010

Frank Dehne Kumanan Yogaratnam

In this paper, we explore the limits of graphics processors (GPUs) for general purpose parallel computing by studying problems that require highly irregular data access patterns: parallel graph algorithms for list ranking and connected components. Such graph problems represent a worst case scenario for coalescing parallel memory accesses on GPUs which is critical for good GPU performance. Our e...

متن کامل