نتایج جستجو برای: gpu parallel computation

تعداد نتایج: 358612  

2011
Ákos Szlávecz Gábor Hesz Tamás Bükki Balázs Benyó

Parallel projection based Single Photon Emission Computed Tomography (SPECT) is one of the most widely used nuclear imaging technique even nowadays. Serious artefacts are produced in the reconstructed images due to the non-homogeneous attenuation medium and the distance dependent spatial resolution (DDSR) of the parallel imaging. Effective non-uniform attenuation correction and DDSR reduction p...

2012
Min Li

(ABSTRACT) With the advances of very large scale integration (VLSI) technology, the feature size has been shrinking steadily together with the increase in the design complexity of logic circuits. As a result, the efforts taken for designing, testing, and debugging digital systems have increased tremendously. Although the electronic design automation (EDA) algorithms have been studied extensivel...

Journal: :Journal of Parallel and Distributed Computing 2021

• Multi-GPU and Unified Memory implementation of the Multi-Zone NAS Benchmarks. Analysis programmability performance effects Memory. per-GPU allocation have similar programming efforts. Unified-Memory version outperforms manual from 1.1x to 1.85x. GPU-based computing systems become a widely accepted solution for high-performance-computing (HPC) domain. GPUs shown highly competitive performance-...

2015
Ying Yang Chunfang Wang Yuyu Gao Jingwei Xing

Improving terrain tile data selection efficiency, real-time loading of visible tile data and building GPU-based continuous Level of Details (LOD) are the key technologies for large scale terrain rendering on GPU. In this article, in order to reduce terrain tile data selection time, we build double layers tile quad tree for massive terrain data and organize tile data by designing Z-order space f...

2009
Andrew Thall

The Lucas-Lehmer test provides a deterministic algorithm for testing whether, for a prime number p, Mp = 2−1 is also a prime number. The current work demonstrates that this test can be effectively implemented on a parallel graphics processing unit (GPU). The parallelization was achieved by two main parallel methods: (1) fast multiplication using parallel Fast Fourier transforms in extended prec...

2015
Zahid Ansari Asif Afzal Sudarshan Nayak

The advent of high performance computing (HPC) and graphics processing units (GPU), present an enormous computation resource for large data transactions (big data) that require parallel processing for robust and prompt data analysis. In this paper, we take an overview of four parallel programming models, OpenMP, CUDA, MapReduce, and MPI. The goal is to explore literature on the subject and prov...

Journal: :Computer-Aided Design 2013
Liang Shuai Xiaohu Guo Miao Jin

Periodic centroidal Voronoi tessellation (CVT) in hyperbolic space provides a nice theoretical framework for computing the constrained CVT on high-genus (genus > 1) surfaces. This paper addresses two computational issues related to such hyperbolic CVT framework: (1) efficient reduction of unnecessary site copies in neighbor domains on the universal covering space, based on two special rules; (2...

Journal: :CoRR 2017
Attilio Fiandrotti Sophie Fosson Chiara Ravazzi Enrico Magli

Compressive sensing promises to enable bandwidth-efficient on-board compression of astronomical data by lifting the encoding complexity from the source to the receiver. The signal is recovered off-line, exploiting GPUs parallel computation capabilities to speedup the reconstruction process. However, inherent GPU hardware constraints limit the size of the recoverable signal and the speedup pract...

2014
Marcos Novalbos Jaime Gonzalez Miguel A. Otaduy Roberto Martinez Alberto Sanchez

Molecular dynamics simulations allow us to study the behavior of complex biomolecular systems by modeling the pairwise interaction forces between all atoms. Molecular systems are subject to slowly decaying electrostatic potentials, which turn molecular dynamics into an n-body problem. In this paper, we present a parallel and scalable solution to compute long-range molecular forces, based on the...

2015
Yang Su Zhijie Xu Xiangqian Jiang

The discrete wavelet transform (DWT) has been extensively studied and developed in various scientific and engineering fields. The multiresolution and local nature of the DWT facilitates applications requiring progressiveness and the capture of high-frequency details. However, the intensive computation of DWT caused by multilevel filtering/down-sampling will become a significant bottleneck in re...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید