نتایج جستجو برای: opencl

تعداد نتایج: 807  

Journal: :CoRR 2016
Dong Wang Jianjing An Ke Xu

Convolutional neural networks (CNNs) have been widely employed in many applications such as image classification, video analysis and speech recognition. Being computeintensive, CNN computations are mainly accelerated by GPUs with high power dissipations. Recently, studies were carried out exploiting FPGA as CNN accelerator because of its reconfigurability and energy efficiency advantage over GP...

Journal: :CoRR 2013
Dino Distefano Jérémy Dubreil

We present an automatic analysis technique for checking data races on OpenCL kernels. Our method defines symbolic execution techniques based on separation logic with suitable abstractions to automatically detect non-benign racy behaviours on kernels.

Journal: :CoRR 2015
John Wickerson Mark Batty

We study how the C11 memory model can be simplified and how it can be extended. Our first contribution is to propose a mild strengthening of the model that enables the rules pertaining to sequentially-consistent (SC) operations to be significantly simplified. We eliminate one of the total orders that candidate executions must range over, leading to a model that is significantly faster to simula...

2011
Jianbin Fang Ana Lucia Varbanescu

While many-core processors offer multiple layers of hardware parallelism to boost performance, applications are lagging behind in exploiting them effectively. A typical example is vector parallelism(SIMD), offered by many processors, but used by too few applications. In this paper we discuss two different strategies to enable the vectorization of naive OpenCL kernels. Further, we show how these...

2015
Yogendra Sharma Dinesh B. Kulkarni

The Weather Research and Forecasting model (WRF) is a simulating system developed for atmospheric weather prediction. WRF model is used for both operational as well as research purposes. The need for accurate weather and climate simulation to be carried out in shorter time is increasing day by day, which leads to the acceleration of existing Numerical Weather Prediction (NWP) system. This paper...

2016
Kazutomo Yoshii Hal Finkel Franck Cappello

The end of Moore’s law creates a significant turning point for computer architecture. Today, performance is largely limited by energy, power, and cooling. Heterogeneity and radical new architecture designs are keys to achieving higher energy proportionality. In mobile computing, heterogeneity is well adopted in system-on-chip designs (e.g., to improve battery life). In high-performance computin...

2014

This paper presented the single kernel multiple devices (SKMD) system, a framework that transparently orchestrates collaborative execution of a single data-parallel kernel across multiple asymmetric CPUs and GPUs. SKMD is an abstraction layer located between applications and the OpenCL library. It uses OpenCL as the intermediate language. SKMD transparently partitions an OpenCL kernel across mu...

Journal: :CoRR 2016
Ricardo Tapiador Antonio Rios-Navarro Alejandro Linares-Barranco Minkyu Kim Deepak Kadetotad Jae-sun Seo

Deep learning has significantly advanced the state of the art in artificial intelligence, gaining wide popularity from both industry and academia. Special interest is around Convolutional Neural Networks (CNN), which take inspiration from the hierarchical structure of the visual cortex, to form deep layers of convolutional operations, along with fully connected classifiers. Hardware implementat...

Journal: :Journal of Signal Processing Systems 2018

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید