Search results for: many core

Number of results: 1139064  

Journal: Concurrency and Computation: Practice and Experience 2015
Pieter Hijma Rob van Nieuwpoort Ceriel J. H. Jacobs Henri E. Bal

ion increases portability (programs written for level perfect can run on both the GPU and the Xeon Phi), but incorporating hardware-specific details in the program can increase the performance significantly and also helps programmers to understand the performance in relation to the hardware.

2018
Tao Huang Detang Lu

In the past decades, multiple-point geostatistical (MPS) methods have been increasing in popularity in various fields. Compared with traditional techniques, MPS techniques can characterize geological reality that commonly has complex structures, such as curvilinear and long-range channels, by using high-order statistics for pattern reconstruction. As a result, the computational bu...

2014
Aaron Stillmaker Lucas Stillmaker Brent Bohnenstiehl Bevan Baas

As processors move from multi-core to many-core architectures, opportunities arise for energy-efficient enterprise computations, such as sorting, on large arrays of processors. This paper proposes three different energy-efficient sorting methods for the first phase of an external sort simulated on varying-sized fine-grained many-core processor arrays used as a co-processor to an Intel CPU, wh...

2008
Alexandra Fedorova Viren Kumar Vahid Kazempour Suprio Ray Pouya Alagheband

In this position paper, we present our vision for the scheduling infrastructure in a many-core hypervisor – the hypervisor targeted for many-core platforms. The key objectives of our system are scalability and heterogeneity-awareness. We see these as first-order objectives, because future many-core processors will consist of thousands of cores and those cores will be heterogeneous. Since existin...

2012
Siegfried Benkner Enes Bajrovic Erich Marth Martin Sandrieser Raymond Namyst Samuel Thibault

With the increasing architectural diversity of many-core architectures, the challenges of parallel programming and code portability will sharply rise. The EU project PEPPHER addresses these issues with a component-based approach to application development on top of a task-parallel execution model. Central to this approach are multi-architectural components which encapsulate different implementati...

2013
Gabor Drescher Timo Hönig Sebastian Maier Benjamin Oechslein Wolfgang Schröder-Preikschat

Number, variety, and organization of the on-chip processing elements of many-core processors demand a radical rethink in operating systems design. One may come from a multitude of allocatable units that bestows every execution thread its own core: single-threaded cores will be usual, multi-threaded cores will be unusual. The paper presents a scalability-aware kernel executive, Sake, that is cur...

2014
Jochem H. Rutgers

Processors incorporate more and more cores. With the increasing core count, it becomes harder to implement convenient features like atomic operations, ordering of all memory operations, and hardware cache coherency. When these features are not supported by the hardware, applications become more complex. This makes programming these many-core architectures hard. This thesis defines programming mo...

2013
Yi Liu Xinwei Zhang Yonghui Wang Depei Qian Yali Chen Jin Wu

Transactional memory is an appealing technology that frees programmers from lock-based programming. However, most current hardware transactional memory systems are proposed for multi-core processors and may face challenges as the number of processor cores increases in many-core systems, such as inefficient utilization of transactional buffers, unsolved problem of transactional buffer overf...

Journal: CoRR 2015
Fabian Gieseke Cosmin E. Oancea Ashish Mahabal Christian Igel Tom Heskes

A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are limited by the amount of points that can fit on a single device. In this work, we show how to modify the original data structure and the associated workflow...

Journal: CoRR 2016
Jan-Willem Buurlage Tom Bannink Abe Wits

The bulk-synchronous parallel (BSP) model provides a framework for writing parallel programs with predictable performance. In this paper we extend the BSP model to support what we will call pseudo-streaming algorithms for accelerators. We also generalize the BSP cost function to these algorithms, so that it is possible to predict the running time for programs targeting many-core accelerators an...
