processor blocking

Holistic analysis of asynchronous real-time transactions with earliest deadline scheduling

Journal: :J. Comput. Syst. Sci. 2007

Rodolfo Pellizzoni Giuseppe Lipari

In distributed real-time systems, an application is often modeled as a set of real-time transactions, where each transaction is a chain of precedence-constrained tasks. Each task is statically allocated to a processor, and tasks allocated on the same processor are handled by a single-processor scheduling algorithm. Precedence constraints among tasks of the same transaction are modeled by proper...

متن کامل

Exploiting Application-Level Information to Reduce Memory Bandwidth Consumption

2002

Deepak Agarwal Donald Yeung

As processors continue to deliver higher levels of performance and as memory latency tolerance techniques become widespread to address the increasing cost of accessing memory, memory bandwidth will emerge as a major performance bottleneck. Rather than rely solely on wider and faster memories to address memory bandwidth shortages, an alternative is to use existing memory bandwidth more efficient...

متن کامل

Technical Report no . 2002 - 02 NOBLE : A Non - Blocking Inter - Process Communication Library 1 Håkan

2002

Håkan Sundell Philippas Tsigas

Many applications on shared memory multi-processor machines can benefit from the exploitation of parallelism that non-blocking synchronization offers. In this paper, we introduce a library support for multi-process non-blocking synchronization called NOBLE. NOBLE provides an interprocess communication interface that allows the user to select synchronization methods transparently to the one that...

متن کامل

Technical Report no . 2002 - 02 NOBLE : A Non - Blocking Inter - Process Communication Library 1 Håkan Sundell

2002

Håkan Sundell Philippas Tsigas

Many applications on shared memory multi-processor machines can benefit from the exploitation of parallelism that non-blocking synchronization offers. In this paper, we introduce a library support for multi-process non-blocking synchronization called NOBLE. NOBLE provides an interprocess communication interface that allows the user to select synchronization methods transparently to the one that...

متن کامل

Towards a Software Transactional Memory for Graphics Processors

2010

Daniel Cederman Philippas Tsigas Muhammad Tayyab Chaudhry

The introduction of general purpose computing on many-core graphics processor systems, and the general shift in the industry towards parallelism, has created a demand for ease of parallelization. Software transactional memory (STM) simplifies development of concurrent code by allowing the programmer to mark sections of code to be executed concurrently and atomically in an optimistic manner. In ...

متن کامل

Challenges with Probabilities in Response-Time Analysis of Real-Time Systems

2015

Thomas Nolte Meng Liu Björn Lisper

The Worst-Case Response-Time (WCRT) of a task is the maximum time after activation within which the task will finish its execution. The WCRT is constructed by the Worst-Case Execution Time (WCET) of the task itself, as well as the interference of other higher priority tasks, along with potential blocking by lower priority tasks if blocking is allowed by the task model. The interference of other...

متن کامل

FastForward for Efficient Pipeline Parallelism ; CU-CS-1028-07

2015

John Giacomoni Tipp Moseley Manish Vachharajani

High-rate core-to-core communication is critical for efficient pipeline-parallel software architectures. This paper presents the FastForward system, a software-only lowoverhead high-rate queue implementation for pipeline parallelism on multicore architectures. FastForward uses an architecturally-tuned domain-specific adaptation of concurrent lock-free queues to provide low-latency and lowoverhe...

متن کامل

Multicore-optimized wavefront diamond blocking for optimizing stencil updates

Journal: :SIAM J. Scientific Computing 2015

Tareq M. Malas Georg Hager Hatem Ltaief Holger Stengel Gerhard Wellein David E. Keyes

The importance of stencil-based algorithms in computational science has focused attention on optimized parallel implementations for multilevel cache-based processors. Temporal blocking schemes leverage the large bandwidth and low latency of caches to accelerate stencil updates and approach theoretical peak performance. A key ingredient is the reduction of data traffic across slow data paths, es...

متن کامل

Optimizing Transformations of Stencil Operations for Parallel Cache-based Architectures

1999

Federico Bassetti Kei Davis

This paper describes a new technique for optimizing serial and parallel stencil-and stencil-like operations for cache-based architectures. This technique takes advantage of the semantic knowledge implicitly in stencil-like computations. The technique is implemented as a source-to-source program transformation; because of its speci-city it could not be expected of a conventional compiler. Empiri...

متن کامل

A Practical Multi-word Compare-and-Swap Operation

2002

Timothy L. Harris Keir Fraser Ian Pratt

Work on non-blocking data structures has proposed extending processor designs with a compare-and-swap primitive, CAS2, which acts on two arbitrary memory locations. Experience suggested that current operations, typically single-word compare-and-swap (CAS1), are not expressive enough to be used alone in an eÆcient manner. In this paper we build CAS2 from CAS1 and, in fact, build an arbitrary mul...

متن کامل