Efficient Reduction from Block Hessenberg Form to Hessenberg Form Using Shared Memory
نویسندگان
چکیده
A new cache-efficient algorithm for reduction from block Hessenberg form to Hessenberg form is presented and evaluated. The algorithm targets parallel computers with shared memory. One level of look-ahead in combination with a dynamic load-balancing scheme significantly reduces the idle time and allows the use of coarse-grained tasks. The coarse tasks lead to high-performance computations on each processor/core. Speedups close to 13 over the sequential unblocked algorithm have been observed on a dual quad-core machine using one thread per core.
منابع مشابه
Parallel Reduction from Block Hessenberg to Hessenberg using MPI
In many scientific applications, eigenvalues of a matrix have to be computed. By first reducing a matrix from fully dense to Hessenberg form, eigenvalue computations with the QR algorithm become more efficient. Previous work on shared memory architectures has shown that the Hessenberg reduction is in some cases most efficient when performed in two stages: First reduce the matrix to block Hessen...
متن کاملParallel Reduction of a Block Hessenberg-Triangular Matrix Pair to Hessenberg-Triangular Form—Algorithm Design and Performance Results
The design, implementation and performance of a parallel algorithm for reduction of a matrix pair in block upper Hessenberg-Triangular form (Hr, T ) to upper Hessenberg-triangular form (H, T ) is presented. This reduction is the second stage in a two-stage reduction of a regular matrix pair (A, B) to upper Hessenberg-Triangular from. The desired upper Hessenberg-triangular form is computed usin...
متن کاملParallel Two-Stage Hessenberg Reduction using Tile Algorithms for Multicore Architectures
This paper describes a parallel Hessenberg reduction in the context of multicore architectures using tile algorithms. The Hessenberg reduction is very often used as a pre-processing step in solving dense linear algebra problems, such as the standard eigenvalue problem. Although expensive, orthogonal transformations are accepted techniques and commonly used for this reduction because they guaran...
متن کاملBlocked Algorithms for Reduction of a Regular Matrix Pair to Generalized Schur Form
This contribution considers the problem of transforming a regular matrix pair (A;B) to generalized Schur form. The focus is on blocked algorithms for the reduction process that typically includes two major steps. The rst is a two-stage reduction of a regular matrix pair (A;B) to condensed form (H;T ) using orthogonal transformations Q and Z such that H = QAZ is upper Hessenberg and T = QBZ is u...
متن کاملEfficient Algorithm for Simultaneous Reduction to the m-Hessenberg–Triangular–Triangular Form
This paper proposes an efficient algorithm for simultaneous reduction of three matrices. The algorithm is a blocked version of the algorithm described by Miminis and Page (1982) which reduces A to the m-Hessenberg form, and B and E to the triangular form. The m-Hessenberg– triangular–triangular form of matrices A, B and E is specially suitable for solving multiple shifted systems. Such shifted ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010