A New Much Faster and Simpler Algorithm for Lapack Dgels
نویسنده
چکیده
We present new algorithms for computing the linear least squares solution to overde-termined linear systems and the minimum norm solution to underdetermined linear systems. For both problems, we consider the standard formulation min kAX ? BkF and the transposed formulation min kA T X ? BkF , i.e, four diierent problems in all. The functionality of our implementation corresponds to that of the LAPACK routine DGELS. The new implementation is signiicantly faster and simpler. It outperforms the LAPACK DGELS for all matrix sizes tested. The improvement is usually 50{ 100% and it is as high as 400%. The four diierent problems of DGELS are essentially reduced to two, by use of explicit transposition of A. By explicit transposition we avoid computing Householder transformations on vectors with large stride. The QR factorization of block columns of A is performed using a recursive level-3 algorithm. By interleaving updates of B with the factorization of A, we reduce the number of oating point operations performed for the linear least squares problem. By avoiding redundant computations in the update of B we reduce the work needed to compute the minimum norm solution. Finally, we outline fully recursive algorithms for the four problems of DGELS as well as for QR factorization.
منابع مشابه
A Faster and Simpler Recursive Algorithm for the Lapack Routine Dgels
We present new algorithms for computing the linear least squares solution to overdetermined linear systems and the minimum norm solution to underdetermined linear systems. For both problems, we consider the standard formulation min ‖AX−B‖F and the transposed formulation min ‖AX−B‖F , i.e, four different problems in all. The functionality of our implementation corresponds to that of the LAPACK r...
متن کاملA high-performance algorithm for the linear least squares problem on SMP systems
We present new recursive serial and parallel algorithms for the linear least squares problem AX = B, where A is m by n, m n. The algorithms improve performance. This work is an extension of our work on QR factorization 4]. The key idea is to combine the computation of Q T B with the QR factorization, thereby saving computations compared to the standard LAPACK algorithm. Recursion allows us to r...
متن کاملLAPACK-Style Codes for Pivoted Cholesky and QR Updating
Routines exist in LAPACK for computing the Cholesky factorization of a symmetric positive definite matrix and in LINPACK there is a pivoted routine for positive semidefinite matrices. We present new higher level BLAS LAPACK-style codes for computing this pivoted factorization. We show that these can be many times faster than the LINPACK code. Also, with a new stopping criterion, there is more r...
متن کاملNumerical Algorithms for Linear and Nonlinear AlgebraExperience with a Recursive Perturbation
Recursive algorithms for symmetric indeenite linear systems are considered in the present paper. First, the diiculties with the recur-sive formulation of the LAPACK SYSV algorithm (which implements the Bunch-Kaufman pivoting strategy) are discussed. Then a recursive perturbation based algorithm is proposed and tested. The experiments show that the new algorithm can be about two times faster alt...
متن کاملA New Algorithm for Optimum Voltage and Reactive Power Control for Minimizing Transmission Lines Losses
Reactive power dispatch for voltage profile modification has been of interest Abstract to powerr utilities. Usually local bus voltages can be altered by changing generator voltages, reactive shunts, ULTC transformers and SVCs. Determination of optimum values for control parameters, however, is not simple for modern power system networks. Heuristic and rather intelligent algorithms have to be so...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000