Nonparametric Stein-type shrinkage covariance matrix estimators in high-dimensional settings

نویسنده

  • Anestis Touloumis
چکیده

Estimating a covariance matrix is an important task in applications where the number of variables is larger than the number of observations. In the literature, shrinkage approaches for estimating a high-dimensional covariance matrix are employed to circumvent the limitations of the sample covariance matrix. A new family of nonparametric Stein-type shrinkage covariance estimators is proposed whose members are written as a convex linear combination of the sample covariance matrix and of a predefined invertible target matrix. Under the Frobenius norm criterion, the optimal shrinkage intensity that defines the best convex linear combination depends on the unobserved covariance matrix and it must be estimated from the data. A simple but effective estimation process that produces nonparametric and consistent estimators of the optimal shrinkage intensity for three popular target matrices is introduced. In simulations, the proposed Stein-type shrinkage covariance matrix estimator based on a scaled identity matrix appeared to be up to 80% more efficient than existing ones in extreme high-dimensional settings. A colon cancer dataset was analyzed to demonstrate the utility of the proposed estimators. A rule of thumb for adhoc selection among the three commonly used target matrices is recommended. Keywords— Covariance matrix, High-dimensional settings, Nonparametric estimation, Shrinkage estimation

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differenced-Based Double Shrinking in Partial Linear Models

Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...

متن کامل

Shrinkage Estimators for High-Dimensional Covariance Matrices

As high-dimensional data becomes ubiquitous, standard estimators of the population covariance matrix become difficult to use. Specifically, in the case where the number of samples is small (large p small n) the sample covariance matrix is not positive definite. In this paper we explore some recent estimators of sample covariance matrices in the large p, small n setting namely, shrinkage estimat...

متن کامل

Comments on: Augmenting the bootstrap to analyze high dimensional genomic data Connections between the augmented bootstrap and the shrinkage covariance estimator

In their enlightening and stimulating paper Svitlana Tyekucheva and Francesca Chiaromonte propose an “augmented bootstrap” (AB) approach to estimate covariance structure in high-dimensional data. They show that the AB estimator performs well in a catalog of examples. Moreover, according to the authors no assumption of a sparsity rationale is made. This is in contrast to a competing and computat...

متن کامل

Multi - Target Shrinkage

Stein showed that the multivariate sample mean is outperformed by “shrinking” to a constant target vector. Ledoit and Wolf extended this approach to the sample covariance matrix and proposed a multiple of the identity as shrinkage target. In a general framework, independent of a specific estimator, we extend the shrinkage concept by allowing simultaneous shrinkage to a set of targets. Applicati...

متن کامل

The Stein phenomenon for monotone incomplete multivariate normal data

We establish the Stein phenomenon in the context of two-step, monotone incomplete data drawn from Np+q(μ,Σ), a (p+ q)-dimensional multivariate normal population with mean μ and covariance matrix Σ. On the basis of data consisting of n observations on all p+q characteristics and an additional N − n observations on the last q characteristics, where all observations are mutually independent, denot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 83  شماره 

صفحات  -

تاریخ انتشار 2015