Data fusion using factor analysis and low-rank matrix completion

نویسندگان

چکیده

Data fusion involves the integration of multiple related datasets. The statistical file-matching problem is a canonical data in multivariate analysis, where objective to characterise joint distribution set variables when only strict subsets marginal distributions have been observed. Estimation covariance matrix full challenging given missing-data pattern. Factor analysis models use lower-dimensional latent data-generating process, and this introduces low-rank components complete-data population matrix. structure factor model can be exploited estimate from incomplete via completion. We prove identifiability under conditions on number factors shared over observed subsets. Additionally, we provide an EM algorithm for parameter estimation. On several real datasets, gives smaller reconstruction errors problems than common approaches

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low-Rank Matrix Completion

While datasets are frequently represented as matrices, real-word data is imperfect and entries are often missing. In many cases, the data are very sparse and the matrix must be filled in before any subsequent work can be done. This optimization problem, known as matrix completion, can be made well-defined by assuming the matrix to be low rank. The resulting rank-minimization problem is NP-hard,...

متن کامل

Low-Rank Matrix Completion using Nuclear Norm

5 Minimization of the nuclear norm is often used as a surrogate, convex relaxation, for finding 6 the minimum rank completion (recovery) of a partial matrix. The minimum nuclear norm 7 problem can be solved as a trace minimization semidefinite programming problem (SDP ). 8 The SDP and its dual are regular in the sense that they both satisfy strict feasibility. Interior 9 point algorithms are th...

متن کامل

Factor Matrix Nuclear Norm Minimization for Low-Rank Tensor Completion

Most existing low-n-rank minimization algorithms for tensor completion suffer from high computational cost due to involving multiple singular value decompositions (SVDs) at each iteration. To address this issue, we propose a novel factor matrix rank minimization method for tensor completion problems. Based on the CANDECOMP/PARAFAC (CP) decomposition, we first formulate a factor matrix rank mini...

متن کامل

Factor Matrix Trace Norm Minimization for Low-Rank Tensor Completion

Most existing low-n-rank minimization algorithms for tensor completion suffer from high computational cost due to involving multiple singular value decompositions (SVDs) at each iteration. To address this issue, we propose a novel factor matrix trace norm minimization method for tensor completion problems. Based on the CANDECOMP/PARAFAC (CP) decomposition, we first formulate a factor matrix ran...

متن کامل

Orthogonal Rank-One Matrix Pursuit for Low Rank Matrix Completion

In this paper, we propose an efficient and scalable low rank matrix completion algorithm. The key idea is to extend orthogonal matching pursuit method from the vector case to the matrix case. We further propose an economic version of our algorithm by introducing a novel weight updating rule to reduce the time and storage complexity. Both versions are computationally inexpensive for each matrix ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Statistics and Computing

سال: 2021

ISSN: ['0960-3174', '1573-1375']

DOI: https://doi.org/10.1007/s11222-021-10033-7