Non-Gaussian Statistical Models and Their Applications
نویسنده
چکیده
Statistical modeling plays an important role in various research areas. It provides a way to connect the data with the statistics. Based on the statistical properties of the observed data, an appropriate model can be chosen that leads to a promising practical performance. The Gaussian distribution is the most popular and dominant probability distribution used in statistics, since it has an analytically tractable Probability Density Function (PDF) and analysis based on it can be derived in an explicit form. However, various data in real applications have bounded support or semi-bounded support. As the support of the Gaussian distribution is unbounded, such type of data is obviously not Gaussian distributed. Thus we can apply some non-Gaussian distributions, e.g., the beta distribution, the Dirichlet distribution, to model the distribution of this type of data. The choice of a suitable distribution is favorable for modeling efficiency. Furthermore, the practical performance based on the statistical model can also be improved by a better modeling. An essential part in statistical modeling is to estimate the values of the parameters in the distribution or to estimate the distribution of the parameters, if we consider them as random variables. Unlike the Gaussian distribution or the corresponding Gaussian Mixture Model (GMM), a non-Gaussian distribution or a mixture of non-Gaussian distributions does not have an analytically tractable solution, in general. In this dissertation, we study several estimation methods for the non-Gaussian distributions. For the Maximum Likelihood (ML) estimation, a numerical method is utilized to search for the optimal solution in the estimation of Dirichlet Mixture Model (DMM). For the Bayesian analysis, we utilize some approximations to derive an analytically tractable solution to approximate the distribution of the parameters. The Variational Inference (VI) framework based method has been shown to be efficient for approximating the parameter distribution by several researchers. Under this framework, we adapt the conventional Factorized Approximation (FA) method to the Extended Factorized Approximation (EFA) method and use it to approximate the parameter distribution in the beta distribution. Also, the Local Variational Inference (LVI) method is applied to approximate the predictive distribution of the beta distribution. Finally, by assigning a beta distribution to each element in the matrix, we proposed a variational Bayesian Nonnegative Matrix Factorization (NMF) for bounded support data. The performances of the proposed non-Gaussian model based methods are evaluated by several experiments. The beta distribution and the Dirichlet distribution are applied to model the Line Spectral Frequency (LSF) representation of the Linear Prediction (LP) model for statistical model based speech coding. For some image processing applications, the beta distribution is also applied. The proposed beta distribution based variational Bayesian NMF is applied for image restoration and collaborative filtering. Compared to some conventional statistical model based methods, the non-Gaussian model based methods show a promising improvement.
منابع مشابه
Conditional Dependence in Longitudinal Data Analysis
Mixed models are widely used to analyze longitudinal data. In their conventional formulation as linear mixed models (LMMs) and generalized LMMs (GLMMs), a commonly indispensable assumption in settings involving longitudinal non-Gaussian data is that the longitudinal observations from subjects are conditionally independent, given subject-specific random effects. Although conventional Gaussian...
متن کاملSpatial Latent Gaussian Models: Application to House Prices Data in Tehran City
Latent Gaussian models are flexible models that are applied in several statistical applications. When posterior marginals or full conditional distributions in hierarchical Bayesian inference from these models are not available in closed form, Markov chain Monte Carlo methods are implemented. The component dependence of the latent field usually causes increase in computational time and divergenc...
متن کاملParameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation
Spatial generalized linear mixed models are used commonly for modelling non-Gaussian discrete spatial responses. We present an algorithm for parameter estimation of the models using Laplace approximation of likelihood function. In these models, the spatial correlation structure of data is carried out by random effects or latent variables. In most spatial analysis, it is assumed that rando...
متن کاملNon-Gaussian Multivariate Statistical Models and their Applications
In recent years, the extension of classical Gaussian-based statistical models to non-Gaussian ones has received sustained attention and generated many research topics. There have been various approaches to develop multivariate non-Gaussian distributions. Two main directions to approach such extensions consist of multivariate skew-symmetric distributions and of copula models. Skew-symmetric dist...
متن کاملSpatial Interpolation Using Copula for non-Gaussian Modeling of Rainfall Data
‎One of the most useful tools for handling multivariate distributions of dependent variables in terms of their marginal distribution is a copula function‎. ‎The copula families capture a fair amount of attention due to their applicability and flexibility in describing the non-Gaussian spatial dependent data‎. ‎The particular properties of the spatial copula are rarely ...
متن کاملA Statistical Study of two Diffusion Processes on Torus and Their Applications
Diffusion Processes such as Brownian motions and Ornstein-Uhlenbeck processes are the classes of stochastic processes that have been investigated by researchers in various disciplines including biological sciences. It is usually assumed that the outcomes of these processes are laid on the Euclidean spaces. However, some data in physical, chemical and biological phenomena indicate that they cann...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011