Effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models for modeling motor vehicle crashes: a Bayesian perspective
نویسنده
چکیده
There has been considerable research conducted on the development of statistical models for predicting motor vehicle crashes on highway facilities. Over the last few years, there has been a significant increase in the application hierarchical Bayes methods for modeling motor vehicle crash data. Whether the inferences are estimated using classical or Bayesian methods, the most common probabilistic structure used for modeling this type of data remains the traditional Poisson-gamma (or Negative Binomial) model. Crash data collected for highway safety studies often have the unusual attributes of being characterized by low sample mean values and, due to the prohibitive costs of collecting data, small sample sizes. Previous studies have shown that the dispersion parameter of Poisson-gamma models can be seriously mis-estimated when the models are estimated using the maximum likelihood estimation (MLE) method for these extreme conditions. Despite important work done on this topic for the MLE, nobody has so far examined how low sample mean values and small sample sizes affect the posterior mean of the dispersion parameter of Poisson-gamma models estimated using the hierarchical Bayes method. The inverse dispersion parameter plays an important role in various types of highway safety studies. It is therefore vital to determine the conditions in which the inverse dispersion parameter may be mis-estimated for this category of models. To accomplish the objectives of this study, a simulation framework is developed to generate data from the Poisson-gamma distributions using different values describing the mean, the dispersion parameter, the sample size, and the prior specification. Vague and non-vague prior specifications are tested for determining the magnitude of the biases introduced by low sample mean values and small sample sizes. A series of datasets are also simulated from the Poisson-lognormal distributions, in the light of recent work done by statisticians on this mixed distribution. The study shows that a dataset characterized by a low sample mean combined with a small sample size can seriously affect the estimation of the posterior mean of the dispersion parameter when a vague prior specification is used to characterize the gamma hyper-parameter. The risk of a mis-estimated posterior mean can be greatly minimized when an appropriate non-vague prior distribution is used. Finally, the study shows that Poisson-lognormal models are recommended over Poissongamma models when assuming vague priors and whenever crash data characterized by low sample mean values are used for developing crash prediction models.
منابع مشابه
Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter.
There has been considerable research conducted on the development of statistical models for predicting crashes on highway facilities. Despite numerous advancements made for improving the estimation tools of statistical models, the most common probabilistic structure used for modeling motor vehicle crashes remains the traditional Poisson and Poisson-gamma (or Negative Binomial) distribution; whe...
متن کاملEffects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models: A Bayesian Perspective
There has been considerable research conducted on the development of statistical models for predicting motor vehicle crashes on highway facilities. Many of these developments were performed for the likelihood-based or frequentist modeling approach. Over the last few years, there has been a significant increase in the application hierarchical Bayes method for modeling motor vehicle crashes. Whet...
متن کاملBayesian Inference for Spatial Beta Generalized Linear Mixed Models
In some applications, the response variable assumes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. A beta distribution represents a flexible density family on (0, 1) interval that covers symm...
متن کاملExamining the Application of Aggregated and Disaggregated Poisson-gamma Models Subjected to Low Sample Mean Bias
The costs of collecting crash and other related data can be very prohibitive. As a result, these data can often only be collected at a limited number of sites. One way to increase the sample size for developing reliable statistical models is to collect data at the same sites for a long time period. Two general classes of models have been proposed for modeling crash data using such datasets: dis...
متن کاملEffects of the Varying Dispersion Parameter of Poisson-gamma models on the estimation of Confidence Intervals of Crash Prediction models
The most common probabilistic structure of the models used by transportation safety analysts for modeling motor vehicle crashes are the traditional Poisson and Poissongamma (or Negative Binomial) distributions. Since crash data have been shown to exhibit over-dispersion, Poisson-gamma models are usually preferred over Poisson regression models. Up until recently, the dispersion parameter of Poi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007