An Incomplete Data Approach to the Ecological Inference Problem

نویسندگان

  • Kosuke Imai
  • Ying Lu
چکیده

In this paper, we propose to formulate ecological inference as a coarse data problem where only a subset of the complete-data sample space is observed. Applying the related assumptions and theoretical results of Heitjan and Rubin (1991), we formally identify three key factors that affect ecological inference; distributional, contextual and aggregation effects. Different modeling strategies are discussed to deal with distributional and contextual effects. While aggregation effects cannot be statistically adjusted, we show how to formally quantify the magnitude of such effects through the use of the Expectation-Maximization algorithm. The paper concludes with simulations and empirical applications that assess the performance of the proposed models. C-code used to implement the proposed method is available with easy-to-use R interface.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Marginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data

A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...

متن کامل

Bayesian and Likelihood Inference for 2 3 2 Ecological Tables: An Incomplete-Data Approach

Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed in...

متن کامل

Fuzzy Inference System Approach in Deterministic Seismic Hazard, Case Study: Qom Area, Iran

Seismic hazard assessment like many other issues in seismology is a complicated problem, which is due to a variety of parameters affecting the occurrence of an earthquake. Uncertainty, which is a result of vagueness and incompleteness of the data, should be considered in a rational way. Using fuzzy method makes it possible to allow for uncertainties to be considered. Fuzzy inference system,...

متن کامل

Fuzzy Inference System Approach in Deterministic Seismic Hazard, Case Study: Qom Area, Iran

Seismic hazard assessment like many other issues in seismology is a complicated problem, which is due to a variety of parameters affecting the occurrence of an earthquake. Uncertainty, which is a result of vagueness and incompleteness of the data, should be considered in a rational way. Using fuzzy method makes it possible to allow for uncertainties to be considered. Fuzzy inference system,...

متن کامل

A Novel Type-2 Adaptive Neuro Fuzzy Inference System Classifier for Modelling Uncertainty in Prediction of Air Pollution Disaster (RESEARCH NOTE)

Type-2 fuzzy set theory is one of the most powerful tools for dealing with the uncertainty and imperfection in dynamic and complex environments. The applications of type-2 fuzzy sets and soft computing methods are rapidly emerging in the ecological fields such as air pollution and weather prediction. The air pollution problem is a major public health problem in many cities of the world. Predict...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005