Estimating Spatial Intensity and Variation in Risk from Locations Coarsened by Incomplete Geocoding
نویسنده
چکیده
The estimation of spatial intensity and relative risk are important inference problems in spatial epidemiologic studies. A standard component of data assimilation in these studies is the assignment of a geocode, i.e. point-level spatial coordinates, to the address of each subject in the study population. Unfortunately, when geocoding is performed by the pervasive method of street-segment matching to a georeferenced road file and subsequent interpolation, it is rarely completely successful. Typically, 10% to 30% of the addresses in the study population fail to geocode, potentially leading to a selection bias called geographic bias. Missing-data methods might be considered for dealing with this; however, since there is almost always some geographic information coarser than a point (e.g. a zip code) measured for the observations that fail to geocode, a coarsened-data analysis is more appropriate. This article develops coarsened-data spatial epidemiologic methods for use with incompletely geocoded data, which can reduce or even eliminate geographic bias. In particular, existing completedata methods for estimating intensity and variation in relative risk are modified so as to exploit the coarsened data. Both nonparametric (kernel smoothing) and likelihood-based estimation procedures are considered. The success of these procedures relies on modeling and estimating a function called the geocoding propensity function, to which considerable attention is given. Models based on the degree of rurality are featured, as it is well-known that the propensity of rural addresses to geocode is much lower than for non-rural addresses. Advantages of the coarsened-data analyses are demonstrated empirically.
منابع مشابه
Estimating Spatial Intensity and Variation in Risk from Locations Subject to Geocoding Errors
The accurate assignment of geocodes to the residences of subjects in a study population is an important component of the data acquisition/assimilation stage of a spatial epidemiological investigation. Unfortunately, however, it is not a simple matter to obtain accurate point-level geocodes. Recent investigations have demonstrated that when residential address geocoding is performed by the most ...
متن کاملIncomplete geocoding and spatial sampling: The effects of locational errors on population total estimation
Due to the increasing availability of georeferenced microdata in several fields of research, surveys can benefit greatly from the use of the most recent spatial sampling methods. These methods allow to select spatially balanced samples, which lead to particularly efficient estimates, by incorporating the distances among the exact locations of statistical units into the design. Unfortunately, si...
متن کاملNonparametric Estimation of Spatial Risk for a Mean Nonstationary Random Field}
The common methods for spatial risk estimation are investigated for a stationary random field. Because of simplifying, lets distribution is known, and parametric variogram for the random field are considered. In this paper, we study a nonparametric spatial method for spatial risk. In this method, we model the random field trend by a local linear estimator, and through bias-corrected residuals, ...
متن کاملGeostatistical analysis of health data with different levels of spatial aggregation.
This paper presents a geostatistical approach to combine two geographical sets of area-based data into the mapping of disease risk, with an application to the rate of prostate cancer late-stage diagnosis in North Florida. This methodology is used to combine individual-level data assigned to census tracts for confidentiality reasons with individual-level data that were allocated to ZIP codes bec...
متن کاملError and Bias in Determining Exposure Potential of Children at School Locations Using Proximity-Based GIS Techniques
BACKGROUND The widespread availability of powerful tools in commercial geographic information system (GIS) software has made address geocoding a widely employed technique in spatial epidemiologic studies. OBJECTIVE The objective of this study was to determine the effect of the positional error in geocoding on the analysis of exposure to traffic-related air pollution of children at school loca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006