Analysis of Household Pulse Survey Public-Use Microdata via Unit-Level Models for Informative Sampling
نویسندگان
چکیده
The Household Pulse Survey, recently released by the U.S. Census Bureau, gathers information about respondents’ experiences regarding employment status, food security, housing, physical and mental health, access to health care, education disruption. Design-based estimates are produced for all 50 states District of Columbia (DC), as well 15 Metropolitan Statistical Areas (MSAs). Using public-use microdata, this paper explores effectiveness using unit-level model-based estimators that incorporate spatial dependence Survey. In particular, we consider Bayesian hierarchical both a binomial multinomial response under informative sampling. Importantly, demonstrate these models can be easily estimated Hamiltonian Monte Carlo through Stan software package. doing so, readily implemented in production environment. For responses, an empirical simulation study is conducted, which compares non-spatial models. Finally, Survey micro-data, provide analysis design-based demonstrates reduction standard errors approaches.
منابع مشابه
Sampling with Synthesis: A New Approach for Releasing Public Use Census Microdata
Many statistical agencies disseminate samples of census microdata, i.e., data on individual records, to the public. Before releasing the microdata, agencies typically alter identifying or sensitive values to protect data subjects’ confidentiality, for example by coarsening, perturbing, or swapping data. These standard disclosure limitation techniques distort relationships and distributional fea...
متن کاملPrivacy Protection from Sampling and Perturbation in Survey Microdata
Statistical agencies release microdata from social surveys as public-use files after applying statistical disclosure limitation (SDL) techniques. Disclosure risk is typically assessed in terms of identification risk, where it is supposed that small counts on cross-classified identifying key variables, i.e., a key, could be used to make an identification and confidential information may be learn...
متن کاملParametric Distributions of Complex Survey Data under Informative Probability Sampling
The sample distribution is defined as the distribution of the sample measurements given the selected sample. Under informative sampling, this distribution is different from the corresponding population distribution, although for several examples the two distributions are shown to be in the same family and only differ in some or all the parameters. A general approach of approximating the margina...
متن کاملUsing CART to Generate Partially Synthetic, Public Use Microdata
To limit disclosure risks, one approach is to release partially synthetic, public use microdata sets. These comprise the units originally surveyed, but some collected values, for example sensitive values at high risk of disclosure or values of key identifiers, are replaced with multiple imputations. This article presents and evaluates the use of classification and regression trees to generate p...
متن کاملThe 2006 Earnings Public-Use Microdata File: an introduction.
This article introduces the 2006 Earnings Public-Use File (EPUF) and provides important background information on the file's data fields. The EPUF contains selected demographic and earnings information for 4.3 million individuals drawn from a 1-percent sample of all Social Security numbers issued before January 2007. The data file provides aggregate earnings for 1937 to 1950 and annual earnings...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Stats
سال: 2022
ISSN: ['2571-905X']
DOI: https://doi.org/10.3390/stats5010010