Simultaneous Edit-Imputation for Continuous Microdata
نویسندگان
چکیده
منابع مشابه
Bayesian Simultaneous Edit and Imputation for Multivariate Categorical Data
In categorical data, it is typically the case that some combinations of variables are theoretically impossible, such as a three year old child who is married or a man who is pregnant. In practice, however, reported values often include such structural zeros due to, for example, respondent mistakes or data processing errors. To purge data of such errors, many statistical organizations use a proc...
متن کاملItem Imputation with the Discrete Edit System
The Fellegi-Holt algorithm (Fellegi and Holt 1976) provides a framework for item imputation by identifying for each record with one or more edit failures a minimal set of fields that must be imputed in order to satisfy a cohesive set of edits. The set of fields is minimal in the sense that there exists at least one joint value for the fields in the set such that, when this joint value is substi...
متن کاملDealing with Edit Constraints in Microdata Protection: Microaggregation
In this paper we discuss how most edit constraints can be taken into account in an effective way through microaggregation. We discuss different edit constraints and some variations of microaggregation that permits to deal with such constraints. We will also present our software to formalize and deal with such constraints in an automatic way.
متن کاملUsing Multiple Imputation to Integrate and Disseminate Confidential Microdata
In data integration contexts, two statistical agencies seek to merge their separate databases in one file. The agencies also may seek to disseminate data to the public based on the integrated file. These goals may be complicated by the agencies’ need to protect the confidentiality of database subjects, which could be at risk during the integration or dissemination stage. This article proposes s...
متن کاملCalibrated Imputation of Numerical Data under Linear Edit Restrictions
A common problem faced by statistical institutes is that data may be missing from collected datasets. The typical way to overcome this problem is to impute the missing data. The problem of imputing missing data is complicated by the fact that statistical data often have to satisfy certain edit rules and that values of variables across units sometimes have to sum up to known totals. For numerica...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SSRN Electronic Journal
سال: 2015
ISSN: 1556-5068
DOI: 10.2139/ssrn.2698601