Characterizing and Managing Missing Structured Data in Electronic Health Records: Data Analysis

نویسندگان

  • Brett K Beaulieu-Jones
  • Daniel R Lavage
  • John W Snyder
  • Jason H Moore
  • Sarah A Pendergrass
  • Christopher R Bauer
چکیده

BACKGROUND Missing data is a challenge for all studies; however, this is especially true for electronic health record (EHR)-based analyses. Failure to appropriately consider missing data can lead to biased results. While there has been extensive theoretical work on imputation, and many sophisticated methods are now available, it remains quite challenging for researchers to implement these methods appropriately. Here, we provide detailed procedures for when and how to conduct imputation of EHR laboratory results. OBJECTIVE The objective of this study was to demonstrate how the mechanism of missingness can be assessed, evaluate the performance of a variety of imputation methods, and describe some of the most frequent problems that can be encountered. METHODS We analyzed clinical laboratory measures from 602,366 patients in the EHR of Geisinger Health System in Pennsylvania, USA. Using these data, we constructed a representative set of complete cases and assessed the performance of 12 different imputation methods for missing data that was simulated based on 4 mechanisms of missingness (missing completely at random, missing not at random, missing at random, and real data modelling). RESULTS Our results showed that several methods, including variations of Multivariate Imputation by Chained Equations (MICE) and softImpute, consistently imputed missing values with low error; however, only a subset of the MICE methods was suitable for multiple imputation. CONCLUSIONS The analyses we describe provide an outline of considerations for dealing with missing EHR data, steps that researchers can perform to characterize missingness within their own data, and an evaluation of methods that can be applied to impute clinical data. While the performance of methods may vary between datasets, the process we describe can be generalized to the majority of structured data types that exist in EHRs, and all of our methods and code are publicly available.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Characterizing and Managing Missing Structured Data in Electronic Health Records

Missing data is a challenge for all studies; however, this is especially true for electronic health record (EHR) based analyses. Failure to appropriately consider missing data can lead to biased results. Here, we provide detailed procedures for when and how to conduct imputation of EHR data. We demonstrate how the mechanism of missingness can be assessed, evaluate the performance of a variety o...

متن کامل

Strategies for Handling Missing Data in Electronic Health Record Derived Data

Electronic health records (EHRs) present a wealth of data that are vital for improving patient-centered outcomes, although the data can present significant statistical challenges. In particular, EHR data contains substantial missing information that if left unaddressed could reduce the validity of conclusions drawn. Properly addressing the missing data issue in EHR data is complicated by the fa...

متن کامل

طراحی مدل مفهومی سیستم گزارش دهی آزمایشگاه جهت تبادل داده با سامانه پرونده الکترونیک سلامت ایران

Introduction: Integration of health information systems based on a common language is essential to exchange data with the system. The study aimed to eliminate the existing problem in the integration of information system with electronic health records system through providing a conceptual model of laboratory reporting system, using the Unified Modeling Language and enable information system dev...

متن کامل

طراحی و ایجاد پرونده ی الکترونیک سلامت بیماران مول هیداتیفرم و بررسی میزان تکمیل اطلاعات در پرونده های کاغذی بیماران

Background and Aim: To provide effective care, health care providers need timely and appropriate information. Electronic records provide quick access and easy management of data. The aim of this study was to develop electronic health records for patients with hydatidiform mole and evaluation of completeness of medical records Materials and Methods: This applied study was conducted in 2017. Aft...

متن کامل

Evaluation of Barriers and Facilitators Affecting the Implementation of Electronic Health Records in Iran

Introduction: Despite the development of information technology in the field of health, the process of creating and using electronic health records is still difficult. Therefore, identifying the implementation barriers of this system contribute to eliminate them and adopt effective implementation strategies. Methods and Materials: The present study is a review article and the research populati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2018