Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
نویسندگان
چکیده
The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts.
منابع مشابه
Toward the Automatic Generation of the Entry Level CDA Documents
Objective: CDA (Clinical Document Architecture) is a markup standard for clinical document exchange. In order to increase the semantic interoperability of documents exchange, the clinical statements in the narrative blocks should be encoded with code values. Natural language processing (NLP) is required in order to transform the narrative blocks into the coded elements in the level 3 CDA docume...
متن کاملCrisis discharges and readmission risk in acute psychiatric male inpatients
BACKGROUND Severe pressures on beds in psychiatric services have led to the implementation of an early ("crisis") discharge policy in the Western Cape, South Africa. The study examined the effect of this policy and length of hospital stay (LOS) on readmission rates in one psychiatric hospital in South Africa. METHODS Discharge summaries of adult male patients (n = 438) admitted to Stikland Ps...
متن کاملAutomating ICD-9-CM Encoding Using Medical Language Processing: A Feasibility Study
Objective. To provide a qualitative evaluation of Natural Language Processing (NLP) based ICD-9CM (Encoding of narrative discharge summaries). Background. MedLEE is a NLP system that structures the information of textual medical reports. It was shown to be effective for decision support applications associated with narrative chest X-rays, mammograms, and Discharge Summaries (DS). Significance. ...
متن کاملA semantic lexicon for medical language processing.
OBJECTIVE Construction of a resource that provides semantic information about words and phrases to facilitate the computer processing of medical narrative. DESIGN Lexemes (words and word phrases) in the Specialist Lexicon were matched against strings in the 1997 Metathesaurus of the Unified Medical Language System (UMLS) developed by the National Library of Medicine. This yielded a "semantic ...
متن کاملSentiment Measured in Hospital Discharge Notes Is Associated with Readmission and Mortality Risk: An Electronic Health Record Study
Natural language processing tools allow the characterization of sentiment--that is, terms expressing positive and negative emotion--in text. Applying such tools to electronic health records may provide insight into meaningful patient or clinician features not captured in coded data alone. We performed sentiment analysis on 2,484 hospital discharge notes for 2,010 individuals from a psychiatric ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 6 شماره
صفحات -
تاریخ انتشار 2016