Application of data linkage techniques to Pacific Northwest commercial fishing injury and fatality data
نویسندگان
چکیده
Abstract Background Commercial fishing consistently has among the highest workforce injury and fatality rates in United States. Data related to commercial incidents are routinely collected by multiple organizations which do not currently coordinate or automatically link data. Each data set potential generate a more complete picture inform prevention efforts. Our objective was examine utility of using statistical linkage methods incident when personally identifiable information is available. Methods In this feasibility study, we identified true matches discrepancies between de-identified sets Python Record Linkage Toolkit. Four from Oregon Washington were linked: Fishing Incident Database, Vessel Casualty Nonfatal Injuries Trauma Registry. The each covered different date ranges within 2000–2017, containing 458, 524, 184, 11 cases respectively. Several classifiers evaluated. Results Naïve-Bayes classifier returned number these small sets. A total 41 8 close identified, 29 determined be duplicates. addition, highlighted 4 records that Washington. optimum match parameters date, state, vessel official number, people on board. Conclusions Statistical enables accurate, routine matching for such as those fishing. It provides needed improve accuracy existing records. also expanding sharpening details individual support occupational safety research.
منابع مشابه
the clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
Penalized Lasso Methods in Health Data: application to trauma and influenza data of Kerman
Background: Two main issues that challenge model building are number of Events Per Variable and multicollinearity among exploratory variables. Our aim is to review statistical methods that tackle these issues with emphasize on penalized Lasso regression model. The present study aimed to explain problems of traditional regressions due to small sample size and m...
متن کاملconstruction and validation of translation metacognitive strategy questionnaire and its application to translation quality
like any other learning activity, translation is a problem solving activity which involves executing parallel cognitive processes. the ability to think about these higher processes, plan, organize, monitor and evaluate the most influential executive cognitive processes is what flavell (1975) called “metacognition” which encompasses raising awareness of mental processes as well as using effectiv...
Data quality and record linkage techniques
Preparing the books to read every day is enjoyable for many people. However, there are still many people who also don't like reading. This is a problem. But, when you can support others to start reading, it will be better. One of the books that can be recommended for new readers is data quality and record linkage techniques. This book is not kind of difficult book to read. It can be read and un...
متن کاملProbabilistic Linkage of Victorian Injury Data Records
Research utilising the available mass databases of real world road crashes compiled by Police, government and insurers in Victoria provides much useful information for injury prevention research purposes. The record linkage of these datasets and hospital injury records has the potential to maximise the use of available data by researchers to extend the understanding of the causes, outcomes and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Injury Epidemiology
سال: 2021
ISSN: ['2197-1714']
DOI: https://doi.org/10.1186/s40621-021-00323-z