A Comparison of State-of-the-art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection

نویسندگان

  • Stijn Viaene
  • Richard A. Derrig
  • Bart Baesens
  • Guido Dedene
چکیده

Several state-of-the-art binary classification techniques are experimentally evaluated in the context of expert automobile insurance claim fraud detection. The predictive power of logistic regression, C4.5 decision tree, k-nearest neighbor, Bayesian learning multilayer perceptron neural network, least-squares support vector machine, naive Bayes, and tree-augmented naive Bayes classification is contrasted. For most of these algorithm types, we report on several operationalizations using alternative hyperparameter or design choices. We compare these in terms of mean percentage correctly classified (PCC) and mean area under the receiver operating characteristic (AUROC) curve using a stratified, blocked, ten-fold cross-validation experiment. We also contrast algorithm type performance visually by means of the convex hull of the receiver operating characteristic (ROC) curves associated with the alternative operationalizations per algorithm type. The study is based on a data set of 1,399 personal injury protection claims from 1993 accidents collected by the Automobile Insurers Bureau of Massachusetts. To stay as close to real-life operating conditions as possible, we consider only predictors that are known relatively early in the life of a claim. Furthermore, based on the qualification of each available claim by both a verbal expert assessment of suspicion of fraud and a ten-point-scale expert suspicion score, we can Stijn Viaene, Bart Baesens, and Guido Dedene are at the K. U. Leuven Department of Applied Economic Sciences, Leuven, Belgium. Richard Derrig is with the Automobile Insurers Bureau of Massachusetts, Boston. Presented at Fifth International Congress on Insurance: Mathematics & Economics July 23-25, 2001, Penn State University. This work was sponsored by the KBC Insurance Research Chair Management Informatics at the K. U. Leuven Department of Applied Economic Sciences. The KBC Research Chair was set up in September 1997 as a pioneering collaboration between the Leuven Institute for Research on Information Systems and the KBC Bank & Insurance group. We are grateful to the Automobile Insurers Bureau (AIB) of Massachusetts and the Insurance Fraud Bureau (IFB) of Massachusetts for providing us with the data that was used for this benchmark study.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Unsupervised Automobile Insurance Fraud Detection Based on Spectral Ranking of Anomalies

Collecting insurance fraud samples is costly and if performed manually is very time consuming. This issue suggests usage of unsupervised models. One of the accurate methods in this regards is Spectral Ranking of Anomalies (SRA) that is shown to work better than other methods for auto insurance fraud detection specifically. However, this approach is not scalable to large samples and is not appro...

متن کامل

Computational Intelligence Models for Insurance Fraud Detection: A Review of a Decade of Research

This paper presents a review of the literature on the application of data mining techniques for the detection of insurance fraud. Academic literature were analyzed and classified into three types of insurance fraud (automobile insurance, crop insurance and healthcare insurance) and six classes of data mining techniques (classification, regression, clustering, prediction, outlier detection, and ...

متن کامل

Auto claim fraud detection using Bayesian learning neural networks

This article explores the explicative capabilities of neural network classifiers with automatic relevance determination weight regularization, and reports the findings from applying these networks for personal injury protection automobile insurance claim fraud detection. The automatic relevance determination objective function scheme provides us with a way to determine which inputs are most inf...

متن کامل

FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs

This article introduces to the statistical and insurance literature a mathematical technique for an a priori classification of objects when no training sample exists for which the exact correct group membership is known. The article also provides an example of the empirical application of the methodology to fraud detection for bodily injury claims in automobile insurance. With this technique, p...

متن کامل

Using Kohonen's Self-Organizing Feature Map to Uncover Automobile Bodily Injury Claims Fraud

Claims fraud is an increasingly vexing problem confronting the insurance industry. In this empirical study, we apply Kohonen's Self-Organizing Feature Map to classify automobile bodily injury (BI) claims by the degree of fraud suspicion. Feed forward neural networks and a back propagation algorithm are used to investigate the validity of the Feature Map approach. Comparative experiments illustr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002