A comparison of state-of-the-art classification techniques for expert automobile insurance claim fraud detection

Cited by: 108
Authors
Viaene, S [1]
Derrig, RA
Baesens, B
Dedene, G
Affiliations
[1] Katholieke Univ Leuven, Dept Appl Econ Sci, Louvain, Belgium
[2] Automobile Insurers Bur Massachusetts, Boston, MA USA
DOI
10.1111/1539-6975.00023
Chinese Library Classification number
F8 [Public Finance, Finance];
Discipline classification code
0202;
Abstract
Several state-of-the-art binary classification techniques are experimentally evaluated in the context of expert automobile insurance claim fraud detection. The predictive power of logistic regression, C4.5 decision tree, k-nearest neighbor, Bayesian learning multilayer perceptron neural network, least-squares support vector machine, naive Bayes, and tree-augmented naive Bayes classification is contrasted. For most of these algorithm types, we report on several operationalizations using alternative hyperparameter or design choices. We compare these in terms of mean percentage correctly classified (PCC) and mean area under the receiver operating characteristic (AUROC) curve using a stratified, blocked, ten-fold cross-validation experiment. We also contrast algorithm type performance visually by means of the convex hull of the receiver operating characteristic (ROC) curves associated with the alternative operationalizations per algorithm type. The study is based on a data set of 1,399 personal injury protection claims from 1993 accidents collected by the Automobile Insurers Bureau of Massachusetts. To stay as close to real-life operating conditions as possible, we consider only predictors that are known relatively early in the life of a claim. Furthermore, based on the qualification of each available claim by both a verbal expert assessment of suspicion of fraud and a ten-point-scale expert suspicion score, we can compare classification for different target/class encoding schemes. Finally, we also investigate the added value of systematically collecting nonflag predictors for suspicion of fraud modeling purposes. 
From the observed results, we may state that: (1) independent of the target encoding scheme and the algorithm type, the inclusion of nonflag predictors allows us to significantly boost predictive performance; (2) for all the evaluated scenarios, the performance difference in terms of mean PCC and mean AUROC between many algorithm type operationalizations turns out to be rather small; visual comparison of the algorithm type ROC curve convex hulls also shows limited difference in performance over the range of operating conditions; (3) relatively simple and efficient techniques such as linear logistic regression and linear kernel least-squares support vector machine classification show excellent overall predictive capabilities, and (smoothed) naive Bayes also performs well; and (4) the C4.5 decision tree operationalization results are rather disappointing; none of the tree operationalizations are capable of attaining mean AUROC performance in line with the best. Visual inspection of the evaluated scenarios reveals that the C4.5 algorithm type ROC curve convex hull is often dominated in large part by most of the other algorithm type hulls.
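The evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the 0.5 decision threshold, and the round-robin fold assignment are assumptions made for illustration, and "blocked" is read here as reusing the same folds for every classifier so that results are paired. The sketch computes PCC, AUROC as the probability that a random positive claim outranks a random negative one, the ROC points of a scored sample, and the upper convex hull of ROC points pooled over several operationalizations of one algorithm type.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (false positive rate, true positive rate)


def stratified_folds(y_true: Sequence[int], k: int = 10) -> List[List[int]]:
    """Split indices into k folds with near-equal class proportions.

    Assumption: a deterministic round-robin per class; a real experiment
    would normally shuffle first. Reuse the returned folds for every
    classifier to keep the comparison paired ("blocked").
    """
    folds: List[List[int]] = [[] for _ in range(k)]
    j = 0
    for label in sorted(set(y_true)):
        for i, y in enumerate(y_true):
            if y == label:
                folds[j % k].append(i)
                j += 1
    return folds


def pcc(y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5) -> float:
    """Percentage correctly classified at a fixed score threshold."""
    correct = sum((s >= threshold) == bool(y) for y, s in zip(y_true, y_score))
    return 100.0 * correct / len(y_true)


def auroc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUROC via pairwise comparison of positives and negatives (ties count 1/2)."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def roc_points(y_true: Sequence[int], y_score: Sequence[float]) -> List[Point]:
    """ROC points, one per distinct score threshold, starting at (0, 0)."""
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    points: List[Point] = [(0.0, 0.0)]
    for t in sorted(set(y_score), reverse=True):
        tp = sum(1 for y, s in zip(y_true, y_score) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(y_true, y_score) if s >= t and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def roc_convex_hull(points: Sequence[Point]) -> List[Point]:
    """Upper convex hull of ROC points, always including (0, 0) and (1, 1)."""
    pts = sorted(set(points) | {(0.0, 0.0), (1.0, 1.0)})
    hull: List[Point] = []
    for p in pts:
        # Drop the last hull point while it lies on or below the segment
        # from the point before it to the new point p.
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            cross = (x2 - x1) * (p[1] - y1) - (y2 - y1) * (p[0] - x1)
            if cross >= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    return hull


if __name__ == "__main__":
    # Toy labels (1 = suspected fraud) and scores, invented for illustration.
    labels = [1, 1, 0, 1, 0, 0]
    scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
    print("PCC:", pcc(labels, scores))
    print("AUROC:", auroc(labels, scores))
    print("Hull:", roc_convex_hull(roc_points(labels, scores)))
```

In a full experiment, the mean PCC and mean AUROC would be the averages of these per-fold values over the ten test folds, and one hull per algorithm type would be built from the pooled ROC points of its operationalizations. A hull that lies below another over the whole range of false positive rates is dominated in the sense used in the abstract.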
Pages: 373-421
Number of pages: 49