Modeling Insurance Fraud Detection Using Imbalanced Data Classification

被引:38
|
作者
Hassan, Amira Kamil Ibrahim [1 ,2 ]
Abraham, Ajith [1 ,3 ]
机构
[1] Sudan Univ Sci & Technol, Dept Comp Sci, Khartoum, Sudan
[2] MIR Labs, Auburn, WA USA
[3] VSB Tech Univ Ostrava, IT4Innovat, Ostrava, Czech Republic
关键词
Insurance fraud detection; Imbalanced data; Decision tree; Support vector machine and artificial neural network; AUTOMOBILE INSURANCE; CLAIMS;
D O I
10.1007/978-3-319-27400-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an innovative insurance fraud detection method to deal with the imbalanced data distribution. The idea is based on building insurance fraud detection models using Decision tree (DT), Support vector machine (SVM) and Artificial Neural Network (ANN), on data partitions derived from under-sampling (with-replacement and without-replacement) of the majority class and merging it with the minority class. Throughout the paper, ten-fold cross validation method of testing is used. Its originality lies in the use of several partitioning under-sampling approaches and choosing the best. Results from a publicly available automobile insurance fraud detection data set demonstrate that DT performs slightly better than other algorithms, so DT model was used to compare between different partitioning-under-sampling approaches. Empirical results illustrate that the proposed model gave better results.
引用
收藏
页码:117 / 127
页数:11
相关论文
共 50 条
  • [41] A Model for the Detection of Insurance Fraud
    Belhadji E.B.
    Dionne G.
    Tarkhani F.
    The Geneva Papers on Risk and Insurance - Issues and Practice, 2000, 25 (4) : 517 - 538
  • [42] An efficient fraud detection framework with credit card imbalanced data in financial services
    Aya Abd El-Naby
    Ezz El-Din Hemdan
    Ayman El-Sayed
    Multimedia Tools and Applications, 2023, 82 : 4139 - 4160
  • [43] Synthesizing class labels for highly imbalanced credit card fraud detection data
    Robert K. L. Kennedy
    Flavio Villanustre
    Taghi M. Khoshgoftaar
    Zahra Salekshahrezaee
    Journal of Big Data, 11
  • [44] Facial Fraud Discrimination Using Detection and Classification
    Choi, Inho
    Kim, Daijin
    ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 199 - 208
  • [45] Stacked generalizations in imbalanced fraud data sets using resampling methods
    Kerwin, Kathleen R.
    Bastian, Nathaniel D.
    JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2021, 18 (03): : 175 - 192
  • [46] Prediction of Insurance Fraud Detection using Machine Learning Algorithms
    Rukhsar, Laiqa
    Bangyal, Waqas Haider
    Nisar, Kashif
    Nisar, Sana
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2022, 41 (01) : 33 - 40
  • [47] Vehicle Insurance Fraud Detection Based on Hybrid Approach for Data Augmentation
    Rubaidi, Zainab Saad
    Ammar, Boulbaba Ben
    Aouicha, Mohamed Ben
    JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2023, 18 (05): : 135 - 146
  • [48] Fraud Detection in Healthcare Insurance Claims Using Machine Learning
    Nabrawi, Eman
    Alanazi, Abdullah
    RISKS, 2023, 11 (09)
  • [49] Binary classification for imbalanced data using data conformity mechanism
    Zheng, Jian
    Ren, Shumiao
    Zhang, Jingyue
    Wang, Shiyan
    Li, Lin
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [50] Imbalanced Data Stream Classification Using Hybrid Data Preprocessing
    Bobowska, Barbara
    Klikowski, Jakub
    Wozniak, Michal
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 402 - 413