Use of Data Mining Techniques for Data Balancing and Fraud Detection in Automobile Insurance Claims

Cited by: 1
Authors
Padhi, Slokashree [1]
Panigrahi, Suvasini [1]
Affiliations
[1] Veer Surendra Sai Univ Technol, Dept CSE, Sambalpur 768018, Odisha, India
Keywords
Automobile insurance; Outliers; Box and whisker plot; Synthetic minority oversampling; Supervised classifier
DOI
10.1007/978-981-15-1084-7_22
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel hybrid data balancing method that combines undersampling and oversampling with an ensemble technique to detect automobile insurance fraud efficiently. First, the skewness of the original imbalanced dataset is removed by excluding outliers from the majority class samples using a box-and-whisker plot, and synthetic samples are generated from the minority class samples using the synthetic minority oversampling technique (SMOTE). Three supervised classifiers are then employed for classification: support vector machine, multilayer perceptron, and K-nearest neighbors. The final classification results are obtained by aggregating the outputs of these classifiers with a majority voting ensemble. The model has been experimentally evaluated on a real-world automobile insurance dataset.
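The three stages named in the abstract (box-and-whisker outlier exclusion on the majority class, SMOTE-style oversampling of the minority class, and majority voting) can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: the function names, the 1.5 x IQR fence multiplier, the neighbour count, and the toy data are all assumptions made for illustration, and the three classifiers are not trained here; their predictions are simply given as lists.

```python
import random
import statistics
from collections import Counter


def iqr_bounds(values, k=1.5):
    """Box-and-whisker fences: (Q1 - k*IQR, Q3 + k*IQR). k=1.5 is an assumed default."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def drop_majority_outliers(rows, k=1.5):
    """Undersampling step: keep rows whose every feature lies inside its fences."""
    n_features = len(rows[0])
    bounds = [iqr_bounds([r[j] for r in rows], k) for j in range(n_features)]
    return [r for r in rows
            if all(lo <= r[j] <= hi for j, (lo, hi) in enumerate(bounds))]


def smote_like(rows, n_new, k=2, seed=0):
    """Oversampling step: interpolate between a minority row and one of its
    k nearest minority neighbours (the core idea of SMOTE).
    Requires at least two minority rows."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(rows))
        base = rows[i]
        others = [r for j, r in enumerate(rows) if j != i]
        others.sort(key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


def majority_vote(*prediction_lists):
    """Ensemble step: most common label per sample across the classifiers."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*prediction_lists)]


# Toy data (hypothetical): six majority-class rows, one an obvious outlier.
majority = [(10, 5), (11, 6), (12, 5), (13, 6), (14, 5), (100, 6)]
minority = [(1.0, 1.0), (2.0, 2.0), (3.0, 1.0)]

kept = drop_majority_outliers(majority)       # drops (100, 6), keeps 5 rows
synthetic = smote_like(minority, n_new=2)     # 2 new minority-class points

# Stand-ins for the SVM, MLP, and KNN predictions on four test samples.
svm_pred, mlp_pred, knn_pred = [1, 0, 1, 0], [1, 1, 0, 0], [0, 1, 1, 0]
final = majority_vote(svm_pred, mlp_pred, knn_pred)   # [1, 1, 1, 0]
```

With three binary voters a tie cannot occur; with an even number of classifiers or more than two classes, a tie-breaking rule would have to be chosen.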
Pages: 221-230
Page count: 10
Related papers
50 entries in total
  • [31] A Comparison of Data Balancing Techniques for Credit Card Fraud Detection using Neural Network
    Uttam, Atul Kumar
    Sharma, Gaurav
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1136 - 1140
  • [32] Financial fraud: Data mining application and detection
    Aziz, N. H. A.
    Zakaria, N. B.
    Mohamed, I. S.
    RECENT TRENDS IN SOCIAL AND BEHAVIOUR SCIENCES, 2014, : 341 - 344
  • [33] Automobile Insurance Fraud Detection using the Evidential Reasoning Approach and Data-Driven Inferential Modelling
    Liu, Xi
    Yang, Jian-Bo
    Xu, Dong-Ling
    Derrick, Karim
    Stubbs, Chris
    Stockdale, Martin
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [34] Signal detection of rosuvastatin by a novel data mining approach in health insurance claims database
    Choi, Nam-Kyong
    Chang, Yoosoo
    Hahn, Seokyung
    Park, Byung-Joo
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2008, 17 : S17 - S18
  • [35] Data Sharing for Fraud Detection in Insurance: Challenges and Possibilities
    Soilen-Knutsen, Carl Christophe Louis
    Tessem, Bjornar
    ICEIS: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2022, : 93 - 99
  • [36] Data misrepresentation detection for insurance underwriting fraud prevention
    Vandervorst, Felix
    Verbeke, Wouter
    Verdonck, Tim
    DECISION SUPPORT SYSTEMS, 2022, 159
  • [37] Review of Loan Fraud Detection Process in the Banking Sector Using Data Mining Techniques
    Esmail, Fahd Sabry
    Alsheref, Fahad Kamal
    Aboutabl, Amal Elsayed
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (02) : 229 - 239
  • [38] Study of Key Technologies of Insurance Claims Based on Data Mining
    Su Wei
    PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 1896 - 1900
  • [39] Automobile Insurance Fraud Detection using Supervised Classifiers
    Prasasti, Iffa Maula Nur
    Dhini, Arian
    Laoh, Enrico
    2020 5TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2020), 2020, : 49 - 53
  • [40] Simulation and Detection of Healthcare Fraud in German Inpatient Claims Data
    Schrupp, Bernhard
    Klede, Kai
    Raab, Rene
    Eskofier, Bjoern
    COMPUTATIONAL SCIENCE, ICCS 2024, PT IV, 2024, 14835 : 239 - 246