Use of optimized Fuzzy C-Means clustering and supervised classifiers for automobile insurance fraud detection

被引:49
|
作者
Subudhi, Sharmila [1 ]
Panigrahi, Suvasini [1 ]
机构
[1] Veer Surendra Sai Univ Technol, Dept Comp Sci & Engn & IT, Burla 768018, Odisha, India
关键词
Fraud detection; Insurance claims; Genetic Algorithm; Fuzzy C-Means clustering; Supervised classifiers;
D O I
10.1016/j.jksuci.2017.09.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel hybrid approach for detecting frauds in automobile insurance claims by applying Genetic Algorithm (GA) based Fuzzy C-Means (FCM) clustering and various supervised classifier models. Initially, a test set is extracted from the original insurance dataset. The remaining train set is subjected to the clustering technique for undersampling after generating some meaningful clusters. The test instances are then segregated into genuine, malicious or suspicious classes after subjecting to the clusters. The genuine and fraudulent records are discarded, while the suspicious cases are further analyzed by four classifiers - Decision Tree (DT), Support Vector Machine (SVM), Group Method of Data Handling (GMDH) and Multi-Layer Perceptron (MLP) individually. The 10-fold cross validation method is used throughout the work for training and validation of the models. The efficacy of the proposed system is illustrated by conducting several experiments on a real world automobile insurance dataset. (C) 2017 The Authors. Production and hosting by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:568 / 575
页数:8
相关论文
共 50 条
  • [1] Two-stage automobile insurance fraud detection by using optimized fuzzy C-means clustering and supervised learning
    Subudhi S.
    Panigrahi S.
    International Journal of Information Security and Privacy, 2020, 14 (03): : 18 - 37
  • [2] Automobile Insurance Fraud Detection using Supervised Classifiers
    Prasasti, Iffa Maula Nur
    Dhini, Arian
    Laoh, Enrico
    2020 5TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2020), 2020, : 49 - 53
  • [3] Use of Possibilistic Fuzzy C-means Clustering for Telecom Fraud Detection
    Subudhi, Sharmila
    Panigrahi, Suvasini
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 633 - 641
  • [4] Online Classifiers Based on Fuzzy C-means Clustering
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 427 - 436
  • [5] On Semi-Supervised Fuzzy c-Means Clustering
    Yasunori, Endo
    Yukihiro, Hamasuna
    Makito, Yamashiro
    Sadaaki, Miyamoto
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1119 - +
  • [6] Optimized Fuzzy C-means Clustering Methods for Defect Detection on Leather Surface
    Mohammed, Khwaja Muinuddin Chisti
    Kumar, S. Srinivas
    Prasad, Gandikota
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2020, 79 (09): : 833 - 836
  • [7] Safe Semi-Supervised Fuzzy C-Means Clustering
    Gan, Haitao
    IEEE ACCESS, 2019, 7 : 95659 - 95664
  • [8] Fuzzy C-Means Text Clustering with Supervised Feature Selection
    Wang, Wei
    Wang, Chunheng
    Cui, Xia
    Wang, Ai
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 57 - 61
  • [9] Structural Damage Detection Based on Semi-supervised Fuzzy C-means Clustering
    Liu, Zhen
    Zhou, Qifeng
    Chi, Qijun
    Zhang, Yuanyuan
    Chen, Youling
    Qi, Sen
    2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014), 2014, : 551 - 556
  • [10] Semi-supervised outlier detection based on fuzzy rough C-means clustering
    Xue, Zhenxia
    Shang, Youlin
    Feng, Aifen
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2010, 80 (09) : 1911 - 1921