Fraud risk assessment in car insurance using claims graph features in machine learning

Cited by: 1
Authors
Vorobyev, Ivan [1]
Affiliations
[1] HSE Univ, Moscow, Russia
Keywords
Fraud detection; Insurance claims; Machine learning; Graph features; Risk assessment;
DOI
10.1016/j.eswa.2024.124109
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
The article proposes a process for claims assessment in car insurance that makes it possible to calculate the fraud rate on an annual set of claims using a reduced set of attributes and graph vertex properties. This approach improves the security of insurance companies' assets against fraudulent attacks. A method for constructing a claims graph and extracting additional features from it for evaluation is described. It is shown that building the graph does not require data on the connections between claim participants. Two tests were carried out on real open-source datasets with labelled fraudulent cases. The results of the first test show an increase in classification metrics when attributes obtained from the graph are used: applying the proposed approach doubled the area under the Precision-Recall curve. The experiments demonstrated high-quality fraud detection metrics, with a Recall of 83.33% and a Specificity of 91.05%. The second test confirmed that the insurance fraud level can be determined with a decision rule that includes the condition that claims are connected to each other. The rule is able to detect claim groups with a high concentration of fraud, in which every second participant is a fraudster.
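The abstract does not say how claims are linked or which vertex properties are used, so the following is only a minimal sketch of the general idea, not the author's method. It assumes that two claims are connected when they share a value of some attribute (the field names `agent` and `repair_shop`, the sample claims, and all function names are hypothetical), derives two simple vertex features (degree and connected-component size), and shows how Recall and Specificity are computed from binary labels.

```python
from collections import defaultdict
from itertools import combinations


def build_claims_graph(claims, link_keys):
    """Connect two claims when they share a value of any attribute in link_keys.

    This linking rule is an assumption made for illustration only.
    """
    adjacency = {claim_id: set() for claim_id in claims}
    buckets = defaultdict(list)
    for claim_id, attrs in claims.items():
        for key in link_keys:
            value = attrs.get(key)
            if value is not None:
                buckets[(key, value)].append(claim_id)
    for members in buckets.values():
        for a, b in combinations(members, 2):
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


def vertex_features(adjacency):
    """Return the degree and connected-component size of every claim vertex."""
    component_size = {}
    for start in adjacency:
        if start in component_size:
            continue
        stack, seen = [start], {start}
        while stack:  # iterative depth-first search over one component
            node = stack.pop()
            for neighbour in adjacency[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        for node in seen:
            component_size[node] = len(seen)
    return {
        claim_id: {
            "degree": len(adjacency[claim_id]),
            "component_size": component_size[claim_id],
        }
        for claim_id in adjacency
    }


def recall_and_specificity(y_true, y_pred):
    """Recall = TP / (TP + FN); Specificity = TN / (TN + FP)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    recall = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return recall, specificity


# Hypothetical toy data: c1-c2 share an agent, c2-c3 share a repair shop.
claims = {
    "c1": {"agent": "A", "repair_shop": "S1"},
    "c2": {"agent": "A", "repair_shop": "S2"},
    "c3": {"agent": "B", "repair_shop": "S2"},
    "c4": {"agent": "C", "repair_shop": "S3"},
}
graph = build_claims_graph(claims, link_keys=["agent", "repair_shop"])
features = vertex_features(graph)
recall, specificity = recall_and_specificity([1, 1, 0, 0, 0], [1, 0, 0, 0, 1])
```

In a setup like this, the per-vertex features would be appended to each claim's attribute vector before training a classifier, and a connectivity condition such as a minimum component size could serve as one part of a decision rule over claim groups.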
Pages: 15