Guided Self-Training based Semi-Supervised Learning for Fraud Detection

被引:3
|
作者
Kumar, Awanish [1 ]
Ghosh, Soumyadeep [1 ]
Verma, Janu [1 ]
机构
[1] Mastercard, AI Garage, Gurgaon, India
关键词
adversarial attack; vulnerability detection; vulnerability mitigation; transaction level vulnerability; black box vulnerability detection;
D O I
10.1145/3533271.3561783
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Semi supervised learning has attracted attention of AI researchers in the recent past, especially after the advent of deep learning methods and their success in several real world applications. Most deep learning models require large amounts of labelled data, which is expensive to obtain. Fraud detection is a very important problem for several industries and large amount of data is often available. However, obtaining labelled data is cumbersome and hence semi-supervised learning is perfectly positioned to aid us in building robust and accurate supervised models. In this work, we consider different kinds of fraud detection paradigms and show that a self-training based semi-supervised learning approach can produce significant improvements over a model that has been training on a limited set of labelled data. We propose a novel self-training approach by using a guided sharpening technique using a pair of autoencoders which provide useful cues for incorporating unlabelled data in the training process. We conduct thorough experiments on three different real world databases and analysis to showcase the effectiveness of the approach. On the elliptic bitcoin fraud dataset, we show that utilizing unlabelled data improves the F-1 score of the model trained on limited labelled data by around 10%.
引用
收藏
页码:148 / 155
页数:8
相关论文
共 50 条
  • [41] Improving semi-supervised self-training with embedded manifold transduction
    Tao, Ye
    Zhang, Duzhou
    Cheng, Shengjun
    Tang, Xianglong
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2018, 40 (02) : 363 - 374
  • [42] An Auto-Adjustable Semi-Supervised Self-Training Algorithm
    Livieris, Ioannis E.
    Kanavos, Andreas
    Tampakas, Vassilis
    Pintelas, Panagiotis
    ALGORITHMS, 2018, 11 (09):
  • [43] A semi-supervised self-training method based on density peaks and natural neighbors
    Suwen Zhao
    Junnan Li
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 2939 - 2953
  • [44] A semi-supervised self-training method based on density peaks and natural neighbors
    Zhao, Suwen
    Li, Junnan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2939 - 2953
  • [45] Semi-supervised ASR by End-to-end Self-training
    Chen, Yang
    Wang, Weiran
    Wang, Chao
    INTERSPEECH 2020, 2020, : 2787 - 2791
  • [46] The GIST and RIST of Iterative Self-Training for Semi-Supervised Segmentation
    Teh, Eu Wern
    DeVries, Terrance
    Duke, Brendan
    Jiang, Ruowei
    Aarabi, Parham
    Taylor, Graham W.
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 58 - 66
  • [47] SEMI-SUPERVISED SINGING VOICE SEPARATION WITH NOISY SELF-TRAINING
    Wang, Zhepei
    Giri, Ritwik
    Isik, Umut
    Valin, Jean-Marc
    Krishnaswamy, Arvindh
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 31 - 35
  • [48] Semi-supervised Sentiment Classification with Self-training on Feature Subspaces
    Gao, Wei
    Li, Shoushan
    Xue, Yunxia
    Wang, Meng
    Zhou, Guodong
    CHINESE LEXICAL SEMANTICS, 2014, 8922 : 231 - 239
  • [49] Semi-Supervised Self-Training Method Based on an Optimum-Path Forest
    Li, Junnan
    Zhu, Qingsheng
    IEEE ACCESS, 2019, 7 : 36388 - 36399
  • [50] A novel self-training semi-supervised deep learning approach for machinery fault diagnosis
    Long, Jianyu
    Chen, Yibin
    Yang, Zhe
    Huang, Yunwei
    Li, Chuan
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (23) : 8238 - 8251