Supervised Machine Learning Model to Predict the Bank Loan Application Using Binary Classification, Decision Tree and Random Forest

被引:0
|
作者
Gnanasekar, A. [1 ]
Rani, P. Shobha [1 ]
Akash, S. [1 ]
Arjunan, S. [1 ]
Devananth, A. [1 ]
机构
[1] RMD Engn Coll, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Anomaly detection; ReMEMBeR; multi-contextual behavior profiling; filtering; online banking; fraud detection; E-COMMERCE TRANSACTIONS; CARD FRAUD DETECTION; SYSTEM;
D O I
10.9756/INT-JECSE/V14I2.499
中图分类号
G76 [特殊教育];
学科分类号
040109 ;
摘要
This paper presents an online fraud detection system that uses anomaly detection to monitor an individual's behavior pattern and compare it with its usage history, which is a representation of the user's normal behavior patterns. Fraud is indicated by any significant deviation from normal behavior. The mechanism suffers from three disadvantages. Limited observations from the historical data, assorted nature of transaction data, and highly distorted information lead to unusually high positive failure rates of anomaly detection. Therefore, we propose a ranking metric embedding-based multi-contextual behavior profiling (ReMEMBeR) model to incorporate the detection mechanism effectively. We transform the original anomaly detection problem into a pseudo-recommender system problem and solve it using a ranking metric embedding-based method. With collaborative filtering, an individual could utilize information from similar individuals implicitly and automatically, which alleviates the individual's possible lack of historical data. By the ranking scheme, the model is trained to maximize the ability to distinguish between legitimate and fraudulent transactions. This helps to make full use of label information and, thus, solves the data skewness problem to the utmost extent. The proposed model integrates multi-contextual behaviour patterns, from purely local to more global ones. Evaluating transactions against multi- contextual behaviour patterns could reduce the error rate and, hence, could bring down the false positive rate. By creating a contrast vector for each transaction based on the customer's past behavior sequence, we profile the differentiation rate of each current transaction against the customer's behavior preference.
引用
收藏
页码:4510 / 4518
页数:9
相关论文
共 50 条
  • [21] Machine learning for classification of cutaneous sebaceous neoplasms: implementing decision tree model using cytological and architectural features
    Kamyab-Hesari, Kambiz
    Azhari, Vahidehsadat
    Ahmadzade, Ali
    Amoli, Fahimeh Asadi
    Najafi, Anahita
    Hasanzadeh, Alireza
    Beikmarzehei, Alireza
    DIAGNOSTIC PATHOLOGY, 2023, 18 (01)
  • [22] Machine learning for classification of cutaneous sebaceous neoplasms: implementing decision tree model using cytological and architectural features
    Kambiz Kamyab-Hesari
    Vahidehsadat Azhari
    Ali Ahmadzade
    Fahimeh Asadi Amoli
    Anahita Najafi
    Alireza Hasanzadeh
    Alireza Beikmarzehei
    Diagnostic Pathology, 18
  • [23] GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran
    Naghibi, Seyed Amir
    Pourghasemi, Hamid Reza
    Dixon, Barnali
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2016, 188 (01) : 1 - 27
  • [24] GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran
    Seyed Amir Naghibi
    Hamid Reza Pourghasemi
    Barnali Dixon
    Environmental Monitoring and Assessment, 2016, 188
  • [25] Using Decision Tree Classification Model to Predict Payment Type in NYC Yellow Taxi
    Ismaeil, Hadeer
    Kholeif, Sherif
    Abdel-Fattah, Manal A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 238 - 244
  • [26] Using Decision Tree Classification Model to Predict Payment Type in NYC Yellow Taxi
    Ismaeil, Hadeer
    Kholeif, Sherif
    Abdel-Fattah, Manal A.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (03): : 238 - 244
  • [27] Harnessing Machine Learning in Vocal Arts Medicine: A Random Forest Application for "Fach" Classification in Opera
    Wang, Zehui
    Mueller, Matthias
    Caffier, Felix
    Caffier, Philipp P.
    DIAGNOSTICS, 2023, 13 (18)
  • [28] Empirical Analysis of Application-Level Traffic Classification Using Supervised Machine Learning
    Park, Byungchul
    Won, Young J.
    Choi, Mi-Jung
    Kim, Myung-Sup
    Hong, James W.
    CHALLENGES FOR NEXT GENERATION NETWORK OPERATIONS AND SERVICE MANAGEMENT, PROCEEDINGS, 2008, 5297 : 474 - +
  • [29] Classification of Friction and Wear State of Wind Turbine Gearboxes Using Decision Tree and Random Forest Algorithms
    Pang, Jinshan
    Chen, Yuming
    He, Shizhong
    Qiu, Huihe
    Wu, Chili
    Mao, Lingbo
    JOURNAL OF TRIBOLOGY-TRANSACTIONS OF THE ASME, 2021, 143 (09):
  • [30] Using Decision Tree Classification Algorithm to Predict Learner Typologies for Project-Based Learning
    Gyimah, Esther
    Dake, Delali Kwasi
    2019 INTERNATIONAL CONFERENCE ON COMPUTING, COMPUTATIONAL MODELLING AND APPLICATIONS (ICCMA), 2019, : 130 - 134