Supervised Machine Learning Model to Predict the Bank Loan Application Using Binary Classification, Decision Tree and Random Forest

被引:0
|
作者
Gnanasekar, A. [1 ]
Rani, P. Shobha [1 ]
Akash, S. [1 ]
Arjunan, S. [1 ]
Devananth, A. [1 ]
机构
[1] RMD Engn Coll, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Anomaly detection; ReMEMBeR; multi-contextual behavior profiling; filtering; online banking; fraud detection; E-COMMERCE TRANSACTIONS; CARD FRAUD DETECTION; SYSTEM;
D O I
10.9756/INT-JECSE/V14I2.499
中图分类号
G76 [特殊教育];
学科分类号
040109 ;
摘要
This paper presents an online fraud detection system that uses anomaly detection to monitor an individual's behavior pattern and compare it with its usage history, which is a representation of the user's normal behavior patterns. Fraud is indicated by any significant deviation from normal behavior. The mechanism suffers from three disadvantages. Limited observations from the historical data, assorted nature of transaction data, and highly distorted information lead to unusually high positive failure rates of anomaly detection. Therefore, we propose a ranking metric embedding-based multi-contextual behavior profiling (ReMEMBeR) model to incorporate the detection mechanism effectively. We transform the original anomaly detection problem into a pseudo-recommender system problem and solve it using a ranking metric embedding-based method. With collaborative filtering, an individual could utilize information from similar individuals implicitly and automatically, which alleviates the individual's possible lack of historical data. By the ranking scheme, the model is trained to maximize the ability to distinguish between legitimate and fraudulent transactions. This helps to make full use of label information and, thus, solves the data skewness problem to the utmost extent. The proposed model integrates multi-contextual behaviour patterns, from purely local to more global ones. Evaluating transactions against multi- contextual behaviour patterns could reduce the error rate and, hence, could bring down the false positive rate. By creating a contrast vector for each transaction based on the customer's past behavior sequence, we profile the differentiation rate of each current transaction against the customer's behavior preference.
引用
收藏
页码:4510 / 4518
页数:9
相关论文
共 50 条
  • [41] Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification
    Bifarin, Olatomiwa O.
    PLOS ONE, 2023, 18 (05):
  • [42] Using Decision Tree Aggregation with Random Forest Model to Identify Gut Microbes Associated with Colorectal Cancer
    Ai, Dongmei
    Pan, Hongfei
    Han, Rongbao
    Li, Xiaoxin
    Liu, Gang
    Xia, Li C.
    GENES, 2019, 10 (02):
  • [43] Sentiment Analysis and Fake Amazon Reviews Classification Using SVM Supervised Machine Learning Model
    Tabany, Myasar
    Gueffal, Meriem
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (01) : 49 - 58
  • [44] Implementation of Decision Tree Using C4.5 Algorithm in Decision Making of Loan Application by Debtor (Case Study: Bank Pasar of Yogyakarta Special Region)
    Amin, Rafik Khairul
    Indwiarti
    Sibaroni, Yuliant
    2015 3rd International Conference on Information and Communication Technology (ICoICT), 2015, : 75 - 80
  • [45] A Hybrid Model for Prediction of Diabetes Using Machine Learning Classification Algorithms and Random Projection
    Poornima, V.
    RamyaDevi, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 139 (03) : 1437 - 1449
  • [46] The VMC Survey - LI. Classifying extragalactic sources using a probabilistic random forest supervised machine learning algorithm
    Pennock, Clara M.
    van Loon, Jacco Th.
    Cioni, Maria-Rosa L.
    Maitra, Chandreyee
    Oliveira, Joana M.
    Craig, Jessica E. M.
    Ivanov, Valentin D.
    Aird, James
    Anih, Joy O.
    Cross, Nicholas J. G.
    Dresbach, Francesca
    de Grijs, Richard
    Groenewegen, Martin A. T.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2025, 537 (02) : 1028 - 1055
  • [47] E- ANTENATAL ASSISTANCE CARE USING DECISION TREE ANALYTICS AND CLUSTER ANALYTICS BASED SUPERVISED MACHINE LEARNING
    Saranya, G.
    Geetha, G.
    Safa, M.
    2017 IEEE INTERNATIONAL CONFERENCE ON IOT AND ITS APPLICATIONS (IEEE ICIOT), 2017,
  • [48] Machine Learning Approach to Predict Physical Properties of Polypropylene Composites: Application of MLR, DNN, and Random Forest to Industrial Data
    Joo, Chonghyo
    Park, Hyundo
    Kwon, Hyukwon
    Lim, Jongkoo
    Shin, Eunchul
    Cho, Hyungtae
    Kim, Junghwan
    POLYMERS, 2022, 14 (17)
  • [49] DIRECT ESTIMATION OF ECOSYSTEM WATER USE EFFICIENCY USING THE RANDOM FOREST MACHINE LEARNING MODEL
    Sun, Yifei
    Huang, Lingxiao
    Wang, Junrui
    Liu, Meng
    Di, Suchuang
    Yang, Simin
    Zhang, Hang
    Zhang, Cen
    Tang, Ronglin
    2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2024), 2024, : 10550 - 10553
  • [50] Detecting industrial discharges at an advanced water reuse facility using online instrumentation and supervised machine learning binary classification
    Thompson, Kyle A. A.
    Branch, Amos
    Nading, Tyler
    Dziura, Thomas
    Salazar-Benites, Germano
    Wilson, Chris
    Bott, Charles
    Salveson, Andrew
    Dickenson, Eric R. V.
    FRONTIERS IN WATER, 2022, 4