Supervised Machine Learning Model to Predict the Bank Loan Application Using Binary Classification, Decision Tree and Random Forest

被引:0
|
作者
Gnanasekar, A. [1 ]
Rani, P. Shobha [1 ]
Akash, S. [1 ]
Arjunan, S. [1 ]
Devananth, A. [1 ]
机构
[1] RMD Engn Coll, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Anomaly detection; ReMEMBeR; multi-contextual behavior profiling; filtering; online banking; fraud detection; E-COMMERCE TRANSACTIONS; CARD FRAUD DETECTION; SYSTEM;
D O I
10.9756/INT-JECSE/V14I2.499
中图分类号
G76 [特殊教育];
学科分类号
040109 ;
摘要
This paper presents an online fraud detection system that uses anomaly detection to monitor an individual's behavior pattern and compare it with its usage history, which is a representation of the user's normal behavior patterns. Fraud is indicated by any significant deviation from normal behavior. The mechanism suffers from three disadvantages. Limited observations from the historical data, assorted nature of transaction data, and highly distorted information lead to unusually high positive failure rates of anomaly detection. Therefore, we propose a ranking metric embedding-based multi-contextual behavior profiling (ReMEMBeR) model to incorporate the detection mechanism effectively. We transform the original anomaly detection problem into a pseudo-recommender system problem and solve it using a ranking metric embedding-based method. With collaborative filtering, an individual could utilize information from similar individuals implicitly and automatically, which alleviates the individual's possible lack of historical data. By the ranking scheme, the model is trained to maximize the ability to distinguish between legitimate and fraudulent transactions. This helps to make full use of label information and, thus, solves the data skewness problem to the utmost extent. The proposed model integrates multi-contextual behaviour patterns, from purely local to more global ones. Evaluating transactions against multi- contextual behaviour patterns could reduce the error rate and, hence, could bring down the false positive rate. By creating a contrast vector for each transaction based on the customer's past behavior sequence, we profile the differentiation rate of each current transaction against the customer's behavior preference.
引用
收藏
页码:4510 / 4518
页数:9
相关论文
共 50 条
  • [31] Multiclass cancer classification by using fuzzy support vector machine and binary decision tree with gene selection
    Mao, Y
    Zhou, XB
    Pi, DY
    Sun, YX
    Wong, STC
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02): : 160 - 171
  • [32] EMLARDE tree: ensemble machine learning based random de-correlated extra decision tree for the forest cover type prediction
    Guhan, T.
    Revathy, N.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 8525 - 8536
  • [33] Employees reviews classification and evaluation (ERCE) model using supervised machine learning approaches
    Muhammad Saqlain Rehan
    Furqan Rustam
    Saleem Ullah
    Safdar Hussain
    Arif Mehmood
    Gyu Sang Choi
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 3119 - 3136
  • [34] Employees reviews classification and evaluation (ERCE) model using supervised machine learning approaches
    Rehan, Muhammad Saqlain
    Rustam, Furqan
    Ullah, Saleem
    Hussain, Safdar
    Mehmood, Arif
    Choi, Gyu Sang
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (6) : 3119 - 3136
  • [35] Enhancing skin lesion Classification: A machine learning approach using KNN, XGBoost, and Random Forest
    Hussain, S. K. Rhaber
    Powar, Omkar S.
    2024 CONTROL INSTRUMENTATION SYSTEM CONFERENCE, CISCON 2024, 2024,
  • [36] National classification of surface-groundwater interaction using random forest machine learning technique
    Yang, Jing
    Griffiths, James
    Zammit, Christian
    RIVER RESEARCH AND APPLICATIONS, 2019, 35 (07) : 932 - 943
  • [37] Prediction of Heart Disease using Forest Algorithm over Decision Tree using Machine Learning with Improved Accuracy
    Raj, K. N. S. Shanmukha
    Thinakaran, K.
    CARDIOMETRY, 2022, (25): : 1520 - 1525
  • [38] Uncovering Los Angeles Tourists' Patterns Using Geospatial Analysis and Supervised Machine Learning with Random Forest Predictors
    Lee, Yuan-Yuan
    Chang, Yi Ling
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 1275 - 1280
  • [39] Lessons Learnt From Using the Machine Learning Random Forest Algorithm to Predict Virulence in Streptococcus pyogenes
    Buckley, Sean J.
    Harvey, Robert J.
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2021, 11
  • [40] PREDICTING THE OUTCOMES OF IN VITRO FERTILIZATION PROGRAMS USING A RANDOM FOREST MACHINE LEARNING MODEL
    Vladimirsky, G. M.
    Zhuravleva, M. A.
    Dashieva, A. E.
    Korneeva, I. E.
    Nazarenko, T. A.
    BULLETIN OF RUSSIAN STATE MEDICAL UNIVERSITY, 2023, (06): : 64 - 70