AutoEIS: Automatic feature embedding, interaction and selection on default prediction

被引:2
|
作者
Xiao, Kai [1 ,2 ,3 ]
Jiang, Xiaohan [1 ,2 ,3 ]
Hou, Peng [1 ,2 ,3 ]
Zhu, Hongbin [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200438, Peoples R China
[3] Fudan Univ, FinTech Insitute, Shanghai 200438, Peoples R China
关键词
Data mining; Default prediction; Feature embedding; Feature interaction; RISK-ASSESSMENT; CREDIT;
D O I
10.1016/j.ipm.2023.103526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep models have shown the effectiveness in various areas, e.g., finance, healthcare and recommendation system. Among them, default prediction is a major application in the financial field. However, there are still problems remained, like insufficient learning for numerical feature encoding and difficulty of explicitly modelling high-order feature interactions. To address these issues, we propose an automatic end-to-end deep learning framework named AutoEIS. In this framework, we embed categorical and numerical features with distinct strategies. In particular, we design a Multi-field-aware Mixture-of-Experts (MfMoE) structure for numerical value embedding, which can simultaneously learn the single-field and global-field information. Then, by organically integrating attention mechanism, weighted-average aggregation and bilinear interaction, we can effectively generate significant high-order explicit interactions. Moreover, we integrate a DNN block to further capture the complex relationships among different variables. Comprehensive experiments on two real-world datasets of about 30,000 samples show the superiority of AutoEIS on default prediction, boosting average AUC and KS metrics over the best classical baseline by 0.49% and 3.5%, and the best of deep baselines by 0.48% and 2.55%. Furthermore, as a model-agnostic strategy, we can generalize MfMoE to other deep models like DeepFM, thereby boosting their performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Performance Evaluation of Feature Selection Techniques for Credit Default Prediction
    Ayogu, Ikechukwu Ignatius
    Popoola, Olugbemiga Solomon
    Mebawondu, Olamatanmi Josephine
    Ugwu, Chukwuemeka Christian
    Adetunmbi, Adebayo Olusola
    2022 IEEE NIGERIA 4TH INTERNATIONAL CONFERENCE ON DISRUPTIVE TECHNOLOGIES FOR SUSTAINABLE DEVELOPMENT (IEEE NIGERCON), 2022, : 205 - 209
  • [2] Bayesian Automatic Relevance Determination for Feature Selection in Credit Default Modelling
    Mbuvha, Rendani
    Boulkaibet, Illyes
    Marwala, Tshilidzi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 420 - 425
  • [3] Community-Based Feature Selection for Credit Card Default Prediction
    Wang, Qiucheng
    Hu, Yanmei
    Li, Jun
    COMPLEX NETWORKS & THEIR APPLICATIONS VI, 2018, 689 : 153 - 165
  • [4] A Feature Selection Based on Network Structure for Credit Card Default Prediction
    Hu, Yanmei
    Ren, Yuchun
    Wang, Qiucheng
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2019, 2019, 1042 : 275 - 286
  • [5] AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction
    Liu, Bin
    Zhu, Chenxu
    Li, Guilin
    Zhang, Weinan
    Lai, Jincai
    Tang, Ruiming
    He, Xiuqiang
    Li, Zhenguo
    Yu, Yong
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2636 - 2645
  • [6] Feature selection by Universum embedding
    Li, Chun-Na
    Huang, Ling-Wei
    Shao, Yuan-Hai
    Guo, Tingting
    Mao, Yu
    PATTERN RECOGNITION, 2024, 153
  • [7] Improved Automatic Feature Selection Approach for Health Risk Prediction
    Gajare, Shreyal
    Sonawani, Shilpa
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 816 - 819
  • [8] Automatic Feature Selection for Desertion and Graduation Prediction: A Chilean Case
    Peralta, B.
    Poblete, T.
    Caro, L.
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [9] Automatic Feature Selection in the SOPFs Dissolution Profiles Prediction Problem
    Salazar-Jimenez, J. E.
    Sanchez-Carvajal, J. D.
    Quiros-Gomez, B.
    Arias-Londono, J. D.
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2017, : 52 - 58
  • [10] Automatic Feature Selection by Regularization to Improve Bug Prediction Accuracy
    Osman, Haidar
    Ghafari, Mohammad
    Nierstrasz, Oscar
    2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING TECHNIQUES FOR SOFTWARE QUALITY EVALUATION (MALTESQUE), 2017, : 27 - 32