AutoEIS: Automatic feature embedding, interaction and selection on default prediction

被引:2
|
作者
Xiao, Kai [1 ,2 ,3 ]
Jiang, Xiaohan [1 ,2 ,3 ]
Hou, Peng [1 ,2 ,3 ]
Zhu, Hongbin [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200438, Peoples R China
[3] Fudan Univ, FinTech Insitute, Shanghai 200438, Peoples R China
关键词
Data mining; Default prediction; Feature embedding; Feature interaction; RISK-ASSESSMENT; CREDIT;
D O I
10.1016/j.ipm.2023.103526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep models have shown the effectiveness in various areas, e.g., finance, healthcare and recommendation system. Among them, default prediction is a major application in the financial field. However, there are still problems remained, like insufficient learning for numerical feature encoding and difficulty of explicitly modelling high-order feature interactions. To address these issues, we propose an automatic end-to-end deep learning framework named AutoEIS. In this framework, we embed categorical and numerical features with distinct strategies. In particular, we design a Multi-field-aware Mixture-of-Experts (MfMoE) structure for numerical value embedding, which can simultaneously learn the single-field and global-field information. Then, by organically integrating attention mechanism, weighted-average aggregation and bilinear interaction, we can effectively generate significant high-order explicit interactions. Moreover, we integrate a DNN block to further capture the complex relationships among different variables. Comprehensive experiments on two real-world datasets of about 30,000 samples show the superiority of AutoEIS on default prediction, boosting average AUC and KS metrics over the best classical baseline by 0.49% and 3.5%, and the best of deep baselines by 0.48% and 2.55%. Furthermore, as a model-agnostic strategy, we can generalize MfMoE to other deep models like DeepFM, thereby boosting their performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [11] LncRNA–protein interaction prediction with reweighted feature selection
    Guohao Lv
    Yingchun Xia
    Zhao Qi
    Zihao Zhao
    Lianggui Tang
    Cheng Chen
    Shuai Yang
    Qingyong Wang
    Lichuan Gu
    BMC Bioinformatics, 24
  • [12] An integration of deep learning with feature embedding for protein-protein interaction prediction
    Yao, Yu
    Du, Xiuquan
    Diao, Yanyu
    Zhu, Huaixu
    PEERJ, 2019, 7
  • [13] Dissimilarity Space Representations and Automatic Feature Selection for Protein Function Prediction
    De Santis, Enrico
    Martino, Alessio
    Rizzi, Antonello
    Mascioli, Fabio Massimo Frattale
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [14] LncRNA-protein interaction prediction with reweighted feature selection
    Lv, Guohao
    Xia, Yingchun
    Qi, Zhao
    Zhao, Zihao
    Tang, Lianggui
    Chen, Cheng
    Yang, Shuai
    Wang, Qingyong
    Gu, Lichuan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [15] Graph embedding based feature selection
    Wei, Dan
    Li, Shutao
    Tan, Mingkui
    NEUROCOMPUTING, 2012, 93 : 115 - 125
  • [16] Automatic feature selection in neuroevolution
    Whiteson, Shimon
    Stone, Peter
    Stanley, Kenneth O.
    Mikkulainen, Risto
    Kohl, Nate
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1225 - 1232
  • [17] Automatic feature selection for supervised learning in link prediction applications: a comparative study
    Pecli, Antonio
    Cavalcanti, Maria Claudia
    Goldschmidt, Ronaldo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (01) : 85 - 121
  • [18] Automatic feature selection for supervised learning in link prediction applications: a comparative study
    Antonio Pecli
    Maria Claudia Cavalcanti
    Ronaldo Goldschmidt
    Knowledge and Information Systems, 2018, 56 : 85 - 121
  • [19] Feature Interaction for Streaming Feature Selection
    Zhou, Peng
    Li, Peipei
    Zhao, Shu
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4691 - 4702
  • [20] Robust neighborhood embedding for unsupervised feature selection
    Liu, Yanfang
    Ye, Dongyi
    Li, Wenbin
    Wang, Huihui
    Gao, Yang
    KNOWLEDGE-BASED SYSTEMS, 2020, 193