AutoEIS: Automatic feature embedding, interaction and selection on default prediction

被引:2
|
作者
Xiao, Kai [1 ,2 ,3 ]
Jiang, Xiaohan [1 ,2 ,3 ]
Hou, Peng [1 ,2 ,3 ]
Zhu, Hongbin [3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200438, Peoples R China
[3] Fudan Univ, FinTech Insitute, Shanghai 200438, Peoples R China
关键词
Data mining; Default prediction; Feature embedding; Feature interaction; RISK-ASSESSMENT; CREDIT;
D O I
10.1016/j.ipm.2023.103526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep models have shown the effectiveness in various areas, e.g., finance, healthcare and recommendation system. Among them, default prediction is a major application in the financial field. However, there are still problems remained, like insufficient learning for numerical feature encoding and difficulty of explicitly modelling high-order feature interactions. To address these issues, we propose an automatic end-to-end deep learning framework named AutoEIS. In this framework, we embed categorical and numerical features with distinct strategies. In particular, we design a Multi-field-aware Mixture-of-Experts (MfMoE) structure for numerical value embedding, which can simultaneously learn the single-field and global-field information. Then, by organically integrating attention mechanism, weighted-average aggregation and bilinear interaction, we can effectively generate significant high-order explicit interactions. Moreover, we integrate a DNN block to further capture the complex relationships among different variables. Comprehensive experiments on two real-world datasets of about 30,000 samples show the superiority of AutoEIS on default prediction, boosting average AUC and KS metrics over the best classical baseline by 0.49% and 3.5%, and the best of deep baselines by 0.48% and 2.55%. Furthermore, as a model-agnostic strategy, we can generalize MfMoE to other deep models like DeepFM, thereby boosting their performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Feature detection with automatic scale selection
    Lindeberg, T
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 30 (02) : 79 - 116
  • [32] Feature Detection with Automatic Scale Selection
    Tony Lindeberg
    International Journal of Computer Vision, 1998, 30 : 79 - 116
  • [33] Automatic determination of size for feature selection
    Kudo, M
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVI, PROCEEDINGS: COMPUTER SCIENCE III, 2002, : 305 - 311
  • [34] Feature selection for automatic image annotation
    Setia, Lokesh
    Burkhardt, Hans
    PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 294 - 303
  • [35] Feature Selection for Link Prediction
    Xu, Ye
    Rockmore, Dan
    PROCEEDINGS OF THE 5TH PH.D. WORKSHOP ON INFORMATION AND KNOWLEDGE, 2012, : 25 - 32
  • [36] Feature selection in bankruptcy prediction
    Tsai, Chih-Fong
    KNOWLEDGE-BASED SYSTEMS, 2009, 22 (02) : 120 - 127
  • [37] Using Feature Selection Technique for Drug-Target Interaction Networks Prediction
    Yu, W.
    Jiang, Z.
    Wang, J.
    Tao, R.
    CURRENT MEDICINAL CHEMISTRY, 2011, 18 (36) : 5687 - 5693
  • [38] A comparative study of combining tree-based feature selection methods and classifiers in personal loan default prediction
    Guo, Weidong
    Zhou, Zach Zhizhong
    JOURNAL OF FORECASTING, 2022, 41 (06) : 1248 - 1313
  • [39] Automatic Engagement Prediction with GAP Feature
    Niu, Xuesong
    Han, Hu
    Zeng, Jiabei
    Sun, Xuran
    Shan, Shiguang
    Huang, Yan
    Yang, Songfan
    Chen, Xilin
    ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 599 - 603
  • [40] Latent energy preserving embedding for unsupervised feature selection
    Song, Zihao
    Song, Peng
    DIGITAL SIGNAL PROCESSING, 2022, 132