AutoEIS: Automatic feature embedding, interaction and selection on default prediction

Cited by: 2
Authors
Xiao, Kai [1, 2, 3]
Jiang, Xiaohan [1, 2, 3]
Hou, Peng [1, 2, 3]
Zhu, Hongbin [3]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200438, Peoples R China
[3] Fudan Univ, FinTech Institute, Shanghai 200438, Peoples R China
Keywords
Data mining; Default prediction; Feature embedding; Feature interaction; RISK-ASSESSMENT; CREDIT;
DOI
10.1016/j.ipm.2023.103526
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Deep models have proven effective in various areas, e.g., finance, healthcare and recommender systems. Default prediction is a major application of such models in the financial field. However, problems remain, such as insufficient learning of numerical feature encodings and the difficulty of explicitly modelling high-order feature interactions. To address these issues, we propose an automatic end-to-end deep learning framework named AutoEIS. In this framework, we embed categorical and numerical features with distinct strategies. In particular, we design a Multi-field-aware Mixture-of-Experts (MfMoE) structure for numerical value embedding, which simultaneously learns single-field and global-field information. Then, by integrating an attention mechanism, weighted-average aggregation and bilinear interaction, we generate significant high-order explicit interactions. Moreover, we integrate a DNN block to further capture the complex relationships among different variables. Comprehensive experiments on two real-world datasets of about 30,000 samples show the superiority of AutoEIS on default prediction, boosting the average AUC and KS metrics over the best classical baseline by 0.49% and 3.5%, and over the best deep baseline by 0.48% and 2.55%. Furthermore, as a model-agnostic strategy, MfMoE can be generalized to other deep models such as DeepFM, thereby boosting their performance.
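The abstract reports its gains in AUC and KS, the two metrics commonly used to score default-prediction models. As a minimal sketch of how these two metrics are usually defined (not the authors' evaluation code), the following computes both from labels and predicted scores. The function names `auc_score` and `ks_statistic` and the toy data are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def auc_score(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random defaulter (label 1) receives a
    higher score than a random non-defaulter (label 0); ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def ks_statistic(labels: Sequence[int], scores: Sequence[float]) -> float:
    """KS as the largest absolute gap between the true-positive rate and the
    false-positive rate over all score thresholds."""
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative sample")
    best = 0.0
    for t in sorted(set(scores)):
        # Samples with score >= t are predicted as defaulters at threshold t.
        tpr = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= t) / n_pos
        fpr = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= t) / n_neg
        best = max(best, abs(tpr - fpr))
    return best


# Toy data (invented for illustration, not taken from the paper's datasets).
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
print(auc_score(toy_labels, toy_scores))
print(ks_statistic(toy_labels, toy_scores))
```

The quadratic pairwise loop keeps the definition of AUC visible; on datasets of the size mentioned in the abstract, a sort-based implementation from an established library would be the more practical choice.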
Pages: 16
Related papers
50 in total
  • [21] Locality and similarity preserving embedding for feature selection
    Fang, Xiaozhao
    Xu, Yong
    Li, Xuelong
    Fan, Zizhu
    Liu, Hong
    Chen, Yan
    NEUROCOMPUTING, 2014, 128 : 304 - 315
  • [22] Sparse Graph Embedding Unsupervised Feature Selection
    Wang, Shiping
    Zhu, William
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (03): : 329 - 341
  • [23] Unsupervised Feature Selection via Hypergraph Embedding
    Zhang, Zhihong
    Ren, Peng
    Hancock, Edwin R.
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [24] CRIME EVENT EMBEDDING WITH UNSUPERVISED FEATURE SELECTION
    Zhu, Shixiang
    Xie, Yao
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3922 - 3926
  • [25] HYBRID FEATURE EMBEDDING FOR AUTOMATIC BUILDING OUTLINE EXTRACTION
    Ran, Weihang
    Yuan, Wei
    Shi, Xiaodan
    Fan, Zipei
    Shibasaki, Ryosuke
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 3652 - 3655
  • [26] Context and auto-interaction are all you need: Towards context embedding based QoS prediction via automatic feature interaction for high quality cloud API delivery
    Chen, Zhen
    Pan, Maosheng
    He, Pengfei
    Qi, Wenchao
    Liu, Linlin
    Shen, Limin
    You, Dianlong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 128 : 265 - 281
  • [27] Automatic feature selection by genetic algorithms
    Eberhardt, M
    Kossebau, FWH
    König, A
    ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, 2001, : 256 - 259
  • [28] Automatic feature selection in EUV scatterometry
    Ansuinelli, Paolo
    Coene, Wim M. J.
    Urbach, H. P.
    APPLIED OPTICS, 2019, 58 (22) : 5916 - 5923
  • [29] ExploreKit: Automatic Feature Generation and Selection
    Katz, Gilad
    Shin, Eui Chul Richard
    Song, Dawn
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 979 - 984
  • [30] Feature Selection for Automatic Taxonomy Induction
    Yang, Hui
    Callan, Jamie
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 684 - 685