Customer feature selection from high-dimensional bank direct marketing data for uplift modeling

被引:4
|
作者
Hu, Jinping [1 ]
机构
[1] Shenzhen Technol Univ, 3002 Lantian Rd, Shenzhen 518118, Guangdong, Peoples R China
关键词
Bank direct marketing; Feature selection; Redundant features; Relevant features; Uplift modeling; RELEVANCE; PREDICTION; CHURN;
D O I
10.1057/s41270-022-00160-z
中图分类号
F [经济];
学科分类号
02 ;
摘要
Uplift modeling estimates the incremental impact (i.e., uplift) of a marketing campaign on customer outcomes. These models are essential to banks' direct marketing efforts. However, bank data are often high-dimensional, with hundreds to thousands of customer features; and keeping irrelevant and redundant features in an uplift model can be computationally inefficient and adversely affect model performance. Therefore, banks must narrow their feature selection for uplift modeling. Yet, literature on feature selection has rarely focused on uplift modeling. This paper proposes several two-step feature selection approaches to uplift models, structured to cluster highly relevant, low-redundant feature subsets from high-dimensional banking data. Empirical experiments show that fewer features in a selected set (20 out of 180 features) lead to 68.6% of these uplift models performing as well or better than complete feature set models.
引用
收藏
页码:160 / 171
页数:12
相关论文
共 50 条
  • [31] A general framework of nonparametric feature selection in high-dimensional data
    Yu, Hang
    Wang, Yuanjia
    Zeng, Donglin
    BIOMETRICS, 2023, 79 (02) : 951 - 963
  • [32] Hybrid fast unsupervised feature selection for high-dimensional data
    Manbari, Zhaleh
    AkhlaghianTab, Fardin
    Salavati, Chiman
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 97 - 118
  • [33] FsNet: Feature Selection Network on High-dimensional Biological Data
    Singh, Dinesh
    Climente-Gonzalez, Hector
    Petrovich, Mathis
    Kawakami, Eiryo
    Yamada, Makoto
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [34] Multistage feature selection approach for high-dimensional cancer data
    Alkuhlani, Alhasan
    Nassef, Mohammad
    Farag, Ibrahim
    SOFT COMPUTING, 2017, 21 (22) : 6895 - 6906
  • [35] On online high-dimensional spherical data clustering and feature selection
    Amayri, Ola
    Bouguila, Nizar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) : 1386 - 1398
  • [36] Feature Selection and Classification for High-Dimensional Incomplete Multimodal Data
    Deng, Wan-Yu
    Liu, Dan
    Dong, Ying-Ying
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [37] Diagonal Discriminant Analysis With Feature Selection for High-Dimensional Data
    Romanes, Sarah E.
    Ormerod, John T.
    Yang, Jean Y. H.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 114 - 127
  • [38] Genetic Programming for Feature Selection and Construction to High-Dimensional Data
    Ma, Jianbin
    Zhu, Man
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 196 - 200
  • [39] Multistage feature selection approach for high-dimensional cancer data
    Alhasan Alkuhlani
    Mohammad Nassef
    Ibrahim Farag
    Soft Computing, 2017, 21 : 6895 - 6906
  • [40] Scalable Feature Selection in High-Dimensional Data Based on GRASP
    Moshki, Mohsen
    Kabiri, Peyman
    Mohebalhojeh, Alireza
    APPLIED ARTIFICIAL INTELLIGENCE, 2015, 29 (03) : 283 - 296