Customer feature selection from high-dimensional bank direct marketing data for uplift modeling

被引：4

作者：

Hu, Jinping ^{[1
]}

机构：

[1] Shenzhen Technol Univ, 3002 Lantian Rd, Shenzhen 518118, Guangdong, Peoples R China

来源：

JOURNAL OF MARKETING ANALYTICS | 2023年 / 11卷 / 02期

关键词：

Bank direct marketing; Feature selection; Redundant features; Relevant features; Uplift modeling; RELEVANCE; PREDICTION; CHURN;

D O I：

10.1057/s41270-022-00160-z

中图分类号：

F [经济];

学科分类号：

02 ;

摘要：

Uplift modeling estimates the incremental impact (i.e., uplift) of a marketing campaign on customer outcomes. These models are essential to banks' direct marketing efforts. However, bank data are often high-dimensional, with hundreds to thousands of customer features; and keeping irrelevant and redundant features in an uplift model can be computationally inefficient and adversely affect model performance. Therefore, banks must narrow their feature selection for uplift modeling. Yet, literature on feature selection has rarely focused on uplift modeling. This paper proposes several two-step feature selection approaches to uplift models, structured to cluster highly relevant, low-redundant feature subsets from high-dimensional banking data. Empirical experiments show that fewer features in a selected set (20 out of 180 features) lead to 68.6% of these uplift models performing as well or better than complete feature set models.

引用

页码：160 / 171

页数：12

共 50 条

[1] Customer feature selection from high-dimensional bank direct marketing data for uplift modeling
Jinping Hu
Journal of Marketing Analytics, 2023, 11 : 160 - 171
[2] Feature selection for high-dimensional data
Destrero A.
Mosci S.
De Mol C.
Verri A.
Odone F.
Computational Management Science, 2009, 6 (1) : 25 - 40
[3] Feature selection for high-dimensional data
Bolón-Canedo V.
Sánchez-Maroño N.
Alonso-Betanzos A.
Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
[4] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
Verleysen, Michel
NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
[5] Feature selection for high-dimensional imbalanced data
Yin, Liuzhi
Ge, Yong
Xiao, Keli
Wang, Xuehua
Quan, Xiaojun
NEUROCOMPUTING, 2013, 105 : 3 - 11
[6] Feature selection for high-dimensional data in astronomy
Zheng, Hongwen
Zhang, Yanxia
ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964
[7] A filter feature selection for high-dimensional data
Janane, Fatima Zahra
Ouaderhman, Tayeb
Chamlal, Hasna
JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
[8] Feature selection for high-dimensional temporal data
Michail Tsagris
Vincenzo Lagani
Ioannis Tsamardinos
BMC Bioinformatics, 19
[9] Feature Selection with High-Dimensional Imbalanced Data
Van Hulse, Jason
Khoshgoftaar, Taghi M.
Napolitano, Amri
Wald, Randall
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
[10] Feature selection for high-dimensional temporal data
Tsagris, Michail
Lagani, Vincenzo
Tsamardinos, Ioannis
BMC BIOINFORMATICS, 2018, 19

← 1 2 3 4 5 →