Customer feature selection from high-dimensional bank direct marketing data for uplift modeling

被引：4

作者：

Hu, Jinping ^{[1
]}

机构：

[1] Shenzhen Technol Univ, 3002 Lantian Rd, Shenzhen 518118, Guangdong, Peoples R China

来源：

JOURNAL OF MARKETING ANALYTICS | 2023年 / 11卷 / 02期

关键词：

Bank direct marketing; Feature selection; Redundant features; Relevant features; Uplift modeling; RELEVANCE; PREDICTION; CHURN;

D O I：

10.1057/s41270-022-00160-z

中图分类号：

F [经济];

学科分类号：

02 ;

摘要：

Uplift modeling estimates the incremental impact (i.e., uplift) of a marketing campaign on customer outcomes. These models are essential to banks' direct marketing efforts. However, bank data are often high-dimensional, with hundreds to thousands of customer features; and keeping irrelevant and redundant features in an uplift model can be computationally inefficient and adversely affect model performance. Therefore, banks must narrow their feature selection for uplift modeling. Yet, literature on feature selection has rarely focused on uplift modeling. This paper proposes several two-step feature selection approaches to uplift models, structured to cluster highly relevant, low-redundant feature subsets from high-dimensional banking data. Empirical experiments show that fewer features in a selected set (20 out of 180 features) lead to 68.6% of these uplift models performing as well or better than complete feature set models.

引用

页码：160 / 171

页数：12

共 50 条

[21] A hybrid feature selection method for high-dimensional data
Taheri, Nooshin
Nezamabadi-pour, Hossein
2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
[22] On the scalability of feature selection methods on high-dimensional data
Bolon-Canedo, V.
Rego-Fernandez, D.
Peteiro-Barral, D.
Alonso-Betanzos, A.
Guijarro-Berdinas, B.
Sanchez-Marono, N.
KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (02) : 395 - 442
[23] A hybrid feature selection scheme for high-dimensional data
Ganjei, Mohammad Ahmadi
Boostani, Reza
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 113
[24] Evaluating Feature Selection Robustness on High-Dimensional Data
Pes, Barbara
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 235 - 247
[25] Feature selection for classifying high-dimensional numerical data
Wu, YM
Zhang, AD
PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 251 - 258
[26] Feature selection from high-dimensional hyperspectral and polarimetric data for target detection
Chen, XW
Casasent, D
OPTICAL PATTERN RECOGNITION XV, 2004, 5437 : 171 - 178
[27] A Light Causal Feature Selection Approach to High-Dimensional Data
Ling, Zhaolong
Li, Ying
Zhang, Yiwen
Yu, Kui
Zhou, Peng
Li, Bo
Wu, Xindong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7639 - 7650
[28] Single Sequence Fast Feature Selection for High-Dimensional Data
Boldt, Francisco de Assis
Rauber, Thomas W.
Varejao, Flavio M.
2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 697 - 704
[29] Filter Feature Selection Performance Comparison in High-dimensional Data
Huertas, Carlos
Juarez-Ramirez, Reyes
2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
[30] Feature selection based on geometric distance for high-dimensional data
Lee, J. -H.
Oh, S. -Y.
ELECTRONICS LETTERS, 2016, 52 (06) : 473 - 474

← 1 2 3 4 5 →