Tree-Based Feature Transformation for Purchase Behavior Prediction

被引:4
|
作者
Hou, Chunyan [1 ]
Chen, Chen [2 ]
Wang, Jinsong [1 ]
机构
[1] Tianjin Univ Technol, Sch Comp & Commun Engn, Tianjin, Peoples R China
[2] Nankai Univ, Coll Comp & Control Engn, Tianjin, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
feature transformation; purchase behavior prediction; DIMENSIONALITY;
D O I
10.1587/transinf.2017EDL8210
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the era of e-commerce, purchase behavior prediction is one of the most important issues to promote both online companies' sales and the consumers' experience. The previous researches usually use the feature engineering and ensemble machine learning algorithms for the prediction. The performance really depends on designed features and the scalability of algorithms because the large-scale data and a lot of categorical features lead to huge samples and the high-dimensional feature. In this study, we explore an alternative to use tree-based Feature Transformation (FT) and simple machine learning algorithms (e.g. Logistic Regression). Random Forest (RF) and Gradient Boosting decision tree (GB) are used for FT. Then, the simple algorithm, rather than ensemble algorithms, is used to predict purchase behavior based on transformed features. Tree-based FT regards the leaves of trees as transformed features, and can learn high-order interactions among original features. Compared with RF, if GB is used for FT, simple algorithms are enough to achieve better performance.
引用
收藏
页码:1441 / 1444
页数:4
相关论文
共 50 条
  • [31] Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking
    Tolomei, Gabriele
    Silvestri, Fabrizio
    Haines, Andrew
    Lalmas, Mounia
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 465 - 474
  • [32] Tree-based software quality estimation models for fault prediction
    Khoshgoftaar, TM
    Seliya, N
    EIGHTH IEEE SYMPOSIUM ON SOFTWARE METRICS, PROCEEDINGS, 2002, : 203 - 214
  • [33] Interpreting tree-based prediction models and their data in machining processes
    Bustillo, Andres
    Grzenda, Maciej
    Macukow, Bohdan
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2016, 23 (04) : 349 - 367
  • [34] A Decision Tree-Based Method for Protein Contact Map Prediction
    Santiesteban Toca, Cosme Ernesto
    Marquez Chamorro, Alfonso E.
    Asencio Cortes, Gualberto
    Aguilar-Ruiz, Jesus S.
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, 2011, 6623 : 153 - 158
  • [35] Prediction performance of improved decision tree-based algorithms: a review
    Mienye, Ibomoiye Domor
    Sun, Yanxia
    Wang, Zenghui
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE MATERIALS PROCESSING AND MANUFACTURING (SMPM 2019), 2019, 35 : 698 - 703
  • [36] Tree-based classification and regression Part 3: Tree-based procedures
    Gunter, B
    QUALITY PROGRESS, 1998, 31 (02) : 121 - 123
  • [37] A tree-based Mergesort
    Moffat, A
    Petersson, O
    Wormald, NC
    ACTA INFORMATICA, 1998, 35 (09) : 775 - 793
  • [38] A Tree-based Mergesort
    Alistair Moffat
    Ola Petersson
    Nicholas C. Wormald
    Acta Informatica, 1998, 35 : 775 - 793
  • [39] Comparing performance of non–tree-based and tree-based association mapping methods
    Katherine L. Thompson
    David W. Fardo
    BMC Proceedings, 10 (Suppl 7)
  • [40] An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost
    Selçuk Demir
    Emrehan Kutlug Sahin
    Neural Computing and Applications, 2023, 35 : 3173 - 3190