Gradient-Boosted Based Structured and Unstructured Learning

Authors
Gavito, Andrea Trevino [1]
Klabjan, Diego [1]
Utke, Jean [2]
Affiliations
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Allstate Insurance Co, Northbrook, IL USA
Keywords
Deep learning; Multimodal learning; Gradient boosting
DOI
10.1007/978-3-031-44213-1_37
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose two frameworks for problem settings in which both structured and unstructured data are available. Structured data problems are best solved by traditional machine learning models such as boosting and tree-based algorithms, whereas deep learning has been widely applied to problems involving images, text, audio, and other unstructured data sources. When both structured and unstructured data are accessible, however, it is not obvious which modeling approach best improves performance on both data sources simultaneously. Our proposed frameworks allow joint learning on both kinds of data by integrating the paradigms of boosting models and deep neural networks. The first framework, the boosted-feature-vector deep learning network, learns features from the structured data using gradient boosting and combines them with embeddings from the unstructured data via a two-branch deep neural network. The second, the two-weak-learner boosting framework, extends the boosting paradigm to the setting with two input data sources. We present and compare first- and second-order methods for this framework. Experimental results on both public and real-world datasets show that the frameworks improve on selected baselines by 0.1% to 4.7%.
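The abstract does not spell out the algorithms, so the following is only a rough illustration of the general idea behind boosting with one weak learner per data source, not the authors' method. It assumes squared-error loss (so the first-order negative gradient is simply the residual), a one-split regression stump on a single structured feature, and a least-squares line on a single scalar that stands in for a learned embedding of the unstructured input. All names (`fit_stump`, `fit_linear`, `boost`) and the synthetic data are hypothetical.

```python
import random


def fit_stump(x, r):
    """Fit a one-split regression stump to residuals r on one structured feature x."""
    best = None
    for t in sorted(set(x))[:-1]:  # skip the max so the right side is never empty
        left = [ri for xi, ri in zip(x, r) if xi <= t]
        right = [ri for xi, ri in zip(x, r) if xi > t]
        ml = sum(left) / len(left)
        mr = sum(right) / len(right)
        sse = sum((ri - ml) ** 2 for ri in left) + sum((ri - mr) ** 2 for ri in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    if best is None:  # constant feature: fall back to the mean residual
        mean = sum(r) / len(r)
        return lambda v: mean
    _, t, ml, mr = best
    return lambda v: ml if v <= t else mr


def fit_linear(z, r):
    """Least-squares line on one scalar z (assumed stand-in for a neural weak learner)."""
    n = len(z)
    mz, mr = sum(z) / n, sum(r) / n
    var = sum((zi - mz) ** 2 for zi in z)
    slope = sum((zi - mz) * (ri - mr) for zi, ri in zip(z, r)) / var if var else 0.0
    return lambda v: mr + slope * (v - mz)


def boost(xs, zs, y, rounds=20, lr=0.5):
    """Each round adds one weak learner per data source, each fit to current residuals."""
    pred = [0.0] * len(y)
    learners = []
    for _ in range(rounds):
        # Residuals are the negative gradient of 0.5 * (y - pred)^2.
        r = [yi - pi for yi, pi in zip(y, pred)]
        h = fit_stump(xs, r)
        pred = [p + lr * h(x) for p, x in zip(pred, xs)]
        r = [yi - pi for yi, pi in zip(y, pred)]
        g = fit_linear(zs, r)
        pred = [p + lr * g(z) for p, z in zip(pred, zs)]
        learners.append((h, g))
    return learners, pred


def mse(y, pred):
    return sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / len(y)


# Synthetic data: the target depends on both the structured and "unstructured" inputs.
random.seed(0)
n = 100
xs = [random.uniform(-1, 1) for _ in range(n)]
zs = [random.gauss(0, 1) for _ in range(n)]
y = [(1.0 if x > 0 else -1.0) + 2.0 * z + random.gauss(0, 0.1) for x, z in zip(xs, zs)]

mse_before = mse(y, [0.0] * n)
learners, pred = boost(xs, zs, y)
mse_after = mse(y, pred)
print(mse_before, mse_after)
```

In this toy setup neither weak learner alone can represent the target, since the step in `xs` is invisible to the linear learner and the slope in `zs` is invisible to the stump; alternating the two within each round lets the additive model pick up both signals. A second-order variant would also use curvature of the loss, which is constant for squared error and therefore not shown here.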
Pages: 439-451
Page count: 13