SigOpt Mulch: An intelligent system for AutoML of gradient boosted trees

Cited by: 1

Authors:

Sorokin, Aleksei [1]

Zhu, Xinran [2]

Lee, Eric Hans [3]

Cheng, Bolong [3]

Affiliations:

[1] IIT, Chicago, IL USA

[2] Cornell Univ, Ithaca, NY USA

[3] SigOpt Intel Co, San Francisco, CA 94104 USA

Source:

KNOWLEDGE-BASED SYSTEMS | 2023 / Vol. 273

Keywords:

Automated machine learning (autoML); Hyperparameter optimization (HPO); Gradient boosted trees; EFFICIENT;

DOI:

10.1016/j.knosys.2023.110604

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Gradient boosted trees (GBTs) are ubiquitous models used by researchers, machine learning (ML) practitioners, and data scientists because of their robust performance, interpretable behavior, and ease of use. One critical challenge in training GBTs is the tuning of their hyperparameters. In practice, selecting these hyperparameters is often done manually. Recently, the ML community has advocated for tuning hyperparameters through black-box optimization and developed state-of-the-art systems to do so. However, applying such systems to tune GBTs suffers from two drawbacks. First, these systems are not model-aware; rather, they are designed to apply to a generic model, which leaves significant optimization performance on the table. Second, using these systems requires domain knowledge, such as the choice of hyperparameter search space, which is antithetical to the automatic experimentation that black-box optimization aims to provide. In this paper, we present SigOpt Mulch, a model-aware hyperparameter tuning system specifically designed for automated tuning of GBTs that provides two improvements over existing systems. First, Mulch leverages powerful techniques in metalearning and multifidelity optimization to perform model-aware hyperparameter optimization. Second, it automates the process of learning performant hyperparameters by making intelligent decisions about the optimization search space, thus reducing the need for user domain knowledge. These innovations allow Mulch to identify good GBT hyperparameters far more efficiently, and in a more seamless and user-friendly way, than existing black-box hyperparameter tuning systems. © 2023 Published by Elsevier B.V.


Pages: 12
