Metafeature Selection via Multivariate Sparse-Group Lasso Learning for Automatic Hyperparameter Configuration Recommendation

被引:3
|
作者
Deng, Liping [1 ]
Chen, Wen-Sheng [2 ]
Xiao, Mingqing [1 ]
机构
[1] Southern Illinois Univ Carbondale, Sch Math & Stat Sci, Carbondale, IL 62901 USA
[2] Shenzhen Univ, Coll Math & Stat, Shenzhen 618060, Guangdong, Peoples R China
基金
美国国家科学基金会;
关键词
Classification algorithms; Task analysis; Metadata; Support vector machines; Optimization; Kernel; Feature extraction; Automatic hyperparameter recommendation; metafeature selection; metalearning (MtL); multivariate sparse-group Lasso (SGLasso); META; REGRESSION; ALGORITHMS; SEARCH;
D O I
10.1109/TNNLS.2023.3263506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of classification algorithms is mainly governed by the hyperparameter settings deployed in applications, and the search for desirable hyperparameter configurations usually is quite challenging due to the complexity of datasets. Metafeatures are a group of measures that characterize the underlying dataset from various aspects, and the corresponding recommendation algorithm fully relies on the appropriate selection of metafeatures. Metalearning (MtL), aiming to improve the learning algorithm itself, requires development in integrating features, models, and algorithm learning to accomplish its goal. In this article, we develop a multivariate sparse-group Lasso (SGLasso) model embedded with MtL capacity in recommending suitable configurations via learning. The main idea is to select the principal metafeatures by removing those redundant or irregular ones, promoting both efficiency and performance in the hyperparameter configuration recommendation. To be specific, we first extract the metafeatures and classification performance of a set of configurations from the collection of historical datasets, and then, a metaregression task is established through SGLasso to capture the main characteristics of the underlying relationship between metafeatures and historical performance. For a new dataset, the classification performance of configurations can be estimated through the selected metafeatures so that the configuration with the highest predictive performance in terms of the new dataset can be generated. Furthermore, a general MtL architecture combined with our model is developed. Extensive experiments are conducted on 136 UCI datasets, demonstrating the effectiveness of the proposed approach. The empirical results on the well-known SVM show that our model can effectively recommend suitable configurations and outperform the existing MtL-based methods and the well-known search-based algorithms, such as random search, Bayesian optimization, and Hyperband.
引用
收藏
页码:12540 / 12552
页数:13
相关论文
共 28 条
  • [21] Unsupervised feature selection via joint local learning and group sparse regression
    Yue WU
    Can WANG
    Yue-qing ZHANG
    Jia-jun BU
    Frontiers of Information Technology & Electronic Engineering, 2019, 20 (04) : 538 - 553
  • [22] Unsupervised feature selection via joint local learning and group sparse regression
    Wu, Yue
    Wang, Can
    Zhang, Yue-qing
    Bu, Jia-jun
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2019, 20 (04) : 538 - 553
  • [23] Unsupervised feature selection via joint local learning and group sparse regression
    Yue Wu
    Can Wang
    Yue-qing Zhang
    Jia-jun Bu
    Frontiers of Information Technology & Electronic Engineering, 2019, 20 : 538 - 553
  • [24] Evaluating the Predictive Power of Multivariate Tensor-based Morphometry in Alzheimers Disease Progression via Convex Fused Sparse Group Lasso
    Tsao, Sinchai
    Gajawelli, Niharika
    Zhou, Jiayu
    Shi, Jie
    Ye, Jieping
    Wang, Yalin
    Lepore, Natasha
    MEDICAL IMAGING 2014: IMAGE PROCESSING, 2014, 9034
  • [25] Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity
    Xu, Shiyun
    Bu, Zhiqi
    Chaudhari, Pratik
    Barnett, Ian J.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 343 - 359
  • [26] Automatic Spatial-Spectral Feature Selection for Hyperspectral Image via Discriminative Sparse Multimodal Learning
    Zhang, Qian
    Tian, Yuan
    Yang, Yiping
    Pan, Chunhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (01): : 261 - 279
  • [27] Automatic Optic Disc Detection in Retinal Images via Group Sparse Regularization Extreme Learning Machine
    Zhou, Wei
    Wu, Chengdong
    Du, Wenyou
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11053 - 11058
  • [28] FedGroup-Prune: IoT Device Amicable and Training-Efficient Federated Learning via Combined Group Lasso Sparse Model Pruning
    Chen, Ziyao
    Peng, Jialiang
    Kang, Jiawen
    Niyato, Dusit
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 40921 - 40932