An index of effective number of variables for uncertainty and reliability analysis in model selection problems

被引:0
|
作者
Martino, Luca [1 ]
Morgado, Eduardo [1 ]
Castillo, Roberto San Millan [1 ]
机构
[1] Univ Rey Juan Carlos, Campus Fuenlabrada, Madrid, Spain
关键词
Model selection; Elbow detection; Information criterion; Effective Sample Size (ESS); Gini index; Uncertainty analysis; Variable importance; MARGINAL LIKELIHOOD; CROSS-VALIDATION; ORDER; ALGORITHM;
D O I
10.1016/j.sigpro.2024.109735
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An index of an effective number of variables (ENV) is introduced for model selection in nested models. This is the case, for instance, when we have to decide the order of a polynomial function or the number of bases in a nonlinear regression, choose the number of clusters in a clustering problem, or the number of features in a variable selection application (to name few examples). It is inspired by the idea of the maximum area under the curve (AUC). The interpretation of the ENV index is identical to the effective sample size (ESS) indices concerning a set of samples. The ENV index improves drawbacks of the elbow detectors described in the literature and introduces different confidence measures of the proposed solution. These novel measures can be also employed jointly with the use of different information criteria, such as the well-known AIC and BIC, or any other model selection procedures. Comparisons with classical and recent schemes are provided in different experiments involving real datasets. Related Matlab code is given.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Uncertainty Analysis of a Numerical Model for Calculating an Effective Source Height
    Ono H.
    Sada K.
    Transactions of the Atomic Energy Society of Japan, 2022, 21 (03) : 127 - 143
  • [32] Epistemic uncertainty-based reliability analysis for engineering system with hybrid evidence and fuzzy variables
    Wang, Chong
    Matthies, Hermann G.
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2019, 355 : 438 - 455
  • [33] Variable selection for single-index-driven autoregressive model with linear explanatory variables
    Huang, Jingwen
    Wang, Dehui
    STATISTICS, 2025, 59 (02) : 402 - 425
  • [34] An effective Kriging-based approximation for structural reliability analysis with random and interval variables
    Zhang, Xufang
    Wu, Zhenguang
    Ma, Hui
    Pandey, Mahesh D.
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2021, 63 (05) : 2473 - 2491
  • [35] An effective Kriging-based approximation for structural reliability analysis with random and interval variables
    Xufang Zhang
    Zhenguang Wu
    Hui Ma
    Mahesh D. Pandey
    Structural and Multidisciplinary Optimization, 2021, 63 : 2473 - 2491
  • [36] How digital economy index selection and model uncertainty will affect energy green transition
    Huang, Chenchen
    Lin, Boqiang
    ENERGY ECONOMICS, 2024, 136
  • [37] An effective evidence theory-based reliability analysis algorithm for structures with epistemic uncertainty
    Wang W.
    Xue H.
    Gao H.
    Quality and Reliability Engineering International, 2021, 37 (02) : 841 - 855
  • [38] Model uncertainty of SPT based methods for reliability analysis of soil liquefaction
    Sebaaly, G.
    Rahhal, M. E.
    EARTHQUAKE GEOTECHNICAL ENGINEERING FOR PROTECTION AND DEVELOPMENT OF ENVIRONMENT AND CONSTRUCTIONS, 2019, 4 : 4906 - 4913
  • [39] An optimal model using data envelopment analysis for uncertainty metrics in reliability
    Zu, Tianpei
    Kang, Rui
    Wen, Meilin
    Yang, Yi
    SOFT COMPUTING, 2018, 22 (16) : 5561 - 5568
  • [40] Reliability analysis of structures based on a probability-uncertainty hybrid model
    Zhang, Lei
    Zhang, Jianguo
    You, Lingfei
    Zhou, Shuang
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2019, 35 (01) : 263 - 279