PREDICTION OF THE NASH THROUGH PENALIZED MIXTURE OF LOGISTIC REGRESSION MODELS

Authors
Morvan, Marie [1 ]
Devijver, Emilie [2 ]
Giacofci, Madison [1 ]
Monbet, Valerie [1 ]
Affiliations
[1] Univ Rennes, CNRS, IRMAR UMR 6625, Rennes, France
[2] Univ Grenoble Alpes, Grenoble INP, CNRS, INRIA, Grenoble, France
Source
ANNALS OF APPLIED STATISTICS | 2021 / Vol. 15 / No. 02
Keywords
Mixture regression model; prediction; variable selection; heterogeneous data; spectrometry data; FINITE MIXTURE; MAXIMUM-LIKELIHOOD; VARIABLE SELECTION; EM ALGORITHM;
DOI
10.1214/20-AOAS1409
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline Classification Codes
020208 ; 070103 ; 0714 ;
Abstract
In this paper, an appropriate and interpretable statistical diagnosis model is proposed to predict Nonalcoholic Steatohepatitis (NASH) from near-infrared spectrometry data. In this disease, unknown patient profiles are expected to lead to different diagnoses. The model must therefore take into account both the heterogeneity of the data and the dimension of the spectrometric data. To this end, we propose to fit a mixture model on the joint distribution of the binary diagnostic variable and the covariates selected in the spectra. The penalized maximum likelihood estimator is considered. In practice, a twofold penalty is imposed on both the regression coefficients and the covariance parameters. Automatic selection criteria, such as the AIC and BIC, are used to select the amount of shrinkage and the number of clusters. The performance of the overall procedure is evaluated in a simulation study, and its application to the NASH data set is analyzed. The model leads to better prediction performance than competing methods and provides highly interpretable results.
Pages: 952 - 970
Number of pages: 19