Multivariate contaminated normal mixture regression modeling of longitudinal data based on joint mean-covariance model

被引:1
|
作者
Niu, Xiaoyu [1 ]
Tian, Yuzhu [1 ]
Tang, Manlai [2 ]
Tian, Maozai [3 ]
机构
[1] Northwest Normal Univ, Sch Math & Stat, Lanzhou, Peoples R China
[2] Univ Hertfordshire, Sch Phys Engn & Comp Sci, Hatfield, Herts, England
[3] Renmin Univ China, Sch Stat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
ECM algorithm; joint mean-covariance modeling; mixture contaminated normal model; modified cholesky decomposition; outlier detection; MAXIMUM-LIKELIHOOD-ESTIMATION;
D O I
10.1002/sam.11653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outliers are common in longitudinal data analysis, and the multivariate contaminated normal (MCN) distribution in model-based clustering is often used to detect outliers and provide robust parameter estimates in each subgroup. In this paper, we propose a method, the mixture of MCN (MCNM), based on the joint mean-covariance model, specifically designed to analyze longitudinal data characterized by mild outliers. Our model can automatically detect outliers in longitudinal data and provide robust parameter estimates in each subgroup. We use iteratively expectation-conditional maximization (ECM) algorithm and Aitken acceleration to estimate the model parameters, achieving both algorithm acceleration and stable convergence. Our proposed method simultaneously clusters the population, identifies progression patterns of the mean and covariance structures for different subgroups over time, and detects outliers. To demonstrate the effectiveness of our method, we conduct simulation studies under various cases involving different proportions and degrees of contamination. Additionally, we apply our method to real data on the number of people infected with AIDS in 49 countries or regions from 2001 to 2021. Results show that our proposed method effectively clusters the data based on various mean progression trajectories. In summary, our proposed MCNM based on the joint mean-covariance model and MCD of covariance matrices provides a robust method for clustering longitudinal data with mild outliers. It effectively detects outliers and identifies progression patterns in different groups over time, making it valuable for various applications in longitudinal data analysis.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Smoothing and Mean-Covariance Estimation of Functional Data with a Bayesian Hierarchical Model
    Yang, Jingjing
    Zhu, Hongxiao
    Choi, Taeryon
    Cox, Dennis D.
    BAYESIAN ANALYSIS, 2016, 11 (03): : 649 - 670
  • [42] VARIABLE SELECTION IN ROBUST JOINT MEAN AND COVARIANCE MODEL FOR LONGITUDINAL DATA ANALYSIS
    Zheng, Xueying
    Fung, Wing Kam
    Zhu, Zhongyi
    STATISTICA SINICA, 2014, 24 (02) : 515 - 531
  • [43] Robust mixture regression modeling based on the normal mean-variance mixture distributions
    Naderi, Mehrdad
    Mirfarah, Elham
    Wang, Wan-Lun
    Lin, Tsung-, I
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [44] Robust semiparametric modeling of mean and covariance in longitudinal data
    Ran, Mengfei
    Yang, Yihe
    Kano, Yutaka
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2023, 6 (02) : 625 - 648
  • [45] Robust semiparametric modeling of mean and covariance in longitudinal data
    Mengfei Ran
    Yihe Yang
    Yutaka Kano
    Japanese Journal of Statistics and Data Science, 2023, 6 : 625 - 648
  • [46] Bayesian analysis of joint mean and covariance models for longitudinal data
    Xu, Dengke
    Zhang, Zhongzhan
    Wu, Liucang
    JOURNAL OF APPLIED STATISTICS, 2014, 41 (11) : 2504 - 2514
  • [47] COVARIANCE PATTERN MIXTURE MODELS FOR THE ANALYSIS OF MULTIVARIATE HETEROGENEOUS LONGITUDINAL DATA
    Anderlucci, Laura
    Viroli, Cinzia
    ANNALS OF APPLIED STATISTICS, 2015, 9 (02): : 777 - 800
  • [48] Modeling random effects for censored data by a multivariate normal regression model
    Klein, JP
    Pelz, C
    Zhang, MJ
    BIOMETRICS, 1999, 55 (02) : 497 - 506
  • [49] Comparison study of modeling covariance matrix for multivariate longitudinal data
    Kwak, Na Young
    Lee, Keunbaik
    KOREAN JOURNAL OF APPLIED STATISTICS, 2020, 33 (03) : 281 - 296
  • [50] Modeling the Cholesky factors of covariance matrices of multivariate longitudinal data
    Kohli, Priya
    Garcia, Tanya P.
    Pourahmadi, Mohsen
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 145 : 87 - 100