Modeling Longitudinal Data Using Matrix Completion

被引:0
|
作者
Kidzinski, Lukasz [1 ]
Hastie, Trevor [2 ]
机构
[1] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA USA
基金
美国国家科学基金会;
关键词
Interpolation; Matrix completion; Matrix factorization; Multivariate longitudinal data; Regression; LATENT VARIABLE MODELS; PRINCIPAL-COMPONENTS; JOINT MODELS; MISSING DATA;
D O I
10.1080/10618600.2023.2257257
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In clinical practice and biomedical research, measurements are often collected sparsely and irregularly in time, while the data acquisition is expensive and inconvenient. Examples include measurements of spine bone mineral density, cancer growth through mammography or biopsy, a progression of defective vision, or assessment of gait in patients with neurological disorders. Practitioners often need to infer the progression of diseases from such sparse observations. A classical tool for analyzing such data is a mixed-effect model where time is treated as both a fixed effect (population progression curve) and a random effect (individual variability). Alternatively, researchers use Gaussian processes or functional data analysis, assuming that observations are drawn from a certain distribution of processes. While these models are flexible, they rely on probabilistic assumptions, require very careful implementation, and tend to be slow in practice. In this study, we propose an alternative elementary framework for analyzing longitudinal data motivated by matrix completion. Our method yields estimates of progression curves by iterative application of the Singular Value Decomposition. Our framework covers multivariate longitudinal data, and regression and can be easily extended to other settings. As it relies on existing tools for matrix algebra, it is efficient and easy to implement. We apply our methods to understand trends of progression of motor impairment in children with Cerebral Palsy. Our model approximates individual progression curves and explains 30% of the variability. Low-rank representation of progression trends enables identification of different progression trends in subtypes of Cerebral Palsy.
引用
收藏
页码:551 / 566
页数:16
相关论文
共 50 条
  • [21] SAR Imaging With Undersampled Data via Matrix Completion
    Yang, Dong
    Liao, Guisheng
    Zhu, Shengqi
    Yang, Xi
    Zhang, Xuepan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (09) : 1539 - 1543
  • [22] Matrix Completion Methods for Causal Panel Data Models
    Athey, Susan
    Bayati, Mohsen
    Doudchenko, Nikolay
    Imbens, Guido
    Khosravi, Khashayar
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (536) : 1716 - 1730
  • [23] Matrix rearrangement for ISAR echo completion with undersampled data
    Wang, Shuo
    Han, Yusheng
    Zhu, Hong
    SEVENTH SYMPOSIUM ON NOVEL PHOTOELECTRONIC DETECTION TECHNOLOGY AND APPLICATIONS, 2021, 11763
  • [24] Matrix Completion, Counterfactuals, and Factor Analysis of Missing Data
    Bai, Jushan
    Ng, Serena
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (536) : 1746 - 1763
  • [25] FUNCTIONAL DATA ANALYSIS BY MATRIX COMPLETION1
    Descary, Marie-Helene
    Panaretos, Victor M.
    ANNALS OF STATISTICS, 2019, 47 (01): : 1 - 38
  • [26] The em algorithm for kernel matrix completion with auxiliary data
    Tsuda, K
    Akaho, S
    Asai, K
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (01) : 67 - 81
  • [27] Structured Matrix Completion with Applications to Genomic Data Integration
    Cai, Tianxi
    Cai, T. Tony
    Zhang, Anru
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (514) : 621 - 633
  • [28] Data Poisoning Attacks on Graph Convolutional Matrix Completion
    Zhou, Qi
    Ren, Yizhi
    Xia, Tianyu
    Yuan, Lifeng
    Chen, Linqiang
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2019, PT II, 2020, 11945 : 427 - 439
  • [29] Modeling nonstationary longitudinal data
    Núñez-Antón, V
    Zimmerman, DL
    BIOMETRICS, 2000, 56 (03) : 699 - 705
  • [30] Modeling longitudinal data.
    Gueorguieva, Ralitza
    BIOMETRICS, 2006, 62 (03) : 944 - 945