Structured Matrix Completion with Applications to Genomic Data Integration

被引:51
|
作者
Cai, Tianxi [1 ]
Cai, T. Tony [1 ]
Zhang, Anru [1 ]
机构
[1] Univ Penn, Dept Stat, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Constrained minimization; Genomic data integration; Low-rank matrix; Matrix completion; Singular value decomposition; Structured matrix completion; LOW-RANK MATRIX; MISSING VALUE ESTIMATION; GENE-EXPRESSION DATA; OVARIAN-CANCER; GENOTYPE IMPUTATION; PENALIZATION; ALGORITHM; MODEL;
D O I
10.1080/01621459.2015.1021005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics, and electrical engineering. Current literature on matrix completion focuses primarily on-independent sampling models under which the individual observed entries are sampled independently. Motivated by applications in genomic data integration, we propose a new framework of structured matrix completion (SMC) to treat structured rnissingness by design. Specifically, our proposed method aims at efficient matrix recovery when a subset of the rows and columns of an approximately low-rank matrix are observed. We provide theoretical justification for the proposed SMC method and derive lower bound for the estimation errors, which together establish the optimal rate of recovery over certain classes of approximately low-rank matrices. Simulation studies show that the method performs well in finite sample under a variety of configurations. The method is applied to integrate several ovarian cancer genomic studies with different extent of genomic measurements, which enables us to construct more accurate prediction rules for ovarian cancer survival. Supplementary materials for this article are available online.
引用
收藏
页码:621 / 633
页数:13
相关论文
共 50 条
  • [21] A Close Look at Matrix Completion with Noisy Data
    Yang, Ke
    Wu, Shuai
    Wang, Kai
    Wang, Yi
    Wang, Chaofei
    2022 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING, ICRAE, 2022, : 358 - 364
  • [22] Efficient matrix completion for seismic data reconstruction
    Kumar, Rajiv
    Da Silva, Curt
    Akalin, Okan
    Aravkin, Aleksandr Y.
    Mansour, Hassan
    Recht, Benjamin
    Herrmann, Felix J.
    GEOPHYSICS, 2015, 80 (05) : V97 - V113
  • [23] Recovery of Damped Exponentials Using Structured Low Rank Matrix Completion
    Balachandrasekaran, Arvind
    Magnotta, Vincent
    Jacob, Mathews
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (10) : 2087 - 2098
  • [24] Environmental Monitoring in Wireless Sensor Networks using Structured Matrix Completion
    He, Jingfei
    Zhou, Yatong
    Sun, Guiling
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [25] On the genomic analysis of data from structured populations
    de los Campos, G.
    Sorensen, D.
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2014, 131 (03) : 163 - 164
  • [26] An automated integration approach for semi-structured and structured data
    Lim, SJ
    Ng, YK
    PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON COOPERATIVE DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2000, : 12 - 21
  • [27] Schemas for integration and translation of structured and semi-structured data
    Beeri, C
    Milo, T
    DATABASE THEORY - ICDT'99, 1999, 1540 : 296 - 313
  • [28] Novel matrix forms of rough set flow graphs with applications to data integration
    Chitcharoen, Doungrat
    Pattaraintakorn, Puntip
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2010, 60 (10) : 2880 - 2897
  • [29] Matrix Completion with Ordering Relation Constraints and its Applications
    Huang, Yizhen
    Guan, Yepeng
    Wang, Jiawen
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (04)
  • [30] INTEGRATION OF MULTIPLE GENOMIC IMAGING DATA FOR THE STUDY OF SCHIZOPHRENIA USING JOINT NONNEGATIVE MATRIX FACTORIZATION
    Wang, Min
    Huang, Ting-Zhu
    Calhoun, Vince D.
    Fang, Jian
    Wang, Yu-Ping
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1083 - 1087