Structured Matrix Completion with Applications to Genomic Data Integration

被引:51
|
作者
Cai, Tianxi [1 ]
Cai, T. Tony [1 ]
Zhang, Anru [1 ]
机构
[1] Univ Penn, Dept Stat, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Constrained minimization; Genomic data integration; Low-rank matrix; Matrix completion; Singular value decomposition; Structured matrix completion; LOW-RANK MATRIX; MISSING VALUE ESTIMATION; GENE-EXPRESSION DATA; OVARIAN-CANCER; GENOTYPE IMPUTATION; PENALIZATION; ALGORITHM; MODEL;
D O I
10.1080/01621459.2015.1021005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics, and electrical engineering. Current literature on matrix completion focuses primarily on-independent sampling models under which the individual observed entries are sampled independently. Motivated by applications in genomic data integration, we propose a new framework of structured matrix completion (SMC) to treat structured rnissingness by design. Specifically, our proposed method aims at efficient matrix recovery when a subset of the rows and columns of an approximately low-rank matrix are observed. We provide theoretical justification for the proposed SMC method and derive lower bound for the estimation errors, which together establish the optimal rate of recovery over certain classes of approximately low-rank matrices. Simulation studies show that the method performs well in finite sample under a variety of configurations. The method is applied to integrate several ovarian cancer genomic studies with different extent of genomic measurements, which enables us to construct more accurate prediction rules for ovarian cancer survival. Supplementary materials for this article are available online.
引用
收藏
页码:621 / 633
页数:13
相关论文
共 50 条
  • [1] Structured matrix estimation and completion
    Klopp, Olga
    Luz, Yu
    Tsybakov, Alexandre B.
    Zhou, Harrison H.
    BERNOULLI, 2019, 25 (4B) : 3883 - 3911
  • [2] Matrix Completion for Structured Observations
    Molitor, Denali
    Needell, Deanna
    2018 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2018,
  • [3] AN ADAPTATION FOR ITERATIVE STRUCTURED MATRIX COMPLETION
    Adams, Henry
    Kassab, Lara
    Needell, Deanna
    FOUNDATIONS OF DATA SCIENCE, 2021, 3 (04): : 765 - 787
  • [4] An Adaptation for Iterative Structured Matrix Completion
    Kassab, Lara
    Adams, Henry
    Needell, Deanna
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1451 - 1456
  • [5] A Joint Matrix Completion and Filtering Model for Influenza Serological Data Integration
    Yuan, Xiao-Tong
    Zhang, Tong
    Wan, Xiu-Feng
    PLOS ONE, 2013, 8 (07):
  • [6] Matrix Completion in the Unit Hypercube via Structured Matrix Factorization
    Bugliarello, Emanuele
    Jain, Swayambhoo
    Rakesh, Vineeth
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2038 - 2044
  • [7] Sparse MIMO Radar via Structured Matrix Completion
    Chi, Yuejie
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 321 - 324
  • [8] Robust Spectral Compressed Sensing via Structured Matrix Completion
    Chen, Yuxin
    Chi, Yuejie
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (10) : 6576 - 6601
  • [9] Data Integration Approach for Semi-structured and Structured Data (Linked Data)
    Kettouch, Mohamed Salah
    Luca, Cristina
    Hobbs, Mike
    Fatima, Arooj
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2015, : 820 - 825
  • [10] Data integration and genomic medicine
    Louie, Brenton
    Mork, Peter
    Martin-Sanchez, Fernando
    Halevy, Alon
    Tarczy-Hornoch, Peter
    JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (01) : 5 - 16