Linear Regression With Shuffled Data: Statistical and Computational Limits of Permutation Recovery

被引:64
|
作者
Pananjady, Ashwin [1 ]
Wainwright, Martin J. [1 ,2 ]
Courtade, Thomas A. [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Correspondence estimation; permutation recovery; unlabeled sensing; information-theoretic bounds; random projections; INFERENCE; SIGNALS; UNION; CODES;
D O I
10.1109/TIT.2017.2776217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Consider a noisy linear observation model with an unknown permutation, based on observing y = Pi*Ax* + w, where x* epsilon R-d is an unknown vector, Pi* is an unknown n x n permutation matrix, and w epsilon R-n is additive Gaussian noise. We analyze the problem of permutation recovery in a random design setting in which the entries of matrix A are drawn independently from a standard Gaussian distribution and establish sharp conditions on the signal-to-noise ratio, sample size n, and dimension d under which Pi* is exactly and approximately recoverable. On the computational front, we show that the maximum likelihood estimate of Pi* is NP-hard to compute for general d, while also providing a polynomial time algorithm when d = 1.
引用
收藏
页码:3286 / 3300
页数:15
相关论文
共 50 条
  • [1] Linear Regression with an Unknown Permutation: Statistical and Computational Limits
    Pananjady, Ashwin
    Wainwright, Martin J.
    Courtade, Thomas A.
    2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 417 - 424
  • [2] Sparse Recovery with Shuffled Labels: Statistical Limits and Practical Estimators
    Zhang, Hang
    Li, Ping
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1760 - 1765
  • [3] Generalized Shuffled Linear Regression
    Li, Feiran
    Fujiwara, Kent
    Okura, Fumio
    Matsushita, Yasuyuki
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6454 - 6463
  • [4] A Pseudo-Likelihood Approach to Linear Regression With Partially Shuffled Data
    Slawski, Martin
    Diao, Guoqing
    Ben-David, Emanuel
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (04) : 991 - 1003
  • [5] Shuffled Linear Regression with Erroneous Observations
    Saab, Samer S.
    Saab, Khaled Kamal
    Saab, Samer S., Jr.
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
  • [6] Statistical-Computational Tradeoffs in Mixed Sparse Linear Regression
    Arpino, Gabriel
    Venkataramanan, Ramji
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [7] Shuffled Linear Regression with Outliers in Both Covariates and Responses
    Li, Feiran
    Fujiwara, Kent
    Okura, Fumio
    Matsushita, Yasuyuki
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (03) : 732 - 751
  • [8] Permutation Predictor Tests in Linear Regression
    Ryu, Hye Min
    Woo, Min Ah
    Lee, Kyungjin
    Yoo, Jae Keun
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2013, 20 (02) : 147 - 155
  • [9] Shuffled Linear Regression with Outliers in Both Covariates and Responses
    Feiran Li
    Kent Fujiwara
    Fumio Okura
    Yasuyuki Matsushita
    International Journal of Computer Vision, 2023, 131 : 732 - 751
  • [10] Label Robust and Differentially Private Linear Regression: Computational and Statistical Efficiency
    Liu, Xiyang
    Jain, Prateek
    Kong, Weihao
    Oh, Sewoong
    Suggala, Arun Sai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,