On Learning Mixtures of Well-Separated Gaussians

被引:35
|
作者
Regev, Oded [1 ]
Vijayaraghavan, Aravindan [2 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Northwestern Univ, Dept EECS, Evanston, IL USA
来源
2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2017年
基金
美国国家科学基金会;
关键词
mixtures of Gaussians; unsupervised learning; clustering; parameter estimation; sample complexity; iterative algorithms; MODELS;
D O I
10.1109/FOCS.2017.17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of efficiently learning mixtures of a large number of spherical Gaussians, when the components of the mixture are well separated. In the most basic form of this problem, we are given samples from a uniform mixture of k standard spherical Gaussians with means mu(1),..., mu(k) is an element of R-d, and the goal is to estimate the means up to accuracy delta using poly(k, d, 1/delta) samples. In this work, we study the following question: what is the minimum separation needed between the means for solving this task? The best known algorithm due to Vempala and Wang [JCSS 2004] requires a separation of roughly min{k, d}(1/4). On the other hand, Moitra and Valiant [FOCS 2010] showed that with separation o(1), exponentially many samples are required. We address the significant gap between these two bounds, by showing the following results. We show that with separation o(root log k), superpolynomially many samples are required. In fact, this holds even when the k means of the Gaussians are picked at random in d = O(log k) dimensions. We show that with separation Omega(root log k), poly(k, d, 1/delta) samples suffice. Notice that the bound on the separation is independent of delta. This result is based on a new and efficient "accuracy boosting" algorithm that takes as input coarse estimates of the true means and in time (and samples) poly(k, d, 1/d) outputs estimates of the means up to arbitrarily good accuracy d assuming the separation between the means is Omega(min{root log k,root d}) (independently of delta). The idea of the algorithm is to iteratively solve a "diagonally dominant" system of non-linear equations. We also (1) present a computationally efficient algorithm in d = O(1) dimensions with only Omega(root d) separation, and (2) extend our results to the case that components might have different weights and variances. These results together essentially characterize the optimal order of separation between components that is needed to learn a mixture of k spherical Gaussians with polynomial samples.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [1] Learning mixtures of separated nonspherical gaussians
    Arora, S
    Kannan, R
    ANNALS OF APPLIED PROBABILITY, 2005, 15 (1A): : 69 - 92
  • [2] Differentially Private Algorithms for Learning Mixtures of Separated Gaussians
    Kamath, Gautam
    Sheffet, Or
    Singhal, Vikrant
    Ullman, Jonathan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Differentially Private Algorithms for Learning Mixtures of Separated Gaussians
    Kamath, Gautam
    Sheffet, Or
    Singhal, Vikrant
    Ullman, Jonathan
    2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2020,
  • [4] Well-Separated Spherical Designs
    Andriy Bondarenko
    Danylo Radchenko
    Maryna Viazovska
    Constructive Approximation, 2015, 41 : 93 - 112
  • [5] Well-Separated Spherical Designs
    Bondarenko, Andriy
    Radchenko, Danylo
    Viazovska, Maryna
    CONSTRUCTIVE APPROXIMATION, 2015, 41 (01) : 93 - 112
  • [6] Gradient flow for well-separated Skyrmions
    Irwin, PW
    Manton, NS
    PHYSICS LETTERS B, 1996, 385 (1-4) : 187 - 192
  • [7] SQ Lower Bounds for Learning Mixtures of Separated and Bounded Covariance Gaussians
    Diakonikolas, Ilias
    Kane, Daniel M.
    Pittas, Thanasis
    Zarifis, Nikos
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [8] Projections of planar sets in well-separated directions
    Orponen, Tuomas
    ADVANCES IN MATHEMATICS, 2016, 297 : 1 - 25
  • [9] Refined PSO Clustering for Not Well-Separated Data
    Sunny, Chilankamol
    Kumar, Shibu K. B.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (06) : 831 - 847
  • [10] On well-separated sets and fast multipole methods
    Engblom, Stefan
    APPLIED NUMERICAL MATHEMATICS, 2011, 61 (10) : 1096 - 1102