On Learning Mixtures of Well-Separated Gaussians

被引:35
|
作者
Regev, Oded [1 ]
Vijayaraghavan, Aravindan [2 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Northwestern Univ, Dept EECS, Evanston, IL USA
来源
2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2017年
基金
美国国家科学基金会;
关键词
mixtures of Gaussians; unsupervised learning; clustering; parameter estimation; sample complexity; iterative algorithms; MODELS;
D O I
10.1109/FOCS.2017.17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of efficiently learning mixtures of a large number of spherical Gaussians, when the components of the mixture are well separated. In the most basic form of this problem, we are given samples from a uniform mixture of k standard spherical Gaussians with means mu(1),..., mu(k) is an element of R-d, and the goal is to estimate the means up to accuracy delta using poly(k, d, 1/delta) samples. In this work, we study the following question: what is the minimum separation needed between the means for solving this task? The best known algorithm due to Vempala and Wang [JCSS 2004] requires a separation of roughly min{k, d}(1/4). On the other hand, Moitra and Valiant [FOCS 2010] showed that with separation o(1), exponentially many samples are required. We address the significant gap between these two bounds, by showing the following results. We show that with separation o(root log k), superpolynomially many samples are required. In fact, this holds even when the k means of the Gaussians are picked at random in d = O(log k) dimensions. We show that with separation Omega(root log k), poly(k, d, 1/delta) samples suffice. Notice that the bound on the separation is independent of delta. This result is based on a new and efficient "accuracy boosting" algorithm that takes as input coarse estimates of the true means and in time (and samples) poly(k, d, 1/d) outputs estimates of the means up to arbitrarily good accuracy d assuming the separation between the means is Omega(min{root log k,root d}) (independently of delta). The idea of the algorithm is to iteratively solve a "diagonally dominant" system of non-linear equations. We also (1) present a computationally efficient algorithm in d = O(1) dimensions with only Omega(root d) separation, and (2) extend our results to the case that components might have different weights and variances. These results together essentially characterize the optimal order of separation between components that is needed to learn a mixture of k spherical Gaussians with polynomial samples.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [21] Distance labeling schemes for well-separated graph classes
    Katz, M
    Katz, NA
    Peleg, D
    STACS 2000: 17TH ANNUAL SYMPOSIUM ON THEORETICAL ASPECT OF COMPUTER SCIENCE, 2000, 1770 : 516 - 528
  • [22] Spatial interference from well-separated split condensates
    Zawadzki, M. E.
    Griffin, P. F.
    Riis, E.
    Arnold, A. S.
    PHYSICAL REVIEW A, 2010, 81 (04):
  • [23] A probabilistic analysis of EM for mixtures of separated, spherical Gaussians
    Dasgupta, Sanjoy
    Schulman, Leonard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 203 - 226
  • [24] Microscopical justification of the Winterbottom problem for well-separated lattices
    Piovano, Paolo
    Velcic, Igor
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2023, 231
  • [25] A probabilistic analysis of EM for mixtures of separated, spherical Gaussians
    Dasgupta, Sanjoy
    Schulman, Leonard
    Journal of Machine Learning Research, 2007, 8 : 203 - 226
  • [26] THE MODULI SPACE METRIC FOR WELL-SEPARATED BPS MONOPOLES
    GIBBONS, GW
    MANTON, NS
    PHYSICS LETTERS B, 1995, 356 (01) : 32 - 38
  • [27] The Emergence of Sparse Spanners and Greedy Well-Separated Pair Decomposition
    Gao, Jie
    Zhou, Dengpan
    ALGORITHM THEORY - SWAT 2010, PROCEEDINGS, 2010, 6139 : 50 - 61
  • [28] Motion of Skyrmions in Well-Separated Two-Lane Racetracks
    Lai, P.
    Zhao, G. P.
    Morvan, F. J.
    Wu, S. Q.
    Ran, N.
    SPIN, 2017, 7 (01)
  • [29] THEORY OF DIPOLE MOMENTS INDUCED BETWEEN WELL-SEPARATED ATOMS
    GERSTEN, JI
    PHYSICAL REVIEW, 1969, 179 (01): : 1 - &
  • [30] Well-Separated Pair Decompositions for High-Dimensional Datasets
    Matijevic, Domagoj
    ALGORITHMS, 2023, 16 (05)