On Learning Mixtures of Well-Separated Gaussians

被引：35

作者：

Regev, Oded ^{[1
]}

Vijayaraghavan, Aravindan ^{[2
]}

机构：

[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA

[2] Northwestern Univ, Dept EECS, Evanston, IL USA

来源：

2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2017年

基金：

美国国家科学基金会;

关键词：

mixtures of Gaussians; unsupervised learning; clustering; parameter estimation; sample complexity; iterative algorithms; MODELS;

D O I：

10.1109/FOCS.2017.17

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We consider the problem of efficiently learning mixtures of a large number of spherical Gaussians, when the components of the mixture are well separated. In the most basic form of this problem, we are given samples from a uniform mixture of k standard spherical Gaussians with means mu(1),..., mu(k) is an element of R-d, and the goal is to estimate the means up to accuracy delta using poly(k, d, 1/delta) samples. In this work, we study the following question: what is the minimum separation needed between the means for solving this task? The best known algorithm due to Vempala and Wang [JCSS 2004] requires a separation of roughly min{k, d}(1/4). On the other hand, Moitra and Valiant [FOCS 2010] showed that with separation o(1), exponentially many samples are required. We address the significant gap between these two bounds, by showing the following results. We show that with separation o(root log k), superpolynomially many samples are required. In fact, this holds even when the k means of the Gaussians are picked at random in d = O(log k) dimensions. We show that with separation Omega(root log k), poly(k, d, 1/delta) samples suffice. Notice that the bound on the separation is independent of delta. This result is based on a new and efficient "accuracy boosting" algorithm that takes as input coarse estimates of the true means and in time (and samples) poly(k, d, 1/d) outputs estimates of the means up to arbitrarily good accuracy d assuming the separation between the means is Omega(min{root log k,root d}) (independently of delta). The idea of the algorithm is to iteratively solve a "diagonally dominant" system of non-linear equations. We also (1) present a computationally efficient algorithm in d = O(1) dimensions with only Omega(root d) separation, and (2) extend our results to the case that components might have different weights and variances. These results together essentially characterize the optimal order of separation between components that is needed to learn a mixture of k spherical Gaussians with polynomial samples.

引用

页码：85 / 96

页数：12

共 50 条

[1] Learning mixtures of separated nonspherical gaussians
Arora, S
Kannan, R
ANNALS OF APPLIED PROBABILITY, 2005, 15 (1A): : 69 - 92
[2] Differentially Private Algorithms for Learning Mixtures of Separated Gaussians
Kamath, Gautam
Sheffet, Or
Singhal, Vikrant
Ullman, Jonathan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[3] Differentially Private Algorithms for Learning Mixtures of Separated Gaussians
Kamath, Gautam
Sheffet, Or
Singhal, Vikrant
Ullman, Jonathan
2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2020,
[4] Well-Separated Spherical Designs
Andriy Bondarenko
Danylo Radchenko
Maryna Viazovska
Constructive Approximation, 2015, 41 : 93 - 112
[5] Well-Separated Spherical Designs
Bondarenko, Andriy
Radchenko, Danylo
Viazovska, Maryna
CONSTRUCTIVE APPROXIMATION, 2015, 41 (01) : 93 - 112
[6] Gradient flow for well-separated Skyrmions
Irwin, PW
Manton, NS
PHYSICS LETTERS B, 1996, 385 (1-4) : 187 - 192
[7] SQ Lower Bounds for Learning Mixtures of Separated and Bounded Covariance Gaussians
Diakonikolas, Ilias
Kane, Daniel M.
Pittas, Thanasis
Zarifis, Nikos
THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
[8] Projections of planar sets in well-separated directions
Orponen, Tuomas
ADVANCES IN MATHEMATICS, 2016, 297 : 1 - 25
[9] Refined PSO Clustering for Not Well-Separated Data
Sunny, Chilankamol
Kumar, Shibu K. B.
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (06) : 831 - 847
[10] On well-separated sets and fast multipole methods
Engblom, Stefan
APPLIED NUMERICAL MATHEMATICS, 2011, 61 (10) : 1096 - 1102

← 1 2 3 4 5 →