Tradeoffs for Space, Time, Data and Risk in Unsupervised Learning

被引:0
|
作者
Lucic, Mario [1 ]
Ohannessian, Mesrob, I [2 ]
Karbasi, Amin [3 ]
Krause, Andreas [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Univ Calif San Diego, La Jolla, CA 92093 USA
[3] Yale Univ, New Haven, CT 06520 USA
关键词
FRAMEWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Faced with massive data, is it possible to trade off (statistical) risk, and (computational) space and time? This challenge lies at the heart of large-scale machine learning. Using k-means clustering as a prototypical unsupervised learning problem, we show how we can strategically summarize the data (control space) in order to trade off risk and time when data is generated by a probabilistic model. Our summarization is based on coreset constructions from computational geometry. We also develop an algorithm, TRAM, to navigate the space/time/data/risk tradeoff in practice. In particular, we show that for a fixed risk (or data size), as the data size increases (resp. risk increases) the running time of TRAM decreases. Our extensive experiments on real data sets demonstrate the existence and practical utility of such tradeoffs, not only for k-means but also for Gaussian Mixture Models.
引用
收藏
页码:663 / 671
页数:9
相关论文
共 50 条
  • [21] Memory Devices: Energy-Space-Time Tradeoffs
    Zhirnov, Victor V.
    Cavin, Ralph K., III
    Menzel, Stephan
    Linn, Eike
    Schmelzer, Sebastian
    Braeuhaus, Dennis
    Schindler, Christina
    Waser, Rainer
    PROCEEDINGS OF THE IEEE, 2010, 98 (12) : 2185 - 2200
  • [22] Time-space tradeoffs for SAT on nonuniform machines
    Tourlakis, I
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2001, 63 (02) : 268 - 287
  • [23] Time-space tradeoffs in algebraic complexity theory
    Aldaz, M
    Heintz, J
    Matera, G
    Montaña, JL
    Pardo, LM
    JOURNAL OF COMPLEXITY, 2000, 16 (01) : 2 - 49
  • [24] 2 TIME-SPACE TRADEOFFS FOR ELEMENT DISTINCTNESS
    KARCHMER, M
    THEORETICAL COMPUTER SCIENCE, 1986, 47 (03) : 237 - 246
  • [25] Time space tradeoffs in vector algorithms for APL functions
    Budd, Timothy A.
    SIGPLAN Notices (ACM Special Interest Group on Programming Languages), 1988, 23 (12): : 63 - 68
  • [26] Quantum Time-Space Tradeoffs for Matrix Problems
    Beame, Paul
    Kornerup, Niels
    Whitmeyer, Michael
    PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 596 - 607
  • [27] Optimal Space-time Tradeoffs for Inverted Indexes
    Ottaviano, Giuseppe
    Tonellotto, Nicola
    Venturini, Rossano
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 47 - 56
  • [28] Revisiting Time-Space Tradeoffs for Function Inversion
    Golovnev, Alexander
    Guo, Siyao
    Peters, Spencer
    Stephens-Davidowitz, Noah
    ADVANCES IN CRYPTOLOGY - CRYPTO 2023, PT II, 2023, 14082 : 453 - 481
  • [29] Data-Time Tradeoffs for Corrupted Sensing
    Chen, Jinchi
    Liu, Yulong
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (07) : 941 - 945
  • [30] Time-Data Tradeoffs by Aggressive Smoothing
    Bruer, John J.
    Tropp, Joel A.
    Cevher, Volkan
    Becker, Stephen R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27