Fast Dictionary Learning with a Smoothed Wasserstein Loss

Cited by: 0
Authors
Rolet, Antoine [1]
Cuturi, Marco [1]
Peyre, Gabriel [2]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Univ Paris 09, CNRS, CEREMADE, Paris, France
Keywords
MATRIX FACTORIZATION; EQUIVALENCE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider in this paper the dictionary learning problem when the observations are normalized histograms of features. This problem can be tackled using non-negative matrix factorization approaches, typically with Euclidean or Kullback-Leibler fitting errors. Because these fitting errors are separable and treat each feature on an equal footing, they are blind to any similarity the features may share. We assume in this work that we have prior knowledge of these features. To leverage this side-information, we propose to use the Wasserstein (a.k.a. earth mover's or optimal transport) distance as the fitting error between each original point and its reconstruction, and we propose scalable algorithms to do so. Our methods build upon Fenchel duality and entropic regularization of Wasserstein distances, which improves not only speed but also computational stability. We apply these techniques to face images and text documents. We show in particular that we can learn dictionaries (topics) for bag-of-words representations of texts using words that may not have appeared in the original texts, or even words that come from a different language than that used in the texts.
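As a rough illustration of why a transport-based fitting error can see feature similarity where a separable one cannot, the sketch below compares a squared Euclidean error with an entropy-regularized transport cost computed by plain Sinkhorn iterations. It is not the algorithm of the paper, which works with Fenchel duals inside a dictionary learning loop; the function names, the three-bin histograms, the ground cost |i - j|, and the regularization strength `gamma` are all assumptions chosen for the example.

```python
import math

def squared_euclidean(p, q):
    """Separable fitting error: compares bins one by one, ignoring bin similarity."""
    return sum((pi - qi) ** 2 for pi, qi in zip(p, q))

def sinkhorn_cost(a, b, cost, gamma=0.1, n_iter=200):
    """Transport cost <P, cost> of the entropy-regularized plan between histograms a and b.

    gamma (regularization strength) and n_iter are illustrative choices.
    """
    n, m = len(a), len(b)
    # Gibbs kernel K = exp(-cost / gamma)
    K = [[math.exp(-cost[i][j] / gamma) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iter):
        # Alternate scaling updates: u = a / (K v), then v = b / (K^T u)
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    # Regularized plan P = diag(u) K diag(v); return its transport cost
    return sum(u[i] * K[i][j] * v[j] * cost[i][j]
               for i in range(n) for j in range(m))

# Hypothetical data: three ordered features, ground cost |i - j|
cost = [[abs(i - j) for j in range(3)] for i in range(3)]
x = [1.0, 0.0, 0.0]       # all mass on feature 0
y_near = [0.0, 1.0, 0.0]  # all mass on the neighbouring feature
y_far = [0.0, 0.0, 1.0]   # all mass on the most distant feature

e_near, e_far = squared_euclidean(x, y_near), squared_euclidean(x, y_far)
w_near, w_far = sinkhorn_cost(x, y_near, cost), sinkhorn_cost(x, y_far, cost)
print(e_near, e_far)  # the separable error cannot tell the two cases apart
print(w_near, w_far)  # the transport cost grows with the ground distance
```

In this toy setting the squared Euclidean error is the same for both reconstructions, while the transport cost is larger for the histogram whose mass sits on the more distant feature. A pure-Python loop is used only to keep the sketch dependency-free; a practical implementation would vectorize the kernel products.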
Pages: 630-638
Number of pages: 9