Fast Dictionary Learning with a Smoothed Wasserstein Loss

被引:0
|
作者
Rolet, Antoine [1 ]
Cuturi, Marco [1 ]
Peyre, Gabriel [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Univ Paris 09, CNRS, CEREMADE, Paris, France
关键词
MATRIX FACTORIZATION; EQUIVALENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider in this paper the dictionary learning problem when the observations are normalized histograms of features. This problem can be tackled using non-negative matrix factorization approaches, using typically Euclidean or Kullback-Leibler fitting errors. Because these fitting errors are separable and treat each feature on equal footing, they are blind to any similarity the features may share. We assume in this work that we have prior knowledge on these features. To leverage this side-information, we propose to use the Wasserstein (a.k.a. earth mover's or optimal transport) distance as the fitting error between each original point and its reconstruction, and we propose scalable algorithms to to so. Our methods build upon Fenchel duality and entropic regularization of Wasserstein distances, which improves not only speed but also computational stability. We apply these techniques on face images and text documents. We show in particular that we can learn dictionaries (topics) for bag-of-word representations of texts using words that may not have appeared in the original texts, or even words that come from a different language than that used in the texts.
引用
收藏
页码:630 / 638
页数:9
相关论文
共 50 条
  • [21] Wasserstein distance loss function for financial time series deep learning
    Souto, Hugo Gobato
    Moradi, Amir
    SOFTWARE IMPACTS, 2024, 20
  • [22] Fast Computation of Wasserstein Barycenters
    Cuturi, Marco
    Doucet, Arnaud
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 685 - 693
  • [23] Fast Dictionary Learning for Sparse Representations of Speech Signals
    Jafari, Maria G.
    Plumbley, Mark D.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 1025 - 1031
  • [24] Robust fast dictionary learning for seismic noise attenuation
    Feng, Zhenjie
    GEOPHYSICAL PROSPECTING, 2022, 70 (07) : 1143 - 1162
  • [25] Fast and incoherent dictionary learning algorithms with application to fMRI
    Vahid Abolghasemi
    Saideh Ferdowsi
    Saeid Sanei
    Signal, Image and Video Processing, 2015, 9 : 147 - 158
  • [26] Fast single image SR via dictionary learning
    Mokari, Azade
    Ahmadyfard, Alireza
    IET IMAGE PROCESSING, 2017, 11 (02) : 135 - 144
  • [27] Analysis of Fast Alternating Minimization for Structured Dictionary Learning
    Ravishankar, Saiprasad
    Ma, Anna
    Needell, Deanna
    2018 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2018,
  • [28] Fast and incoherent dictionary learning algorithms with application to fMRI
    Abolghasemi, Vahid
    Ferdowsi, Saideh
    Sanei, Saeid
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (01) : 147 - 158
  • [29] Wasserstein Loss With Alternative Reinforcement Learning for Severity-Aware Semantic Segmentation
    Liu, Xiaofeng
    Lu, Yunhong
    Liu, Xiongchang
    Bai, Song
    Li, Site
    You, Jane
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 587 - 596
  • [30] Convolutional sparse dictionary learning with smoothed l0 norm and projected gradient descent
    Kitajima, Kazuki
    Sugano, Akira
    Kuroki, Yoshimitsu
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,