Fast Dictionary Learning with a Smoothed Wasserstein Loss

被引：0

作者：

Rolet, Antoine ^{[1
]}

Cuturi, Marco ^{[1
]}

Peyre, Gabriel ^{[2
]}

机构：

[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan

[2] Univ Paris 09, CNRS, CEREMADE, Paris, France

来源：

ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51 | 2016年 / 51卷

关键词：

MATRIX FACTORIZATION; EQUIVALENCE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider in this paper the dictionary learning problem when the observations are normalized histograms of features. This problem can be tackled using non-negative matrix factorization approaches, using typically Euclidean or Kullback-Leibler fitting errors. Because these fitting errors are separable and treat each feature on equal footing, they are blind to any similarity the features may share. We assume in this work that we have prior knowledge on these features. To leverage this side-information, we propose to use the Wasserstein (a.k.a. earth mover's or optimal transport) distance as the fitting error between each original point and its reconstruction, and we propose scalable algorithms to to so. Our methods build upon Fenchel duality and entropic regularization of Wasserstein distances, which improves not only speed but also computational stability. We apply these techniques on face images and text documents. We show in particular that we can learn dictionaries (topics) for bag-of-word representations of texts using words that may not have appeared in the original texts, or even words that come from a different language than that used in the texts.

引用

页码：630 / 638

页数：9

共 50 条

[41] Learning and smoothed analysis
Microsoft Research, New England, United States
不详
不详
Proc. Annu. IEEE Symp. Found. Comput. Sci. FOCS, 1600, (395-404):
[42] Learning to Generate Wasserstein Barycenters
Lacombe, Julien
Digne, Julie
Courty, Nicolas
Bonneel, Nicolas
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2023, 65 (02) : 354 - 370
[43] Bayesian learning with Wasserstein barycenters*
Backhoff-Veraguas, Julio
Fontbona, Joaquin
Rios, Gonzalo
Tobar, Felipe
ESAIM-PROBABILITY AND STATISTICS, 2022, 26 : 436 - 472
[44] Learning to Generate Wasserstein Barycenters
Julien Lacombe
Julie Digne
Nicolas Courty
Nicolas Bonneel
Journal of Mathematical Imaging and Vision, 2023, 65 : 354 - 370
[45] Learning and smoothed analysis
Kalai, Adam Tauman
Samorodnitsky, Alex
Teng, Shang-Hua
2009 50TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE: FOCS 2009, PROCEEDINGS, 2009, : 395 - 404
[46] Statistical Learning in Wasserstein Space
Karimi, Amirhossein
Ripani, Luigia
Georgiou, Tryphon T.
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03): : 899 - 904
[47] Wasserstein Unsupervised Reinforcement Learning
He, Shuncheng
Jiang, Yuhang
Zhang, Hongchang
Shao, Jianzhun
Ji, Xiangyang
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6884 - 6892
[48] Fast and Compact Kronecker-structured Dictionary Learning for Classification and Representation
Jindal, Ishan
Nokleby, Matthew
2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 200 - 204
[49] Polarization Image Fast Fusion Method Based on Online Dictionary Learning
Xu Guo-ming
Xue Mo-gen
Yuan Guang-lin
Zhou Pu-cheng
INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2013: IMAGING SPECTROMETER TECHNOLOGIES AND APPLICATIONS, 2013, 8910
[50] Denoising and fast diffusion imaging with physically constrained sparse dictionary learning
Gramfort, A.
Poupon, C.
Descoteaux, M.
MEDICAL IMAGE ANALYSIS, 2014, 18 (01) : 36 - 49

← 1 2 3 4 5 →