Gradient Estimation with Stochastic Softmax Tricks

被引：0

作者：

Paulus, Max B. ^{[1
]}

Choi, Dami ^{[2
]}

机构：

[1] Swiss Fed Inst Technol, Zurich, Switzerland

[2] Univ Toronto, Toronto, ON, Canada

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces. Our framework is a unified perspective on existing relaxed estimators for perturbation models, and it contains many novel relaxations. We design structured relaxations for subset selection, spanning trees, arborescences, and others. When compared to less structured baselines, we find that stochastic softmax tricks can be used to train latent variable models that perform better and discover more latent structure.

引用

页数：14

共 50 条

[11] An Alternate Policy Gradient Estimator for Softmax Policies
Garg, Shivam
Tosatto, Samuele
Pan, Yangchen
White, Martha
Mahmood, A. Rupam
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[12] Stochastic incremental gradient descent for estimation in sensor networks
Ram, S. Sundhar
Nedic, A.
Veeravalli, V. V.
CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 582 - 586
[13] Estimation of the Conditional Probability Using a Stochastic Gradient Process
Labriji, Ali
Bennar, Abdelkrim
Rachik, Mostafa
JOURNAL OF MATHEMATICS, 2021, 2021
[14] Online Covariance Matrix Estimation in Stochastic Gradient Descent
Zhu, Wanrong
Chen, Xi
Wu, Wei Biao
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (541) : 393 - 404
[15] Stochastic Natural Gradient Descent by Estimation of Empirical Covariances
Luigi, Malago
Matteo, Matteucci
Giovanni, Pistone
2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 949 - 956
[16] Stochastic gradient estimation strategies for Markov random fields
Younes, L
BAYESIAN INFERENCE FOR INVERSE PROBLEMS, 1998, 3459 : 315 - 325
[17] Gradient-based stochastic estimation of the density matrix
Wang, Zhentao
Chern, Gia-Wei
Batista, Cristian D.
Barros, Kipton
JOURNAL OF CHEMICAL PHYSICS, 2018, 148 (09):
[18] LIKELIHOOD RATIO GRADIENT ESTIMATION FOR STOCHASTIC-SYSTEMS
GLYNN, PW
COMMUNICATIONS OF THE ACM, 1990, 33 (10) : 75 - 84
[19] Stochastic gradient estimation using a single design point
Wieland, Jamie R.
Schmeiser, Bruce W.
PROCEEDINGS OF THE 2006 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2006, : 390 - 397
[20] Stochastic Gradient Estimation Algorithm for a class of Dual-Rate Stochastic Systems
Cui Guimei
Guan Yinghui
Zhang Yong
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 159 - 163

← 1 2 3 4 5 →