Gradient Estimation with Stochastic Softmax Tricks

被引:0
|
作者
Paulus, Max B. [1 ]
Choi, Dami [2 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Univ Toronto, Toronto, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces. Our framework is a unified perspective on existing relaxed estimators for perturbation models, and it contains many novel relaxations. We design structured relaxations for subset selection, spanning trees, arborescences, and others. When compared to less structured baselines, we find that stochastic softmax tricks can be used to train latent variable models that perform better and discover more latent structure.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Adaptive Gradient Estimation Stochastic Parallel Gradient Descent Algorithm for Laser Beam Cleanup
    Ma, Shiqing
    Yang, Ping
    Lai, Boheng
    Su, Chunxuan
    Zhao, Wang
    Yang, Kangjian
    Jin, Ruiyan
    Cheng, Tao
    Xu, Bing
    PHOTONICS, 2021, 8 (05)
  • [22] Robust Gradient Estimation Algorithm for a Stochastic System with Colored Noise
    Liu, Wentao
    Xiong, Weili
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (02) : 553 - 562
  • [23] Robust Gradient Estimation Algorithm for a Stochastic System with Colored Noise
    Wentao Liu
    Weili Xiong
    International Journal of Control, Automation and Systems, 2023, 21 : 553 - 562
  • [24] A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs
    Mao, Jingkai
    Foerster, Jakob
    Rocktaschel, Tim
    Al-Shedivat, Maruan
    Farquhar, Gregory
    Whiteson, Shimon
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [25] A Stochastic Gradient Method with Biased Estimation for Faster Nonconvex Optimization
    Bi, Jia
    Gunn, Steve R.
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 337 - 349
  • [26] Stochastic Gradient Matching Pursuit Algorithm Based on Sparse Estimation
    Zhao, Liquan
    Hu, Yunfeng
    Liu, Yulong
    ELECTRONICS, 2019, 8 (02):
  • [27] Hyperparameter estimation of a variational model using a stochastic gradient method
    Zerubia, J
    Blanc-Féraud, L
    BAYESIAN INFERENCE FOR INVERSE PROBLEMS, 1998, 3459 : 349 - 356
  • [28] Online estimation of the asymptotic variance for averaged stochastic gradient algorithms
    Godichon-Baggioni, Antoine
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2019, 203 : 1 - 19
  • [29] Stochastic mutual information gradient estimation for dimensionality reduction networks
    Oezdenizci, Ozan
    Erdogmus, Deniz
    INFORMATION SCIENCES, 2021, 570 : 298 - 305
  • [30] Stochastic mutual information gradient estimation for dimensionality reduction networks
    Özdenizci, Ozan
    Erdoğmuş, Deniz
    Information Sciences, 2021, 570 : 298 - 305