Efficient Learning of Restricted Boltzmann Machines Using Covariance Estimates

被引:0
|
作者
Upadhya, Vidyadhar [1 ]
Sastry, P. S. [1 ]
机构
[1] Indian Inst Sci Bangalore, Bangalore, Karnataka, India
关键词
RBM; Maximum likelihood learning; Difference of Convex (DC) algorithm; Contrastive divergence;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning RBMs using standard algorithms such as CD(k) involves gradient descent on the negative log-likelihood. One of the terms in the gradient, which involves expectation w.r.t. the model distribution, is intractable and is obtained through an MCMC estimate. In this work we show that the Hessian of the log-likelihood can be written in terms of covariances of hidden and visible units and hence, all elements of the Hessian can also be estimated using the same MCMC samples with small extra computational costs. Since inverting the Hessian may be computationally expensive, we propose an algorithm that uses inverse of the diagonal approximation of the Hessian, instead. This essentially results in parameter-specific adaptive learning rates for the gradient descent process and improves the efficiency of learning RBMs compared to the standard methods. Specifically we show that using the inverse of diagonal approximation of Hessian in the stochastic DC (difference of convex functions) program approach results in very efficient learning of RBMs.
引用
收藏
页码:851 / 866
页数:16
相关论文
共 50 条
  • [1] Parallel Tempering is Efficient for Learning Restricted Boltzmann Machines
    Cho, KyungHyun
    Raiko, Tapani
    Ilin, Alexander
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [2] LEARNING SPAM FEATURES USING RESTRICTED BOLTZMANN MACHINES
    da Silva, Luis Alexandre
    Pontara da Costa, Kelton Augusto
    Ribeiro, Patricia Bellin
    de Rosa, Gustavo Henrique
    Papa, Joao Paulo
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 11 (01): : 99 - 114
  • [3] SCALABLE LEARNING FOR RESTRICTED BOLTZMANN MACHINES
    Barshan, Elnaz
    Fieguth, Paul
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2754 - 2758
  • [4] Spectral dynamics of learning in restricted Boltzmann machines
    Decelle, A.
    Fissore, G.
    Furtlehner, C.
    EPL, 2017, 119 (06)
  • [5] Neurosymbolic Reasoning and Learning with Restricted Boltzmann Machines
    Tran, Son N.
    Garcez, Artur d'Avila
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023, : 6558 - 6565
  • [6] Approximate Learning Algorithm for Restricted Boltzmann Machines
    Yasuda, Muneki
    Tanaka, Kazuyuki
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 692 - 697
  • [7] An Incremental Learning Approach for Restricted Boltzmann Machines
    Yu, Jongmin
    Gwak, Jeonghwan
    Lee, Sejeong
    Jeon, Moongu
    FOURTH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (CCAIS 2015), 2015, : 113 - 117
  • [8] An approach to improve online sequential extreme learning machines using restricted Boltzmann machines
    Pacheco, Andre G. C.
    Krohling, Renato A.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [9] Unsupervised hierarchical clustering using the learning dynamics of restricted Boltzmann machines
    Decelle, Aurelien
    Seoane, Beatriz
    Rosset, Lorenzo
    PHYSICAL REVIEW E, 2023, 108 (01)
  • [10] LEARNING A BETTER REPRESENTATION OF SPEECH SOUNDWAVES USING RESTRICTED BOLTZMANN MACHINES
    Jaitly, Navdeep
    Hinton, Geoffrey
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5884 - 5887