Efficient Learning of Restricted Boltzmann Machines Using Covariance Estimates

Cited by: 0

Authors:

Upadhya, Vidyadhar [1]

Sastry, P. S. [1]

Affiliations:

[1] Indian Inst Sci Bangalore, Bangalore, Karnataka, India

Source:

ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101 | 2019 / Vol. 101

Keywords:

RBM; Maximum likelihood learning; Difference of Convex (DC) algorithm; Contrastive divergence

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning RBMs with standard algorithms such as CD(k) involves gradient descent on the negative log-likelihood. One term of the gradient, an expectation w.r.t. the model distribution, is intractable and is obtained through an MCMC estimate. In this work we show that the Hessian of the log-likelihood can be written in terms of covariances of hidden and visible units; hence, all elements of the Hessian can also be estimated from the same MCMC samples at small extra computational cost. Since inverting the Hessian may be computationally expensive, we propose an algorithm that instead uses the inverse of a diagonal approximation of the Hessian. This essentially results in parameter-specific adaptive learning rates for the gradient descent process and improves the efficiency of learning RBMs compared to the standard methods. Specifically, we show that using the inverse of the diagonal approximation of the Hessian in the stochastic DC (difference of convex functions) program approach results in very efficient learning of RBMs.
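The idea of parameter-specific learning rates can be illustrated with a minimal sketch. It is not the authors' stochastic DC algorithm: it is plain CD(k) in which each gradient component is divided by an estimate of the corresponding diagonal entry of the Hessian of the log-partition function. The sketch assumes a binary RBM with energy E(v, h) = -v^T W h - b^T v - c^T h, for which that diagonal entry is the model variance of the statistic v_i h_j (or v_i, h_j for the biases). All names (`scaled_cd_step`, `bernoulli_variance`, `lr`, `eps`) and the damping term `eps` are assumptions made for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def bernoulli_variance(mean):
    """Variance of a {0,1}-valued statistic with the given mean."""
    return mean * (1.0 - mean)

def scaled_cd_step(v_data, W, b, c, rng, k=1, lr=0.01, eps=0.1):
    """One CD(k) ascent step with per-parameter step sizes (illustrative).

    v_data: (n, n_visible) binary batch; W: (n_visible, n_hidden);
    b, c: visible and hidden biases. eps is a hypothetical damping term
    that keeps the step finite when a variance estimate is near zero.
    """
    n = v_data.shape[0]
    # Positive phase: hidden activation probabilities given the data.
    ph_data = sigmoid(v_data @ W + c)

    # Negative phase: k steps of block Gibbs sampling started at the data.
    v = v_data.copy()
    for _ in range(k):
        ph = sigmoid(v @ W + c)
        h = (rng.random(ph.shape) < ph).astype(float)
        pv = sigmoid(h @ W.T + b)
        v = (rng.random(pv.shape) < pv).astype(float)
    ph = sigmoid(v @ W + c)
    h = (rng.random(ph.shape) < ph).astype(float)

    # Model means of the statistics v_i h_j, v_i, h_j from the MCMC samples.
    m_W = v.T @ h / n
    m_b = v.mean(axis=0)
    m_c = h.mean(axis=0)

    # Log-likelihood gradient estimate: data term minus model term.
    g_W = v_data.T @ ph_data / n - m_W
    g_b = v_data.mean(axis=0) - m_b
    g_c = ph_data.mean(axis=0) - m_c

    # The same samples give the variance of each binary statistic, used
    # here as a diagonal curvature estimate that rescales each step.
    W_new = W + lr * g_W / (bernoulli_variance(m_W) + eps)
    b_new = b + lr * g_b / (bernoulli_variance(m_b) + eps)
    c_new = c + lr * g_c / (bernoulli_variance(m_c) + eps)
    return W_new, b_new, c_new
```

Because the variances are computed from the sample means already needed for the gradient, the extra cost per step is a few elementwise operations. Parameters whose statistics have low model variance get larger steps, up to the cap `lr / eps`.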

Pages: 851-866

Number of pages: 16
