Properties and Bayesian fitting of restricted Boltzmann machines

被引:2
|
作者
Kaplan, Andee [1 ]
Nordman, Daniel [2 ]
Vardeman, Stephen [2 ,3 ]
机构
[1] Duke Univ, Dept Stat Sci, POB 90251, Durham, NC 27708 USA
[2] Iowa State Univ, Dept Stat, Ames, IA USA
[3] Iowa State Univ, Dept Ind & Mfg Syst Engn, Ames, IA USA
关键词
degeneracy; instability; classification; deep learning; graphical models; INFERENCE;
D O I
10.1002/sam.11396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A restricted Boltzmann machine (RBM) is an undirected graphical model constructed for discrete or continuous random variables, with two layers, one hidden and one visible, and no conditional dependency within a layer. In recent years, RBMs have risen to prominence due to their connection to deep learning. By treating a hidden layer of one RBM as the visible layer in a second RBM, a deep architecture can be created. RBMs thereby are thought to have the ability to encode very complex and rich structures in data, making them attractive for supervised learning. However, the generative behavior of RBMs largely is unexplored and typical fitting methodology does not easily allow for uncertainty quantification in addition to point estimates. In this paper, we discuss the relationship between RBM parameter specification in the binary case and model properties such as degeneracy, instability and uninterpretability. We also describe the associated difficulties that can arise with likelihood-based inference and further discuss the potential Bayes fitting of such (highly flexible) models, especially as Gibbs sampling (quasi-Bayes) methods often are advocated for the RBM model structure.
引用
收藏
页码:23 / 38
页数:16
相关论文
共 50 条
  • [1] Analysis on Noisy Boltzmann Machines and Noisy Restricted Boltzmann Machines
    Lu, Wenhao
    Leung, Chi-Sing
    Sum, John
    IEEE ACCESS, 2021, 9 : 112955 - 112965
  • [2] An Overview of Restricted Boltzmann Machines
    Upadhya, Vidyadhar
    Sastry, P. S.
    JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 2019, 99 (02) : 225 - 236
  • [3] Discrete Restricted Boltzmann Machines
    Montufar, Guido
    Morton, Jason
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 653 - 672
  • [4] Continuous restricted Boltzmann machines
    Harrison, Robert W.
    WIRELESS NETWORKS, 2022, 28 (03) : 1263 - 1267
  • [5] Discrete restricted Boltzmann machines
    Montúfar, Guido
    Morton, Jason
    Journal of Machine Learning Research, 2015, 16 : 653 - 672
  • [6] An overview on Restricted Boltzmann Machines
    Zhang, Nan
    Ding, Shifei
    Zhang, Jian
    Xue, Yu
    NEUROCOMPUTING, 2018, 275 : 1186 - 1199
  • [7] Restricted Boltzmann Machines: A Review
    Zhang J.
    Ding S.-F.
    Zhang N.
    Du P.
    Du W.
    Yu W.-J.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (07): : 2073 - 2090
  • [8] Supervised Restricted Boltzmann Machines
    Tu Dinh Nguyen
    Dinh Phung
    Viet Huynh
    Trung Le
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [9] Training Restricted Boltzmann Machines
    Fischer, Asja
    KUNSTLICHE INTELLIGENZ, 2015, 29 (04): : 441 - 444
  • [10] An Overview of Restricted Boltzmann Machines
    Vidyadhar Upadhya
    P. S. Sastry
    Journal of the Indian Institute of Science, 2019, 99 : 225 - 236