A fast learning algorithm for deep belief nets

被引:12702
|
作者
Hinton, Geoffrey E. [1 ]
Osindero, Simon
Teh, Yee-Whye
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
[2] Natl Univ Singapore, Dept Comp Sci, Singapore 117543, Singapore
关键词
D O I
10.1162/neco.2006.18.7.1527
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show how to use "complementary priors" to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.
引用
收藏
页码:1527 / 1554
页数:28
相关论文
共 50 条
  • [1] Aircraft Detection by Deep Belief Nets
    Chen, Xueyun
    Xiang, Shiming
    Liu, Cheng-Lin
    Pan, Chun-Hong
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 54 - 58
  • [2] Learning Sparse Feature Representations Using Probabilistic Quadtrees and Deep Belief Nets
    Basu, Saikat
    Karki, Manohar
    Ganguly, Sangram
    DiBiano, Robert
    Mukhopadhyay, Supratik
    Gayaka, Shreekant
    Kannan, Rajgopal
    Nemani, Ramakrishna
    NEURAL PROCESSING LETTERS, 2017, 45 (03) : 855 - 867
  • [3] Learning Sparse Feature Representations Using Probabilistic Quadtrees and Deep Belief Nets
    Saikat Basu
    Manohar Karki
    Sangram Ganguly
    Robert DiBiano
    Supratik Mukhopadhyay
    Shreekant Gayaka
    Rajgopal Kannan
    Ramakrishna Nemani
    Neural Processing Letters, 2017, 45 : 855 - 867
  • [4] Research and Application on Face Recognition Algorithm Based on FWLD Method and Deep Belief Nets
    Zhang, Yihong
    Li, Xiang
    Wang, Zhijie
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 582 - 587
  • [5] Sparse Maximum Entropy Deep Belief Nets
    Jing, How
    Tsao, Yu
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [6] Application of Deep Belief Nets for Collaborative Filtering
    Zhao, Chong
    Shi, Jiyun
    Jiang, Tao
    Zhao, Junyao
    Chen, Jiehao
    2016 16TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2016, : 201 - 205
  • [7] Learning Parameters in Deep Belief Networks Through Firefly Algorithm
    Rosa, Gustavo
    Papa, Joao
    Costa, Kelton
    Passos, Leandro
    Pereira, Clayton
    Yang, Xin-She
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 138 - 149
  • [8] Modeling electroencephalography waveforms with semi-supervised deep belief nets: fast classification and anomaly measurement
    Wulsin, D. F.
    Gupta, J. R.
    Mani, R.
    Blanco, J. A.
    Litt, B.
    JOURNAL OF NEURAL ENGINEERING, 2011, 8 (03)
  • [9] Intrusion detection model based on deep belief nets
    Gao, Ni
    Gao, Ling
    He, Yiyue
    Gao, Quanli
    Ren, Jie
    Journal of Southeast University (English Edition), 2015, 31 (03) : 339 - 346
  • [10] Large-Scale Deep Belief Nets With MapReduce
    Zhang, Kunlei
    Chen, Xue-Wen
    IEEE ACCESS, 2014, 2 : 395 - 403