An adaptive deep Q-learning strategy for handwritten digit recognition

被引:52
|
作者
Qiao, Junfei [1 ,2 ]
Wang, Gongming [1 ,2 ]
Li, Wenjing [1 ,2 ]
Chen, Min [3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Civil Aviat Gen Hosp, Dept Obstet Gynecol, Beijing 100123, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten digits recognition; Deep learning; Reinforcement learning; Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; RESTRICTED BOLTZMANN MACHINES; DECISION-MAKING; BELIEF NETWORKS; NEURAL-NETWORKS; DIMENSIONALITY; EFFICIENT;
D O I
10.1016/j.neunet.2018.02.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 50 条
  • [41] FIRMLP for Handwritten Digit Recognition
    Codrescu, Cristinel
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 483 - 488
  • [42] Neocognitron for handwritten digit recognition
    Fukushima, K
    NEUROCOMPUTING, 2003, 51 : 161 - 180
  • [43] A novel deep learning driven robot path planning strategy: Q-learning approach
    Hu, Junli
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 71 (03) : 237 - 243
  • [44] Q-learning with heterogeneous update strategy
    Tan, Tao
    Xie, Hong
    Feng, Liang
    INFORMATION SCIENCES, 2024, 656
  • [45] An adaptive architecture for modular Q-learning
    Kohri, T
    Matsubayashi, K
    Tokoro, M
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 820 - 825
  • [46] Deep Reinforcement Learning with Double Q-Learning
    van Hasselt, Hado
    Guez, Arthur
    Silver, David
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2094 - 2100
  • [47] Fuzzy Q-Learning with an Adaptive Representation
    Waldock, A.
    Carse, B.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 720 - +
  • [48] Adaptive moving average Q-learning
    Tan, Tao
    Xie, Hong
    Xia, Yunni
    Shi, Xiaoyu
    Shang, Mingsheng
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7389 - 7417
  • [49] Iterative Learning of Fisher Linear Discriminants for Handwritten Digit Recognition
    Qin Feng
    Gao Daqi
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [50] An optimal weight learning machine for handwritten digit image recognition
    Man, Zhihong
    Lee, Kevin
    Wang, Dianhui
    Cao, Zhenwei
    Khoo, Suiyang
    SIGNAL PROCESSING, 2013, 93 (06) : 1624 - 1638