An adaptive deep Q-learning strategy for handwritten digit recognition

被引:52
|
作者
Qiao, Junfei [1 ,2 ]
Wang, Gongming [1 ,2 ]
Li, Wenjing [1 ,2 ]
Chen, Min [3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Civil Aviat Gen Hosp, Dept Obstet Gynecol, Beijing 100123, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten digits recognition; Deep learning; Reinforcement learning; Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; RESTRICTED BOLTZMANN MACHINES; DECISION-MAKING; BELIEF NETWORKS; NEURAL-NETWORKS; DIMENSIONALITY; EFFICIENT;
D O I
10.1016/j.neunet.2018.02.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 50 条
  • [31] Evaluating SPAN Incremental Learning for Handwritten Digit Recognition
    Mohemmed, Ammar
    Lu, Guoyu
    Kasabov, Nikola
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 670 - 677
  • [32] Incremental Q-learning strategy for adaptive PID control of mobile robots
    Carlucho, Ignacio
    De Paula, Mariano
    Villar, Sebastian A.
    Acosta, Gerardo G.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 80 : 183 - 199
  • [33] Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
    Ciresan, Dan Claudiu
    Meier, Ueli
    Gambardella, Luca Maria
    Schmidhuber, Juergen
    NEURAL COMPUTATION, 2010, 22 (12) : 3207 - 3220
  • [34] Cooperative strategy based on adaptive Q-learning for robot soccer systems
    Hwang, KS
    Tan, SW
    Chen, CC
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
  • [35] A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning
    Kayakoku, Hakan
    Guzel, Mehmet Serdar
    Bostanci, Erkan
    Medeni, Ihsan Tolga
    Mishra, Deepti
    COMPLEXITY, 2021, 2021
  • [36] Trading Strategy of the Cryptocurrency Market Based on Deep Q-Learning Agents
    Huang, Chester S. J.
    Su, Yu-Sheng
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [37] An intelligent financial portfolio trading strategy using deep Q-learning
    Park, Hyungjun
    Sim, Min Kyu
    Choi, Dong Gu
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158 (158)
  • [38] Adaptive-Precision Framework for SGD Using Deep Q-Learning
    Zhang, Wentai
    Huang, Hanxian
    Zhang, Jiaxi
    Jiang, Ming
    Luo, Guojie
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
  • [39] Arabic handwritten digit recognition
    Sherif Abdleazeem
    Ezzat El-Sherif
    International Journal of Document Analysis and Recognition (IJDAR), 2008, 11 : 127 - 141
  • [40] Arabic handwritten digit recognition
    Abdleazeem, Sherif
    El-Sherif, Ezzat
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2008, 11 (03) : 127 - 141