An adaptive deep Q-learning strategy for handwritten digit recognition

Cited by: 52

Authors:

Qiao, Junfei [1,2]

Wang, Gongming [1,2]

Li, Wenjing [1,2]

Chen, Min [3]

Affiliations:

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[3] Civil Aviat Gen Hosp, Dept Obstet Gynecol, Beijing 100123, Peoples R China

Source:

NEURAL NETWORKS | 2018 / Vol. 107

Funding:

National Natural Science Foundation of China;

Keywords:

Handwritten digits recognition; Deep learning; Reinforcement learning; Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; RESTRICTED BOLTZMANN MACHINES; DECISION-MAKING; BELIEF NETWORKS; NEURAL-NETWORKS; DIMENSIONALITY; EFFICIENT;

DOI:

10.1016/j.neunet.2018.02.010

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Handwritten digit recognition has remained a challenging problem in recent years. Although many deep learning-based classification algorithms have been studied for handwritten digit recognition, both recognition accuracy and running time still need improvement. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The strategy combines the feature-extraction capability of deep learning with the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts features from the original images using an adaptive deep auto-encoder (ADAE), and the extracted features are treated as the current states of the Q-learning algorithm. Second, Q-ADBN receives the Q-function (reward signal) while recognizing the current states, and the final recognition decision is made by maximizing the Q-function with the Q-learning algorithm. Finally, experimental results on the well-known MNIST dataset show that the proposed Q-ADBN outperforms other similar methods in terms of accuracy and running time. (C) 2018 Elsevier Ltd. All rights reserved.
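
The decision step described in the abstract (features as states, recognition by maximizing the Q-function) can be illustrated with a minimal sketch. This is not the authors' Q-ADBN: it is a plain tabular, one-step Q-learning loop in which each action is a candidate label and the reward is 1 for a correct label and 0 otherwise. The names `train_q_classifier` and `predict`, the reward scheme, the hyperparameters, and the toy feature codes standing in for ADAE outputs are all assumptions made for illustration.

```python
import random


def train_q_classifier(samples, n_actions, episodes=2000,
                       alpha=0.1, epsilon=0.1, seed=0):
    """Tabular one-step Q-learning where each action is a candidate label.

    samples: list of (state, label) pairs. Each state must be hashable and
    stands in for a feature code produced by some encoder (an assumption;
    the paper's ADAE is not reproduced here).
    """
    rng = random.Random(seed)
    q = {}  # maps state -> list of Q-values, one per action
    for _ in range(episodes):
        state, label = rng.choice(samples)
        values = q.setdefault(state, [0.0] * n_actions)
        # Epsilon-greedy selection: explore occasionally, else act greedily.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=values.__getitem__)
        # Hypothetical reward signal: 1 for the correct label, 0 otherwise.
        reward = 1.0 if action == label else 0.0
        # Each episode ends after one decision, so no bootstrapped term.
        values[action] += alpha * (reward - values[action])
    return q


def predict(q, state, n_actions):
    """Recognize a state by choosing the action that maximizes Q."""
    values = q.get(state, [0.0] * n_actions)
    return max(range(n_actions), key=values.__getitem__)


# Toy feature codes (hypothetical) paired with class labels 0..2.
toy_samples = [((0, 1), 0), ((1, 0), 1), ((1, 1), 2)]
q_table = train_q_classifier(toy_samples, n_actions=3)
predictions = [predict(q_table, s, 3) for s, _ in toy_samples]
```

In this sketch the table only covers states seen during training; a deep variant would replace the table with a network that maps continuous features to Q-values.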

Pages: 61-71

Page count: 11

50 items in total

[41] FIRMLP for Handwritten Digit Recognition
Codrescu, Cristinel
PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 483 - 488
[42] Neocognitron for handwritten digit recognition
Fukushima, K
NEUROCOMPUTING, 2003, 51 : 161 - 180
[43] A novel deep learning driven robot path planning strategy: Q-learning approach
Hu, Junli
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 71 (03) : 237 - 243
[44] Q-learning with heterogeneous update strategy
Tan, Tao
Xie, Hong
Feng, Liang
INFORMATION SCIENCES, 2024, 656
[45] An adaptive architecture for modular Q-learning
Kohri, T
Matsubayashi, K
Tokoro, M
IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 820 - 825
[46] Deep Reinforcement Learning with Double Q-Learning
van Hasselt, Hado
Guez, Arthur
Silver, David
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2094 - 2100
[47] Fuzzy Q-Learning with an Adaptive Representation
Waldock, A.
Carse, B.
2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 720 - +
[48] Adaptive moving average Q-learning
Tan, Tao
Xie, Hong
Xia, Yunni
Shi, Xiaoyu
Shang, Mingsheng
KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (12) : 7389 - 7417
[49] Iterative Learning of Fisher Linear Discriminants for Handwritten Digit Recognition
Qin Feng
Gao Daqi
2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[50] An optimal weight learning machine for handwritten digit image recognition
Man, Zhihong
Lee, Kevin
Wang, Dianhui
Cao, Zhenwei
Khoo, Suiyang
SIGNAL PROCESSING, 2013, 93 (06) : 1624 - 1638
