An adaptive deep Q-learning strategy for handwritten digit recognition

被引：52

作者：

Qiao, Junfei ^{[1
,2
]}

Wang, Gongming ^{[1
,2
]}

Li, Wenjing ^{[1
,2
]}

Chen, Min ^{[3
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[3] Civil Aviat Gen Hosp, Dept Obstet Gynecol, Beijing 100123, Peoples R China

来源：

NEURAL NETWORKS | 2018年 / 107卷

基金：

中国国家自然科学基金;

关键词：

Handwritten digits recognition; Deep learning; Reinforcement learning; Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; RESTRICTED BOLTZMANN MACHINES; DECISION-MAKING; BELIEF NETWORKS; NEURAL-NETWORKS; DIMENSIONALITY; EFFICIENT;

D O I：

10.1016/j.neunet.2018.02.010

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：61 / 71

页数：11

共 50 条

[31] Evaluating SPAN Incremental Learning for Handwritten Digit Recognition
Mohemmed, Ammar
Lu, Guoyu
Kasabov, Nikola
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 670 - 677
[32] Incremental Q-learning strategy for adaptive PID control of mobile robots
Carlucho, Ignacio
De Paula, Mariano
Villar, Sebastian A.
Acosta, Gerardo G.
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 80 : 183 - 199
[33] Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
Ciresan, Dan Claudiu
Meier, Ueli
Gambardella, Luca Maria
Schmidhuber, Juergen
NEURAL COMPUTATION, 2010, 22 (12) : 3207 - 3220
[34] Cooperative strategy based on adaptive Q-learning for robot soccer systems
Hwang, KS
Tan, SW
Chen, CC
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
[35] A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning
Kayakoku, Hakan
Guzel, Mehmet Serdar
Bostanci, Erkan
Medeni, Ihsan Tolga
Mishra, Deepti
COMPLEXITY, 2021, 2021
[36] Trading Strategy of the Cryptocurrency Market Based on Deep Q-Learning Agents
Huang, Chester S. J.
Su, Yu-Sheng
APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
[37] An intelligent financial portfolio trading strategy using deep Q-learning
Park, Hyungjun
Sim, Min Kyu
Choi, Dong Gu
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158 (158)
[38] Adaptive-Precision Framework for SGD Using Deep Q-Learning
Zhang, Wentai
Huang, Hanxian
Zhang, Jiaxi
Jiang, Ming
Luo, Guojie
2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
[39] Arabic handwritten digit recognition
Sherif Abdleazeem
Ezzat El-Sherif
International Journal of Document Analysis and Recognition (IJDAR), 2008, 11 : 127 - 141
[40] Arabic handwritten digit recognition
Abdleazeem, Sherif
El-Sherif, Ezzat
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2008, 11 (03) : 127 - 141

← 1 2 3 4 5 →