An adaptive deep Q-learning strategy for handwritten digit recognition

被引:52
|
作者
Qiao, Junfei [1 ,2 ]
Wang, Gongming [1 ,2 ]
Li, Wenjing [1 ,2 ]
Chen, Min [3 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Civil Aviat Gen Hosp, Dept Obstet Gynecol, Beijing 100123, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten digits recognition; Deep learning; Reinforcement learning; Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; RESTRICTED BOLTZMANN MACHINES; DECISION-MAKING; BELIEF NETWORKS; NEURAL-NETWORKS; DIMENSIONALITY; EFFICIENT;
D O I
10.1016/j.neunet.2018.02.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 50 条
  • [21] Very Deep Neural Network for Handwritten Digit Recognition
    Li, Yang
    Li, Hang
    Xu, Yulong
    Wang, Jiabao
    Zhang, Yafei
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 174 - 182
  • [22] Deep Evolution of Image Representations for Handwritten Digit Recognition
    Agapitos, Alexandros
    O'Neill, Michael
    Nicolau, Miguel
    Fagan, David
    Kattan, Ahmed
    Brabazon, Anthony
    Curran, Kathleen
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 2452 - 2459
  • [23] ADAPTIVE CONTENTION WINDOW DESIGN USING DEEP Q-LEARNING
    Kumar, Abhishek
    Verma, Gunjan
    Rao, Chirag
    Swami, Ananthram
    Segarra, Santiago
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4950 - 4954
  • [24] Trend following deep Q-Learning strategy for stock trading
    Chakole, Jagdish
    Kurhekar, Manish
    EXPERT SYSTEMS, 2020, 37 (04)
  • [25] Air-Combat Strategy Using Deep Q-Learning
    Ma, Xiaoteng
    Xia, Li
    Zhao, Qianchuan
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 3952 - 3957
  • [26] DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset
    Kusetogullari, Huseyin
    Yavariabdi, Amir
    Hall, Johan
    Lavesson, Niklas
    BIG DATA RESEARCH, 2021, 23 (23)
  • [27] Adaptive Traffic Signal Control with Deep Recurrent Q-learning
    Zeng, Jinghong
    Hu, Jianming
    Zhang, Yi
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1215 - 1220
  • [28] Adaptive Bases for Q-learning
    Di Castro, Dotan
    Mannor, Shie
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4587 - 4593
  • [29] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
    Ohnishi, Shota
    Uchibe, Eiji
    Yamaguchi, Yotaro
    Nakanishi, Kosuke
    Yasui, Yuji
    Ishii, Shin
    FRONTIERS IN NEUROROBOTICS, 2019, 13
  • [30] Handwritten Multi-Digit Recognition With Machine Learning
    Boroojerdi, Soha
    Rudolph, George
    2022 INTERMOUNTAIN ENGINEERING, TECHNOLOGY AND COMPUTING (IETC), 2022,