Discrete space reinforcement learning algorithm based on twin support vector machine classification

被引:1
|
作者
Wu, Wenguo [1 ]
Zhou, Zhengchun [1 ]
Adhikary, Avik Ranjan [1 ]
Dutta, Bapi [2 ]
机构
[1] Southwest Jiaotong Univ, Sch Math, Chengdu 611756, Peoples R China
[2] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
基金
中国国家自然科学基金;
关键词
Twin support vector machines; Actor-Critic; Reinforcement learning; Small-scale discrete space environment;
D O I
10.1016/j.patrec.2022.11.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has become one of the key component of various machine learning algorithms in recent years. However, traditional RL algorithms lack convergence speed and accuracy in small-scale discrete space environment. Recently An et al. proposed RL algorithm based on support vector machines (SVMs) (Pattern Recognit. Lett. 111 (2018) 30-35) which adopts the Advantage Actor-Critic (A2C) framework and improves the speed and accuracy of convergence in discrete space. Owing to the advantages of twin support vector machines (TWSVMs) over SVMs, in this paper, we propose a RL algorithm based on TWSVM classification. The proposed algorithm adopts a modified A2C framework, where there are multiple Actors and a single Critic. Finally, we compare the performance of the proposed algorithm with some existing algorithms in traditional RL environment. Interestingly, the proposed algorithm outperforms the existing algorithms in terms of convergence speed and accuracy. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:254 / 260
页数:7
相关论文
共 50 条
  • [21] ν-projection twin support vector machine for pattern classification
    Chen, Wei-Jie
    Shao, Yuan-Hai
    Li, Chun-Na
    Liu, Ming-Zeng
    Wang, Zhen
    Deng, Nai-Yang
    NEUROCOMPUTING, 2020, 376 : 10 - 24
  • [22] Robust twin support vector machine for pattern classification
    Qi, Zhiquan
    Tian, Yingjie
    Shi, Yong
    PATTERN RECOGNITION, 2013, 46 (01) : 305 - 316
  • [23] ν-twin support vector machine with Universum data for classification
    Xu, Yitian
    Chen, Mei
    Yang, Zhiji
    Li, Guohui
    APPLIED INTELLIGENCE, 2016, 44 (04) : 956 - 968
  • [24] An adaptive support vector machine learning algorithm for large classification problem
    Yu, Shu
    Yang, Xiaowei
    Hao, Zhifeng
    Liang, Yanchun
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 981 - 990
  • [25] Twin support vector machine: theory, algorithm and applications
    Shifei Ding
    Nan Zhang
    Xiekai Zhang
    Fulin Wu
    Neural Computing and Applications, 2017, 28 : 3119 - 3130
  • [26] Twin support vector machine: theory, algorithm and applications
    Ding, Shifei
    Zhang, Nan
    Zhang, Xiekai
    Wu, Fulin
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (11): : 3119 - 3130
  • [27] Q learning based on probability support vector classification machine
    Cheng, Yu-Hu
    Gao, Yang
    Wang, Xue-Song
    Zhongguo Kuangye Daxue Xuebao/Journal of China University of Mining and Technology, 2010, 39 (03): : 408 - 413
  • [28] A coordinate descent margin based-twin support vector machine for classification
    Shao, Yuan-Hai
    Deng, Nai-Yang
    NEURAL NETWORKS, 2012, 25 : 114 - 121
  • [29] Imbalanced Data Classification Based on Hybrid Resampling and Twin Support Vector Machine
    Cao, Lu
    Shen, Hong
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2017, 14 (03) : 579 - 595
  • [30] Classification of Heart Sounds Based on the Wavelet Fractal and Twin Support Vector Machine
    Li, Jinghui
    Ke, Li
    Du, Qiang
    ENTROPY, 2019, 21 (05):