Prescribed attractivity region selection for recurrent neural networks based on deep reinforcement learning

被引:1
|
作者
Bao, Gang [1 ]
Song, Zhenyan [1 ]
Xu, Rui [1 ]
机构
[1] China Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 05期
基金
中国国家自然科学基金;
关键词
Recurrent neural networks; Attractivity region selection; Deep reinforcement learning; GLOBAL EXPONENTIAL STABILITY; TIME-VARYING DELAYS; DESIGN;
D O I
10.1007/s00521-023-09191-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks' (RNNs') outputs are the same when network states converge to the same saturation region. Strong external inputs can cause the neural network to converge to a prescribed saturation region. Different from previous works, this paper employs deep reinforcement learning to obtain external inputs to make network states converge to the desired saturation region. Firstly, for five-dimensional neural networks, the deep Q learning (DQN) algorithm is used to compute the optimal external inputs that make the network state converge to the specified saturation region. When scaling to n-dimensional RNNs, the problem of dimensional disaster is encountered. Then, it proposes a batch computation of the external inputs to cope with the curse of dimensionality. At last, the proposed method is validated by numerical examples, and compared with existing methods, it shows that less conservative external inputs conditions can be obtained.
引用
收藏
页码:2399 / 2409
页数:11
相关论文
共 50 条
  • [21] Online scheduling of image satellites based on neural networks and deep reinforcement learning
    Haijiao WANG
    Zhen YANG
    Wugen ZHOU
    Dalin LI
    Chinese Journal of Aeronautics , 2019, (04) : 1011 - 1019
  • [22] A Deep Reinforcement Learning Heuristic for SAT based on Antagonist Graph Neural Networks
    Fournier, Thomas
    Lallouet, Arnaud
    Cropsal, Telio
    Glorian, Gael
    Papadopoulos, Alexandre
    Petitet, Antoine
    Perez, Guillaume
    Sekar, Suruthy
    Suijlen, Wijnand
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1218 - 1222
  • [23] Robotic Motion Planning Based on Deep Reinforcement Learning and Artificial Neural Networks
    Liu, Huashan
    Li, Xiangjian
    Dong, Menghua
    Gu, Yuqing
    Shen, Bo
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [24] Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
    Brosch, Tobias
    Neumann, Heiko
    Roelfsema, Pieter R.
    PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (10)
  • [25] Reinforcement learning of dynamic behavior by using recurrent neural networks
    Ahmet Onat
    Hajime Kita
    Yoshikazu Nishikawa
    Artificial Life and Robotics, 1997, 1 (3) : 117 - 121
  • [26] Learning to Learn and Compositionality with Deep Recurrent Neural Networks
    de Freitas, Nando
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 3 - 3
  • [27] Performance Analysis of Deep Learning based on Recurrent Neural Networks for Channel Coding
    Sattiraju, Raja
    Weinand, Andreas
    Schotten, Hans D.
    2018 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (ANTS), 2018,
  • [28] Deep Learning Methods for Vessel Trajectory Prediction Based on Recurrent Neural Networks
    Capobianco, Samuele
    Millefiori, Leonardo M.
    Forti, Nicola
    Braca, Paolo
    Willett, Peter
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (06) : 4329 - 4346
  • [29] Recurrent neural networks for reinforcement learning: Architecture, learning algorithms and internal representation
    Onat, A
    Kita, H
    Nishikawa, Y
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2010 - 2015
  • [30] Deep Reinforcement Learning Based Online Network Selection in CRNs With Multiple Primary Networks
    Yang, Yi
    Wang, Ye
    Liu, Kaiyu
    Zhang, Ning
    Gu, Shushi
    Zhang, Qinyu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) : 7691 - 7699