Reinforcement Learning with Attention that Works: A Self-Supervised Approach

Cited by: 31
Authors
Manchin, Anthony [1]
Abbasnejad, Ehsan [1]
van den Hengel, Anton [1]
Affiliations
[1] Univ Adelaide, Australian Inst Machine Learning, Adelaide, SA, Australia
Keywords
Reinforcement learning; Attention; Deep learning
DOI
10.1007/978-3-030-36802-9_25
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Attention models have had a significant positive impact on deep learning across a range of tasks. However, previous attempts at integrating attention with reinforcement learning have failed to produce significant improvements. Unlike the selective attention models used in previous attempts, which constrain the attention via preconceived notions of importance, our implementation utilises the Markovian properties inherent in the state input. We propose the first combination of self-attention and reinforcement learning that is capable of producing significant improvements, including new state-of-the-art results in the Arcade Learning Environment.
Pages: 223-230
Number of pages: 8
Related papers
50 in total
  • [31] COMPETITIVE MULTI-AGENT REINFORCEMENT LEARNING WITH SELF-SUPERVISED REPRESENTATION
    Su, DiJia
    Lee, Jason D.
    Mulvey, John M.
    Poor, H. Vincent
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4098 - 4102
  • [32] Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
    Chen, Yinda
    Huang, Wei
    Zhou, Shenglong
    Chen, Qi
    Xiong, Zhiwei
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 609 - 617
  • [33] Parkinson's Disease Classification with Self-supervised Learning and Attention Mechanism
    Zhang, Yuchen
    Lei, Haijun
    Huang, Zhongwei
    Zhao, Menglu
    Li, Zhen
    Liu, Chuan-Ming
    Lei, Baiying
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4601 - 4607
  • [34] Dynamic-boosting attention for self-supervised video representation learning
    Wang, Zhipeng
    Hou, Chunping
    Yue, Guanghui
    Yang, Qingyuan
    Applied Intelligence, 2022, 52 : 3143 - 3155
  • [35] Improving novelty detection by self-supervised learning and channel attention mechanism
    Tian, Miao
    Cui, Ying
    Long, Haixia
    Li, Junxia
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2021, 48 (05): : 673 - 679
  • [36] Self-Supervised Attention Learning for Depth and Ego-motion Estimation
    Sadek, Assem
    Chidlovskii, Boris
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10054 - 10060
  • [37] Improving novelty detection by self-supervised learning and channel attention mechanism
    Tian, Miao
    Cui, Ying
    Long, Haixia
    Li, Junxia
    Industrial Robot, 2021, 48 (05): : 673 - 679
  • [38] Dynamic-boosting attention for self-supervised video representation learning
    Wang, Zhipeng
    Hou, Chunping
    Yue, Guanghui
    Yang, Qingyuan
    APPLIED INTELLIGENCE, 2022, 52 (03) : 3143 - 3155
  • [39] Gated Self-supervised Learning for Improving Supervised Learning
    Fuadi, Erland Hillman
    Ruslim, Aristo Renaldo
    Wardhana, Putu Wahyu Kusuma
    Yudistira, Novanto
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 611 - 615
  • [40] Topic attention encoder: A self-supervised approach for short text clustering
    Jin, Jian
    Zhao, Haiyuan
    Ji, Ping
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (05) : 701 - 717