Attentive multi-view reinforcement learning

被引:0
|
作者
Yueyue Hu
Shiliang Sun
Xin Xu
Jing Zhao
机构
[1] East China Normal University,School of Computer Science and Technology
[2] National University of Defense Technology,College of Intelligence Science and Technology
关键词
Deep reinforcement learning; Function approximation; Multi-view learning; Representation learning;
D O I
暂无
中图分类号
学科分类号
摘要
The reinforcement learning process usually takes millions of steps from scratch, due to the limited observation experience. More precisely, the representation approximated by a single deep network is usually limited for reinforcement learning agents. In this paper, we propose a novel multi-view deep attention network (MvDAN), which introduces multi-view representation learning into the reinforcement learning framework for the first time. Based on the multi-view scheme of function approximation, the proposed model approximates multiple view-specific policy or value functions in parallel by estimating the middle-level representation and integrates these functions based on attention mechanisms to generate a comprehensive strategy. Furthermore, we develop the multi-view generalized policy improvement to jointly optimize all policies instead of a single one. Compared with the single-view function approximation scheme in reinforcement learning methods, experimental results on eight Atari benchmarks show that MvDAN outperforms the state-of-the-art methods and has faster convergence and training stability.
引用
收藏
页码:2461 / 2474
页数:13
相关论文
共 50 条
  • [31] Robust cooperative multi-agent reinforcement learning via multi-view message certification
    Yuan, Lei
    Jiang, Tao
    Li, Lihe
    Chen, Feng
    Zhang, Zongzhang
    Yu, Yang
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [32] Robust cooperative multi-agent reinforcement learning via multi-view message certification
    Lei Yuan
    Tao Jiang
    Lihe Li
    Feng Chen
    Zongzhang Zhang
    Yang Yu
    Science China Information Sciences, 2024, 67
  • [33] Multi-View Multi-Instance Learning Based on Joint Sparse Representation and Multi-View Dictionary Learning
    Li, Bing
    Yuan, Chunfeng
    Xiong, Weihua
    Hu, Weiming
    Peng, Houwen
    Ding, Xinmiao
    Maybank, Steve
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2554 - 2560
  • [34] Contrastive Consistency and Attentive Complementarity for Deep Multi-View Subspace Clustering
    Wang, Jiao
    Wu, Bin
    Zhang, Hongying
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 143 - 160
  • [35] Learning to Generate Personalized Query Auto-Completions via a Multi-View Multi-Task Attentive Approach
    Yin, Di
    Tan, Jiwei
    Zhang, Zhe
    Deng, Hongbo
    Huang, Shujian
    Chen, Jiajun
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2998 - 3007
  • [36] Attentive Multi-task Deep Reinforcement Learning
    Bram, Timo
    Brunner, Gino
    Richter, Oliver
    Wattenhofer, Roger
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 134 - 149
  • [37] Ensemble multi-view feature set partitioning method for effective multi-view learning
    Singh, Ritika
    Kumar, Vipin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4957 - 5001
  • [38] Multi-view reinforcement learning for sequential decision-making with insufficient state information
    Min Li
    William Zhu
    Shiping Wang
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1533 - 1552
  • [39] SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation
    Cheng, Tong
    Dong, Hang
    Wang, Lu
    Qiao, Bo
    Lin, Qingwei
    Rajmohan, Saravan
    Moscibroda, Thomas
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2024, 244 : 698 - 717
  • [40] Identifying the potential miRNA biomarkers based on multi-view networks and reinforcement learning for diseases
    Su, Benzhe
    Wang, Weiwei
    Lin, Xiaohui
    Liu, Shenglan
    Huang, Xin
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (01)