Optimization of Deep Reinforcement Learning with Hybrid Multi-Task Learning

被引:1
|
作者
Varghese, Nelson Vithayathil [1 ]
Mahmoud, Qusay H. [1 ]
机构
[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada
关键词
deep reinforcement learning; transfer learning; multi-tasking; actor-mimic;
D O I
10.1109/SysCon48628.2021.9447080
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an outcome of the technological advancements occurred within artificial intelligence (AI) domain in recent times, deep learning (DL) has been established its position as a prominent representation learning method for all forms of machine learning (ML), including the reinforcement learning (RL). Subsequently, leading to the evolution of deep reinforcement learning (DRL) which combines deep learning's high representational learning capabilities with current reinforcement learning methods. Undoubtedly, this new direction has caused a pivotal role towards the performance optimization of intelligent RL systems designed by following model-free based methodology. Optimization of the performance achieved with this methodology was majorly restricted to intelligent systems having reinforcement learning algorithms designed to learn single task at a time. Simultaneously, single task-based learning method was observed as quite less efficient in terms of data, especially when such intelligent systems required operate under too complex as well as data rich conditions. The prime reason for this was because of the restricted application of existing methods to wide range of scenarios, and associated tasks from those operating environments. One of the possible approaches to mitigate this issue is by adopting the method of multi-task learning. Objective of this research paper is to present a parallel multi-task learning (PMTL) approach for the optimization of deep reinforcement learning agents operating within two different by semantically similar environments with related tasks. The proposed framework will be built with multiple individual actor-critic models functioning within each environment and transferring the knowledge among themselves through a global network to optimize the performance.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Multi-task end-edge offloading based on Lyapunov optimization and deep reinforcement learning
    Xu C.
    Tang Z.-X.
    Jin X.
    Xia C.-Q.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (07): : 2457 - 2464
  • [22] Unsupervised Task Clustering for Multi-task Reinforcement Learning
    Ackermann, Johannes
    Richter, Oliver
    Wattenhofer, Roger
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 222 - 237
  • [23] Study on deep reinforcement learning for multi-task scheduling in cloud manufacturing
    Xiao, Jiuhong
    Cai, Yishuai
    Chen, Yong
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2025,
  • [24] Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
    Xu, Zhiyuan
    Wu, Kun
    Che, Zhengping
    Tang, Jian
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [25] A reinforcement learning assisted evolutionary algorithm for constrained multi-task optimization
    Yang, Yufei
    Zhang, Changsheng
    Zhang, Bin
    Ning, Jiaxu
    INFORMATION SCIENCES, 2024, 678
  • [26] A multi-task learning model with reinforcement optimization for ASD comorbidity discrimination
    Dong, Heyou
    Chen, Dan
    Chen, Yukang
    Tang, Yunbo
    Yin, Dingze
    Li, Xiaoli
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [27] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141
  • [28] Learning potential functions and their representations for multi-task reinforcement learning
    Matthijs Snel
    Shimon Whiteson
    Autonomous Agents and Multi-Agent Systems, 2014, 28 : 637 - 681
  • [29] Multi-Task Reinforcement Learning with Soft Modularization
    Yang, Ruihan
    Xu, Huazhe
    Wu, Yi
    Wang, Xiaolong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] Adversarial Online Multi-Task Reinforcement Learning
    Nguyen, Quan
    Mehta, Nishant A.
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1124 - 1165