Particle swarm optimization based multi-task parallel reinforcement learning algorithm

被引:4
|
作者
Duan Junhua [1 ]
Zhu Yi-an [1 ]
Zhong Dong [1 ]
Zhang Lixiang [1 ]
Zhang Lin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China
关键词
Multi-task reinforcement learning; parallel reinforcement learning; particle swarm optimization; transfer learning;
D O I
10.3233/JIFS-190209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transfer learning has been identified as conducive to improving the speed of machine learning in many areas. In multi-task reinforcement learning, transfer learning can assist the transfer of experiences between different tasks. The research conducted in this article is focused on two aspects. On the one hand, multi-task parallel transfer learning can improve the learning speed of parallel learning tasks. On the other hand, the learning of the current optimal experience can help the target point rewards to be transmitted to the starting point. The value of this self-learning can also accelerate the convergence speed of the reinforcement learning. According to the research into these two aspects, this paper uses the idea of particle swarm optimization (PSO) to conduct self-learning and interactive learning in multi-task parallel learning. In this paper, a new multi-task learning algorithm named PSO-MTPRL (Multi-Task Parallel Reinforcement Learning based on PSO) is proposed. Based on the idea of PSO algorithm, the Boltzmann strategy, Self-Learning Process (SLP) and Interactive Learning Process (ILP) are selected probabilistically. Based on the characteristic exhibited by reinforcement learning, segmented learning model is recommended. In the early learning stages, the complete Boltzmann exploration strategy is applied, and B-SLP-ILP (Boltzmann-SLP- ILP) learning procedure is conducted exclusively in the middle stage of the learning. In the late learning stages, Boltzmann exploration is involved again. The segmented learning model can help ensure the balance of the exploration and exploitation, in addition to ensuring that all tasks convergence.
引用
收藏
页码:8567 / 8575
页数:9
相关论文
共 50 条
  • [41] Gait Optimization for Multiple Humanoid Robots Based on Parallel Multi-swarm Particle Swarm Algorithm
    Li, Chunguang
    He, Rongyi
    Yao, Lina
    Tao, Chongben
    PROCEEDINGS OF THE 14TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2017), 2017, : 11 - 19
  • [42] Curriculum-Based Asymmetric Multi-Task Reinforcement Learning
    Huang, Hanchi
    Ye, Deheng
    Shen, Li
    Liu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7258 - 7269
  • [43] Multi-Task Reinforcement Learning with Context-based Representations
    Sodhani, Shagun
    Zhang, Amy
    Pineau, Joelle
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [44] A multi-strategy particle swarm optimization framework based on deep reinforcement learning
    Hou, Leyong
    Fan, Debin
    Cheng, Junjie
    Wu, Honglian
    Peng, Hu
    Deng, Changshou
    2023 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE, ICACI, 2023,
  • [45] SOLVING MULTI-OBJECTIVE PROBLEM BASED ON PARALLEL PARTICLE SWARM OPTIMIZATION ALGORITHM
    Zhang, Tao
    Qu, Shihai
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (02) : 445 - 461
  • [46] A parallel particle swarm optimization algorithm for multi-objective optimization problems
    Fan, Shu-Kai S.
    Chang, Ju-Ming
    ENGINEERING OPTIMIZATION, 2009, 41 (07) : 673 - 697
  • [47] Scalable Parallel Task Scheduling for Autonomous Driving Using Multi-Task Deep Reinforcement Learning
    Qi, Qi
    Zhang, Lingxin
    Wang, Jingyu
    Sun, Haifeng
    Zhuang, Zirui
    Liao, Jianxin
    Yu, F. Richard
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13861 - 13874
  • [48] Algorithm for Stereo Matching Based on Multi-Task Learning
    Wang Yufeng
    Wang Hongwei
    Liu Yu
    Yang Mingquan
    Quan Jicheng
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)
  • [49] Portfolio optimization based on multi-task relationship learning
    Ni X.
    Shen X.
    Zhao H.
    Qiu Y.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2021, 41 (06): : 1428 - 1438
  • [50] A Dialogues Summarization Algorithm Based on Multi-task Learning
    Chen, Haowei
    Li, Chen
    Liang, Jiajing
    Tian, Lihua
    NEURAL PROCESSING LETTERS, 2024, 56 (03)