Accelerated multi-objective task learning using modified Q-learning algorithm

被引:0
|
作者
Rajamohan, Varun Prakash [1 ]
Jagatheesaperumal, Senthil Kumar [1 ]
机构
[1] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi, Tamil Nadu, India
关键词
reinforcement learning; Q-learning; robotic manipulator; task learning; distance metric;
D O I
10.1504/IJAHUC.2024.140665
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robots find extensive applications in industry. In recent years, the influence of robots has also increased rapidly in domestic scenarios. The Q-learning algorithm aims to maximise the reward for reaching the goal. This paper proposes a modified version of the Q-learning algorithm, known as Q-learning with scaled distance metric (Q - SD). This algorithm enhances task learning and makes task completion more meaningful. A robotic manipulator (agent) applies the Q - SD algorithm to the task of table cleaning. Using Q - SD, the agent acquires the sequence of steps necessary to accomplish the task while minimising the manipulator's movement distance. We partition the table into grids of different dimensions. The first has a grid count of 3 x 3, and the second has a grid count of 4 x 4. Using the Q - SD algorithm, the maximum success obtained in these two environments was 86% and 59% respectively. Moreover, compared to the conventional Q-learning algorithm, the drop in average distance moved by the agent in these two environments using the Q - SD algorithm was 8.61% and 6.7% respectively.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 50 条
  • [41] Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning
    Cheng, Xiu
    Browne, Will N.
    Zhang, Mengjie
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 622 - 629
  • [42] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
    Graham, Caoimhin
    Bell, David
    Luo, Zhihui
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
  • [43] Multi-objective optimization of heat exchangers using a modified teaching-learning-based optimization algorithm
    Rao, R. Venkata
    Patel, Vivek
    APPLIED MATHEMATICAL MODELLING, 2013, 37 (03) : 1147 - 1162
  • [44] Anomaly Detection using Fuzzy Q-learning Algorithm
    Shamshirband, Shahaboddin
    Anuar, Nor Badrul
    Kiah, Miss Laiha Mat
    Misra, Sanjay
    ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 5 - 28
  • [45] An improved Q-learning algorithm using synthetic pheromones
    Monekosso, N
    Remagnino, P
    Szarowicz, A
    FROM THEORY TO PRACTICE IN MULTI-AGENT SYSTEMS, 2002, 2296 : 197 - 206
  • [46] Research on multi-objective control strategy of thermal management system of pure electric vehicle at low temperature based on Q-learning algorithm
    Zhan, Sen
    Huang, Yu
    Li, Fei
    Yin, Yanli
    Liu, Chunsheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024,
  • [47] MULTI-OBJECTIVE MULTI-TASK LEARNING ON RNNLM FOR SPEECH RECOGNITION
    Song, Minguang
    Zhao, Yunxin
    Wang, Shaojun
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 197 - 203
  • [48] Unmanned ground weapon target assignment based on deep Q-learning network with an improved multi-objective artificial bee colony algorithm
    Wang, Tong
    Fu, Liyue
    Wei, Zhengxian
    Zhou, Yuhu
    Gao, Shan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [49] A Q-learning driven multi-objective evolutionary algorithm for worker fatigue dual-resource-constrained distributed hybrid flow shop
    Song, Haonan
    Li, Junqing
    Du, Zhaosheng
    Yu, Xin
    Xu, Ying
    Zheng, Zhixin
    Li, Jiake
    COMPUTERS & OPERATIONS RESEARCH, 2025, 175
  • [50] Track Learning Agent Using Multi-objective Reinforcement Learning
    Shah, Rushabh
    Ruparel, Vidhi
    Prabhu, Mukul
    D'mello, Lynette
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 1, CIS 2023, 2024, 868 : 27 - 40