REINFORCEMENT LEARNING-BASED IBVS STRUCTURE FOR CONTROL OF POINT-TO-POINT MOTION OF ROBOT MANIPULATORS

被引:0
|
作者
Ye, Ting-Yu [1 ]
Cheng, Ming-Yang [1 ]
Chen, Ya-Ling [1 ]
Huang, Pin-Hsuan [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 701, Taiwan
来源
关键词
reinforcement learning; visual servoing; robotic system; Q-learning; deep Q-network;
D O I
10.6119/JMST.202010_28(5).0006
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In order to facilitate the use of robot manipulators equipped with visual servoing systems so as to enhance the flexibility/functionality of the automatic production line in industry, this paper focuses on applying the reinforcement learning paradigm to the Image-Based Visual Servoing (IBVS) structure. By responding to changes in the environment, the proposed reinforcement learning-based IBVS structure can select the best policy for controlling the position/pose of the robot manipulator so as to converge the error between the image feature and the desired image feature. This paper exploits Q-learning and a deep Q-network to implement a reinforcement learning-based IBVS structure, respectively. In this paper, the states used in reinforcement learning are the coordinates of the image feature point (or grid points) on the image plane, while the action is the increment in the gain constant of the IBVS structure. Three different IBVS structures-conventional IBVS, Q- learningbased IBVS and deep Q-network-based IBVS-are implemented on a 2-DOF planar robot manipulator to perform a point-to-point motion. Experimental results indicate that the proposed deep Q-network-based IBVS structure has the best performance, while the conventional IBVS yields the worst.
引用
收藏
页码:367 / 375
页数:9
相关论文
共 50 条
  • [21] Iterative learning variable structure controller on linear motors for point-to-point motion
    Wu, Jianhua
    Ding, Han
    2006 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2006, : 888 - +
  • [22] Point-to-point robot control under actuator constraints
    Div. de Física Aplicada, CICESE, Carretera Tijuana-Ensenada Km. 107, Ensenada, B.C., 22800, Mexico
    不详
    不详
    CONTROL ENG. PRACT., 11 (1555-1562):
  • [23] Point-to-point robot control under actuator constraints
    Kelly, R
    Santibanez, V
    Berghuis, H
    CONTROL ENGINEERING PRACTICE, 1997, 5 (11) : 1555 - 1562
  • [24] Point-to-Point trajectory planning of flexible redundant robot manipulators using genetic algorithms
    Yue, SG
    Henrich, D
    Xu, WL
    Tso, SK
    ROBOTICA, 2002, 20 : 269 - 280
  • [25] Smooth point-to-point trajectory planning for robot manipulators by using radial basis functions
    Chettibi, Taha
    ROBOTICA, 2019, 37 (03) : 539 - 559
  • [26] Stochastic Point-to-Point Iterative Learning Control Based on Stochastic Approximation
    Xu, Yun
    Shen, Dong
    Zhang, Xiao-Dong
    ASIAN JOURNAL OF CONTROL, 2017, 19 (05) : 1748 - 1755
  • [27] Optimal Control of Point-to-Point Navigation in Turbulent Time Dependent Flows Using Reinforcement Learning
    Buzzicotti, Michele
    Biferale, Luca
    Bonaccorso, Fabio
    Di Leoni, Patricio Clark
    Gustavsson, Kristian
    AIXIA 2020 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 12414 : 223 - 234
  • [28] POINT-TO-POINT QUASI-STATIC MOTION PLANNING FOR FLEXIBLE-LINK MANIPULATORS
    XI, F
    FENTON, RG
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1995, 11 (05): : 770 - 776
  • [29] OPTIMAL MOTION PLANNING OF MANIPULATORS WITH ELASTIC LINKS AND JOINTS IN GENERALIZED POINT-TO-POINT TASK
    Rahimi, H. N.
    Korayem, M. H.
    Nikoobin, A.
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, VOL 7, PTS A AND B, 2010, : 1167 - 1174
  • [30] Lifelong deep learning-based control of robot manipulators
    Ganie, Irfan
    Sarangapani, Jagannathan
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023, 37 (12) : 3169 - 3192