REINFORCEMENT LEARNING-BASED IBVS STRUCTURE FOR CONTROL OF POINT-TO-POINT MOTION OF ROBOT MANIPULATORS

被引：0

作者：

Ye, Ting-Yu ^{[1
]}

Cheng, Ming-Yang ^{[1
]}

Chen, Ya-Ling ^{[1
]}

Huang, Pin-Hsuan ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 701, Taiwan

来源：

JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN | 2020年 / 28卷 / 05期

关键词：

reinforcement learning; visual servoing; robotic system; Q-learning; deep Q-network;

D O I：

10.6119/JMST.202010_28(5).0006

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

In order to facilitate the use of robot manipulators equipped with visual servoing systems so as to enhance the flexibility/functionality of the automatic production line in industry, this paper focuses on applying the reinforcement learning paradigm to the Image-Based Visual Servoing (IBVS) structure. By responding to changes in the environment, the proposed reinforcement learning-based IBVS structure can select the best policy for controlling the position/pose of the robot manipulator so as to converge the error between the image feature and the desired image feature. This paper exploits Q-learning and a deep Q-network to implement a reinforcement learning-based IBVS structure, respectively. In this paper, the states used in reinforcement learning are the coordinates of the image feature point (or grid points) on the image plane, while the action is the increment in the gain constant of the IBVS structure. Three different IBVS structures-conventional IBVS, Q- learningbased IBVS and deep Q-network-based IBVS-are implemented on a 2-DOF planar robot manipulator to perform a point-to-point motion. Experimental results indicate that the proposed deep Q-network-based IBVS structure has the best performance, while the conventional IBVS yields the worst.

引用

页码：367 / 375

页数：9

共 50 条

[21] Iterative learning variable structure controller on linear motors for point-to-point motion
Wu, Jianhua
Ding, Han
2006 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2006, : 888 - +
[22] Point-to-point robot control under actuator constraints
Div. de Física Aplicada, CICESE, Carretera Tijuana-Ensenada Km. 107, Ensenada, B.C., 22800, Mexico
不详
不详
CONTROL ENG. PRACT., 11 (1555-1562):
[23] Point-to-point robot control under actuator constraints
Kelly, R
Santibanez, V
Berghuis, H
CONTROL ENGINEERING PRACTICE, 1997, 5 (11) : 1555 - 1562
[24] Point-to-Point trajectory planning of flexible redundant robot manipulators using genetic algorithms
Yue, SG
Henrich, D
Xu, WL
Tso, SK
ROBOTICA, 2002, 20 : 269 - 280
[25] Smooth point-to-point trajectory planning for robot manipulators by using radial basis functions
Chettibi, Taha
ROBOTICA, 2019, 37 (03) : 539 - 559
[26] Stochastic Point-to-Point Iterative Learning Control Based on Stochastic Approximation
Xu, Yun
Shen, Dong
Zhang, Xiao-Dong
ASIAN JOURNAL OF CONTROL, 2017, 19 (05) : 1748 - 1755
[27] Optimal Control of Point-to-Point Navigation in Turbulent Time Dependent Flows Using Reinforcement Learning
Buzzicotti, Michele
Biferale, Luca
Bonaccorso, Fabio
Di Leoni, Patricio Clark
Gustavsson, Kristian
AIXIA 2020 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 12414 : 223 - 234
[28] POINT-TO-POINT QUASI-STATIC MOTION PLANNING FOR FLEXIBLE-LINK MANIPULATORS
XI, F
FENTON, RG
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1995, 11 (05): : 770 - 776
[29] OPTIMAL MOTION PLANNING OF MANIPULATORS WITH ELASTIC LINKS AND JOINTS IN GENERALIZED POINT-TO-POINT TASK
Rahimi, H. N.
Korayem, M. H.
Nikoobin, A.
PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, VOL 7, PTS A AND B, 2010, : 1167 - 1174
[30] Lifelong deep learning-based control of robot manipulators
Ganie, Irfan
Sarangapani, Jagannathan
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023, 37 (12) : 3169 - 3192

← 1 2 3 4 5 →