Deep Reinforcement Learning Based Dynamic Proportional-Integral (PI) Gain Auto-Tuning Method for a Robot Driver System

被引：6

作者：

Park, Joonghoo ^{[1
]}

Kim, Heejung ^{[1
]}

Hwang, Kyunghun ^{[2
]}

Lim, Sejoon ^{[3
]}

机构：

[1] Kookmin Univ, Grad Sch Automot Engn, Seoul 02707, South Korea

[2] Hyundai Motor Co, Electrificat Energy Efficiency & Drivabil Team 3, Hwaseong 18280, South Korea

[3] Kookmin Univ, Dept Automobile & IT Convergence, Seoul 02707, South Korea

来源：

IEEE ACCESS | 2022年 / 10卷

基金：

新加坡国家研究基金会;

关键词：

Robots; Vehicles; PI control; Heuristic algorithms; Vehicle dynamics; Control systems; Dynamometers; Automation; deep Q-learning; emission test; machine learning; PID control; reinforcement learning; vehicle control; NEURAL-NETWORK CONTROL;

D O I：

10.1109/ACCESS.2022.3159785

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To meet the growing trend of stringent fuel economy regulations, automakers around the world are designing modules such as engines, motors, transmissions and batteries to be as efficient as possible. In order to verify the effect of these designs on the overall fuel efficiency of the vehicle, the vehicle equipped with each module is placed on the chassis dynamometer, driven to follow the target vehicle speed, and actual fuel efficiency is measured. These tests are traditionally performed by human operators, but are now being replaced by robots (physical or software) to ensure the accuracy and reliability of test results. Although the conventionally proposed proportional integral (PI)-based controller has a simple structure and is easy to implement, it requires the process of finding the optimal gain whenever the test conditions such as vehicle or drive cycle change, which is difficult and time consuming. In this study, we propose a proportional integral controller gain adjustment algorithm using deep reinforcement learning. The reinforcement learning agent learns to dynamically modify the PI gain value of the acceleration/deceleration pedal to better follow the target vehicle in a simulation environment. The perturbation is used in each training episode to reduce the difference between the simulation and real testing environment. Upon completion of the training process, the trained agent performs an adjustment process that generates a reference gain table. We then use this reference gain table to perform a real test. The performance of the proposed system was evaluated using Hyundai Tucson HEV (NX4) on an AVL chassis dynamometer. We also compared the performance of our proposed algorithm to traditional fuzzy logic-based PI controllers. The obtained experimental results show that the proposed control system achieved a performance improvement of aounrd 46.8% compared to the conventional PI control system in terms of root mean square error.

引用

页码：31043 / 31057

页数：15

共 50 条

[41] Dynamic Tuning of PI-Controllers based on Model-free Reinforcement Learning Methods
Brujeni, Lena Abbasi
Lee, Jong Min
Shah, Sirish L.
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 453 - 458
[42] Guidewire feeding method based on deep reinforcement learning for vascular intervention robot
Yang, Deai
Song, Jingzhou
Hu, Yuhang
PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 1287 - 1293
[43] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
Meng, Haitao
Zhang, Hengrui
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
[44] A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot
Liu, Chuzhao
Gao, Junyao
Tian, Dingkui
Zhang, Xuefeng
Liu, Huaxin
Meng, Libo
APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 17
[45] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
Yanglong Liu
Zuguo Chen
Yonggang Li
Ming Lu
Chaoyang Chen
Xuzhuo Zhang
International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
[46] A Behavior-Based Mobile Robot Navigation Method with Deep Reinforcement Learning
Li, Juncheng
Ran, Maopeng
Wang, Han
Xie, Lihua
UNMANNED SYSTEMS, 2021, 9 (03) : 201 - 209
[47] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
Liu, Yanglong
Chen, Zuguo
Li, Yonggang
Lu, Ming
Chen, Chaoyang
Zhang, Xuzhuo
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (08) : 2669 - 2680
[48] AC speed servo system with α-parameter ultimate sensitivity method based fuzzy reasoning auto-tuning approach
Yoshitsugu, J
Inoue, K
Nakaoka, M
SEVENTH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND VARIABLE SPEED DRIVES, 1998, (456): : 548 - 553
[49] Novel design method for the ball mill pulverizing system based on fuzzy reasoning and auto-tuning PID control
Li, XF
Zhang, JH
Zhu, DY
Zhang, C
NEW TECHNOLOGIES FOR COMPUTER CONTROL 2001, 2002, : 473 - 478
[50] Proportional-Integral Controllers Performance of a Grid-Connected Solar PV System with Particle Swarm Optimization and Ziegler-Nichols Tuning Method
Dezelak, Klemen
Bracinik, Peter
Sredensek, Klemen
Seme, Sebastijan
ENERGIES, 2021, 14 (09)

← 1 2 3 4 5 →