Deep Reinforcement Learning Based Dynamic Proportional-Integral (PI) Gain Auto-Tuning Method for a Robot Driver System

被引:6
|
作者
Park, Joonghoo [1 ]
Kim, Heejung [1 ]
Hwang, Kyunghun [2 ]
Lim, Sejoon [3 ]
机构
[1] Kookmin Univ, Grad Sch Automot Engn, Seoul 02707, South Korea
[2] Hyundai Motor Co, Electrificat Energy Efficiency & Drivabil Team 3, Hwaseong 18280, South Korea
[3] Kookmin Univ, Dept Automobile & IT Convergence, Seoul 02707, South Korea
来源
IEEE ACCESS | 2022年 / 10卷
基金
新加坡国家研究基金会;
关键词
Robots; Vehicles; PI control; Heuristic algorithms; Vehicle dynamics; Control systems; Dynamometers; Automation; deep Q-learning; emission test; machine learning; PID control; reinforcement learning; vehicle control; NEURAL-NETWORK CONTROL;
D O I
10.1109/ACCESS.2022.3159785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To meet the growing trend of stringent fuel economy regulations, automakers around the world are designing modules such as engines, motors, transmissions and batteries to be as efficient as possible. In order to verify the effect of these designs on the overall fuel efficiency of the vehicle, the vehicle equipped with each module is placed on the chassis dynamometer, driven to follow the target vehicle speed, and actual fuel efficiency is measured. These tests are traditionally performed by human operators, but are now being replaced by robots (physical or software) to ensure the accuracy and reliability of test results. Although the conventionally proposed proportional integral (PI)-based controller has a simple structure and is easy to implement, it requires the process of finding the optimal gain whenever the test conditions such as vehicle or drive cycle change, which is difficult and time consuming. In this study, we propose a proportional integral controller gain adjustment algorithm using deep reinforcement learning. The reinforcement learning agent learns to dynamically modify the PI gain value of the acceleration/deceleration pedal to better follow the target vehicle in a simulation environment. The perturbation is used in each training episode to reduce the difference between the simulation and real testing environment. Upon completion of the training process, the trained agent performs an adjustment process that generates a reference gain table. We then use this reference gain table to perform a real test. The performance of the proposed system was evaluated using Hyundai Tucson HEV (NX4) on an AVL chassis dynamometer. We also compared the performance of our proposed algorithm to traditional fuzzy logic-based PI controllers. The obtained experimental results show that the proposed control system achieved a performance improvement of aounrd 46.8% compared to the conventional PI control system in terms of root mean square error.
引用
收藏
页码:31043 / 31057
页数:15
相关论文
共 50 条
  • [41] Dynamic Tuning of PI-Controllers based on Model-free Reinforcement Learning Methods
    Brujeni, Lena Abbasi
    Lee, Jong Min
    Shah, Sirish L.
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 453 - 458
  • [42] Guidewire feeding method based on deep reinforcement learning for vascular intervention robot
    Yang, Deai
    Song, Jingzhou
    Hu, Yuhang
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 1287 - 1293
  • [43] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [44] A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot
    Liu, Chuzhao
    Gao, Junyao
    Tian, Dingkui
    Zhang, Xuefeng
    Liu, Huaxin
    Meng, Libo
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 17
  • [45] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Yanglong Liu
    Zuguo Chen
    Yonggang Li
    Ming Lu
    Chaoyang Chen
    Xuzhuo Zhang
    International Journal of Control, Automation and Systems, 2022, 20 : 2669 - 2680
  • [46] A Behavior-Based Mobile Robot Navigation Method with Deep Reinforcement Learning
    Li, Juncheng
    Ran, Maopeng
    Wang, Han
    Xie, Lihua
    UNMANNED SYSTEMS, 2021, 9 (03) : 201 - 209
  • [47] Robot Search Path Planning Method Based on Prioritized Deep Reinforcement Learning
    Liu, Yanglong
    Chen, Zuguo
    Li, Yonggang
    Lu, Ming
    Chen, Chaoyang
    Zhang, Xuzhuo
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (08) : 2669 - 2680
  • [48] AC speed servo system with α-parameter ultimate sensitivity method based fuzzy reasoning auto-tuning approach
    Yoshitsugu, J
    Inoue, K
    Nakaoka, M
    SEVENTH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND VARIABLE SPEED DRIVES, 1998, (456): : 548 - 553
  • [49] Novel design method for the ball mill pulverizing system based on fuzzy reasoning and auto-tuning PID control
    Li, XF
    Zhang, JH
    Zhu, DY
    Zhang, C
    NEW TECHNOLOGIES FOR COMPUTER CONTROL 2001, 2002, : 473 - 478
  • [50] Proportional-Integral Controllers Performance of a Grid-Connected Solar PV System with Particle Swarm Optimization and Ziegler-Nichols Tuning Method
    Dezelak, Klemen
    Bracinik, Peter
    Sredensek, Klemen
    Seme, Sebastijan
    ENERGIES, 2021, 14 (09)