Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving

被引：8

作者：

Huang, Zilin ^{[1
]}

Sheng, Zihao ^{[1
]}

Ma, Chengyuan ^{[1
]}

Chen, Sikai ^{[1
]}

机构：

[1] Univ Wisconsin Madison, Dept Civil & Environm Engn, Madison, WI 53706 USA

来源：

COMMUNICATIONS IN TRANSPORTATION RESEARCH | 2024年 / 4卷

关键词：

Human as AI mentor paradigm; Autonomous driving; Deep reinforcement learning; Human -in -the -loop learning; Driving policy; Mixed traffic platoon; MODEL;

D O I：

10.1016/j.commtr.2024.100127

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential accidents. On the other hand, the agent could be guided to minimize traffic flow disturbance, thereby optimizing traffic flow efficiency. In detail, HAIM-DRL leverages data collected from free exploration and partial human demonstrations as its two training sources. Remarkably, we circumvent the intricate process of manually designing reward functions; instead, we directly derive proxy state-action values from partial human demonstrations to guide the agents' policy learning. Additionally, we employ a minimal intervention technique to reduce the human mentor's cognitive load. Comparative results show that HAIM-DRL outperforms traditional methods in driving safety, sampling efficiency, mitigation of traffic flow disturbance, and generalizability to unseen traffic scenarios.

引用

页数：24

共 50 条

[31] Efficient-enhanced Reinforcement Learning for Autonomous Driving in Urban Traffic Scenarios
Yin, Jianwen
Jiang, Zhengmin
Liang, Qingyi
Li, Wenfei
Pan, Zhongming
Li, Huiyun
Liu, Jia
2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 887 - 893
[32] Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies
Abbas, Ammar N.
Amazu, Chidera W.
Mietkiewicz, Joseph
Briwa, Houda
Perez, Andres Alonso
Baldissone, Gabriele
Demichela, Micaela
Chasparis, Georgios C.
Kelleher, John D.
Leva, Maria Chiara
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
[33] Deep Reinforcement Active Learning for Human-In-The-Loop Person Re-Identification
Liu, Zimo
Wang, Jingya
Gong, Shaogang
Lu, Huchuan
Tao, Dacheng
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6121 - 6130
[34] A survey of human-in-the-loop for machine learning
Wu, Xingjiao
Xiao, Luwei
Sun, Yixuan
Zhang, Junhang
Ma, Tianlong
He, Liang
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
[35] Human-in-the-loop Applied Machine Learning
Brodley, Carla E.
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
[36] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
Gao, Xiang
Si, Jennie
Wen, Yue
Li, Minhan
Huang, He
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
[37] Artificial Swarm Intelligence, a Human-in-the-Loop Approach to AI
Rosenberg, Louis
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4381 - 4382
[38] Constructing Ethical AI Based on the "Human-in-the-Loop" System
Chen, Ximeng
Wang, Xiaohong
Qu, Yanzhang
SYSTEMS, 2023, 11 (11):
[39] Human-in-the-Loop AI Reviewing: Feasibility, Opportunities, and Risks
Drori, Iddo
Te'eni, Dov
JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (01): : 98 - 109
[40] Synthesis of Human-in-the-Loop Control Protocols for Autonomous Systems
Feng, Lu
Wiltsche, Clemens
Humphrey, Laura
Topcu, Ufuk
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (02) : 450 - 462

← 1 2 3 4 5 →