Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving

被引:8
|
作者
Huang, Zilin [1 ]
Sheng, Zihao [1 ]
Ma, Chengyuan [1 ]
Chen, Sikai [1 ]
机构
[1] Univ Wisconsin Madison, Dept Civil & Environm Engn, Madison, WI 53706 USA
关键词
Human as AI mentor paradigm; Autonomous driving; Deep reinforcement learning; Human -in -the -loop learning; Driving policy; Mixed traffic platoon; MODEL;
D O I
10.1016/j.commtr.2024.100127
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential accidents. On the other hand, the agent could be guided to minimize traffic flow disturbance, thereby optimizing traffic flow efficiency. In detail, HAIM-DRL leverages data collected from free exploration and partial human demonstrations as its two training sources. Remarkably, we circumvent the intricate process of manually designing reward functions; instead, we directly derive proxy state-action values from partial human demonstrations to guide the agents' policy learning. Additionally, we employ a minimal intervention technique to reduce the human mentor's cognitive load. Comparative results show that HAIM-DRL outperforms traditional methods in driving safety, sampling efficiency, mitigation of traffic flow disturbance, and generalizability to unseen traffic scenarios.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Efficient-enhanced Reinforcement Learning for Autonomous Driving in Urban Traffic Scenarios
    Yin, Jianwen
    Jiang, Zhengmin
    Liang, Qingyi
    Li, Wenfei
    Pan, Zhongming
    Li, Huiyun
    Liu, Jia
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 887 - 893
  • [32] Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies
    Abbas, Ammar N.
    Amazu, Chidera W.
    Mietkiewicz, Joseph
    Briwa, Houda
    Perez, Andres Alonso
    Baldissone, Gabriele
    Demichela, Micaela
    Chasparis, Georgios C.
    Kelleher, John D.
    Leva, Maria Chiara
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
  • [33] Deep Reinforcement Active Learning for Human-In-The-Loop Person Re-Identification
    Liu, Zimo
    Wang, Jingya
    Gong, Shaogang
    Lu, Huchuan
    Tao, Dacheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6121 - 6130
  • [34] A survey of human-in-the-loop for machine learning
    Wu, Xingjiao
    Xiao, Luwei
    Sun, Yixuan
    Zhang, Junhang
    Ma, Tianlong
    He, Liang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
  • [35] Human-in-the-loop Applied Machine Learning
    Brodley, Carla E.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
  • [36] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
    Gao, Xiang
    Si, Jennie
    Wen, Yue
    Li, Minhan
    Huang, He
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
  • [37] Artificial Swarm Intelligence, a Human-in-the-Loop Approach to AI
    Rosenberg, Louis
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4381 - 4382
  • [38] Constructing Ethical AI Based on the "Human-in-the-Loop" System
    Chen, Ximeng
    Wang, Xiaohong
    Qu, Yanzhang
    SYSTEMS, 2023, 11 (11):
  • [39] Human-in-the-Loop AI Reviewing: Feasibility, Opportunities, and Risks
    Drori, Iddo
    Te'eni, Dov
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (01): : 98 - 109
  • [40] Synthesis of Human-in-the-Loop Control Protocols for Autonomous Systems
    Feng, Lu
    Wiltsche, Clemens
    Humphrey, Laura
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (02) : 450 - 462