Cooperative Multi-Agent Reinforcement Learning Framework for Edge Intelligence-Empowered Traffic Light Control

被引:0
|
作者
Shi, Haiyong [1 ]
Liu, Bingyi [1 ]
Wang, Enshu [2 ]
Han, Weizhen [1 ]
Wang, Jinfan [3 ]
Cui, Shihong [4 ]
Wu, Libing [2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430070, Peoples R China
[3] Southern Univ Sci & Technol, Inst Future Networks, Shenzhen 518055, Peoples R China
[4] Tianyijiaotong Technol Co Ltd, Mkt Dept, Suzhou 215000, Peoples R China
基金
中国国家自然科学基金;
关键词
Consumer electronics; Adaptation models; Training; Roads; Vehicle dynamics; Decision making; Real-time systems; Edge intelligence; traffic light control; cooperative multi-agent reinforcement learning; options framework; mixing network; SCHEME; URBAN;
D O I
10.1109/TCE.2024.3416822
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge Intelligence (EI) technologies obtain an advance with promotion by Consumer Electronics (CE) and spread to the Intelligent Transportation System (ITS). As part of the edge in ITS, traffic lights suffer from overlooking the importance of cooperation among traffic lights and lack of long sequence scheduling. To address this challenge, we formulate the control problem of multi-intersection traffic lights as a multi-agent Markov game problem. In response, we propose a Cooperative Adaptive Control Method (CACOM), a framework based on multi-agent reinforcement learning. CACOM integrates the mixing network and the options framework. Specifically, the mixing network enables cooperation among intersections, and the options framework provides the ability for intersections to make a long sequence scheduling. Besides, we designed a weight generator for the mixing network based on the traffic conditions at intersections, allowing the agents to adjust their weights adaptively during cooperation. Finally, we build a simulator including two real-world urban road networks for extensive evaluation. In contrast to the best baseline methods, our approach achieves an average waiting time reduction of around 24% and 42% for high-priority vehicles in two scenarios. Moreover, the waiting time for all vehicles is decreased by approximately 15% and 6%, respectively.
引用
收藏
页码:7373 / 7384
页数:12
相关论文
共 50 条
  • [1] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
    Tahifa, Mohammed
    Boumhidi, Jaouad
    Yahyaouy, Ali
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [2] Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach
    Kolat, Mate
    Kovari, Balint
    Becsi, Tamas
    Aradi, Szilard
    SUSTAINABILITY, 2023, 15 (04)
  • [3] Cooperative Traffic Signal Control Based on Multi-agent Reinforcement Learning
    Gao, Ruowen
    Liu, Zhihan
    Li, Jinglin
    Yuan, Quan
    BLOCKCHAIN AND TRUSTWORTHY SYSTEMS, BLOCKSYS 2019, 2020, 1156 : 787 - 793
  • [4] Multi-Agent Deep Reinforcement Learning with Clustering and Information Sharing for Traffic Light Cooperative Control
    Du T.
    Wang B.
    Cheng H.
    Luo L.
    Zeng N.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (02): : 538 - 545
  • [5] Collective Deep Reinforcement Learning for Intelligence Sharing in the Internet of Intelligence-Empowered Edge Computing
    Tang, Qinqin
    Xie, Renchao
    Yu, Fei Richard
    Chen, Tianjiao
    Zhang, Ran
    Huang, Tao
    Liu, Yunjie
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (11) : 6327 - 6342
  • [6] Multi-agent broad reinforcement learning for intelligent traffic light control
    Zhu, Ruijie
    Li, Lulu
    Wu, Shuning
    Lv, Pei
    Li, Yafei
    Xu, Mingliang
    INFORMATION SCIENCES, 2023, 619 : 509 - 525
  • [7] Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control
    Zhao, Yang
    Hu, Jian-Ming
    Gao, Ming-Yang
    Zhang, Zuo
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 458 - 470
  • [8] Cooperative Multi-agent Reinforcement Learning Models (CMRLM) for Intelligent Traffic Control
    Vidhate, Deepak A.
    Kulkarni, Parag
    2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 325 - 331
  • [9] STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control
    Wang, Yanan
    Xu, Tong
    Niu, Xin
    Tan, Chang
    Chen, Enhong
    Xiong, Hui
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (06) : 2228 - 2242
  • [10] Multi-User QoE Enhancement: Federated Multi-Agent Reinforcement Learning for Cooperative Edge Intelligence
    Li, Xiuhua
    Sun, Chuan
    Wen, Junhao
    Wang, Xiaofei
    Guizani, Mohsen
    Leung, Victor C. M.
    IEEE NETWORK, 2022, 36 (05): : 144 - 151