Data Augmented Deep Behavioral Cloning for Urban Traffic Control Operations Under a Parallel Learning Framework

被引:34
作者
Li, Xiaoshuang [1 ,2 ]
Ye, Peijun [1 ,3 ]
Jin, Junchen [1 ]
Zhu, Fenghua [1 ,3 ]
Wang, Fei-Yue [1 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Qingdao Acad Intelligent Ind, Parallel Intelligence Res Ctr, Qingdao 266109, Peoples R China
[4] Univ Chinese Acad Sci, Ctr China Econ & Social Secur, Beijing 100149, Peoples R China
[5] Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; Gallium nitride; Cloning; Data models; Task analysis; Complex systems; Knowledge engineering; Intelligent traffic signal operations; parallel learning; deep behavioral cloning; generative adversarial networks; VEHICLES; LIGHTS; SYSTEM;
D O I
10.1109/TITS.2020.3048151
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
It is indispensable for professional traffic signal engineers to perform manual operations of traffic signal control (TSC) to mitigate traffic congestion, especially with complicated scenarios. However, such a task is time-consuming, and the level of congestion mitigation heavily relies on individual expertise in engineering practice. Therefore, it is cost-effective to learn traffic engineers' knowledge to enhance the problem-solving skills for a large-scale urban traffic network. In this paper, a data augmented deep behavioral cloning (DADBC) method is proposed to imitate the problem-solving skills of traffic engineers. The method is under a conceptual framework, parallel learning (PL) framework, that incorporates machine learning techniques for solving decision-making problems in complex systems. The DADBC method enhances a hybrid demonstration by exploiting a generative adversarial network (GAN) and then uses the deep behavioral cloning (DBC) model to learn traffic engineers' control schemes. According to the validation results using the real manipulation data from Hangzhou, China, our method can imitate complex human behaviors in intervening traffic signal control operations to improve traffic efficiency in urban areas.
引用
收藏
页码:5128 / 5137
页数:10
相关论文
共 39 条
[1]   A Fast, Robust, and Incremental Model for Learning High-Level Concepts From Human Motions by Imitation [J].
Alibeigi, Mina ;
Ahmadabadi, Majid Nili ;
Araabi, Babak Nadjar .
IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (01) :153-168
[2]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[3]  
Bhattacharyya RP, 2018, IEEE INT C INT ROBOT, P1534, DOI 10.1109/IROS.2018.8593758
[4]  
Bowles C., 2018, GAN augmentation: Augmenting training data using generative adversarial networks
[5]  
Caruana R, 2001, ADV NEUR IN, V13, P402
[6]   Traffic signal timing optimisation based on genetic algorithm approach, including drivers' routing [J].
Ceylan, H ;
Bell, MGH .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2004, 38 (04) :329-342
[7]   Traffic Flow Imputation Using Parallel Data and Generative Adversarial Networks [J].
Chen, Yuanyuan ;
Lv, Yisheng ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) :1624-1630
[8]  
Chowdhury SR, 2019, CONF REC ASILOMAR C, P79, DOI [10.1109/IEEECONF44664.2019.9049005, 10.1109/ieeeconf44664.2019.9049005]
[9]   GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification [J].
Frid-Adar, Maayan ;
Diamant, Idit ;
Klang, Eyal ;
Amitai, Michal ;
Goldberger, Jacob ;
Greenspan, Hayit .
NEUROCOMPUTING, 2018, 321 :321-331
[10]   Optimal Cycle Program of Traffic Lights With Particle Swarm Optimization [J].
Garcia-Nieto, Jose ;
Carolina Olivera, Ana ;
Alba, Enrique .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2013, 17 (06) :823-839