MLP-Based Efficient Convolutional Neural Network for Lane Detection

被引:3
|
作者
Yao, Xuedong [1 ]
Wang, Yandong [1 ,2 ,3 ]
Wu, Yanlan [4 ,5 ,6 ]
He, Guoxiong [1 ]
Luo, Shuchang [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Collaborat Innovat Ctr Geospatial Informat Techno, Wuhan 430079, Peoples R China
[3] East China Univ Technol, Fac Geomat, Nanchang 330013, Jiangxi, Peoples R China
[4] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhu Prov, Hefei 230601, Peoples R China
[5] Anhui Univ, Sch Resources & Environm Engn, Hefei 230601, Peoples R China
[6] Anhui Engn Res Ctr Geog Informat Intelligent Tech, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network (CNN); lane detection; long-range dependencies; multilayer perceptron (MLP); ATTENTION; TRACKING;
D O I
10.1109/TVT.2023.3275571
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lane detection is an important and fundamental task in autonomous driving. Modern convolutional neural network (CNN) methods have achieved high performance in lane detection; however, the intrinsic locality of convolution operations makes these methods limited in effectively modeling the long-range dependencies that are vital to capture global information of lanes. Additionally, numerous convolution operations result in considerable computational cost for high complexity. To overcome these difficulties, we propose an efficient lane detection method by combining CNN with a multilayer perceptron (MLP). First, an improved bottleneck-1D layer is used to replace the standard convolutional layer in overall network to reduce the computational cost and parameters while applying hybrid dilated convolution (HDC) to better capture multiscale lane information. Second, we construct a hybrid MLP block in the latent space to capture the long-range dependencies of lanes. The hybrid MLP projects tokenized convolutional features from spatial locations and channels, and then, they are fused together to obtain global representation, in which each output pixel is related to each input pixel. The introduction of MLP further decreases computational complexity and makes the proposed architecture more efficient for lane detection. Experimental results on two challenging datasets (CULane, Tusimple) demonstrate that our method can achieve a higher computational efficiency while maintaining a decent detection performance compared with other state-of-the-art methods. Furthermore, this study indicates that integrating the global representation capacity of an MLP with local prior information of convolution is an effective and potential perspective in lane detection.
引用
收藏
页码:12602 / 12614
页数:13
相关论文
共 50 条
  • [21] U-MLP: MLP-based ultralight refinement network for medical image segmentation
    Gao, Shuo
    Yang, Wenhui
    Xu, Menglei
    Zhang, Hao
    Yu, Hong
    Qian, Airong
    Zhang, Wenjuan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [22] Face Detection System Based on MLP Neural Network
    Shilbayeh, Nidal F.
    Al-Qudah, Gaith A.
    RECENT ADVANCES IN NEURAL NETWORKS, FUZZY SYSTEMS & EVOLUTIONARY COMPUTING, 2010, : 238 - 243
  • [23] Efficient Lane Detection Technique Based on Lightweight Attention Deep Neural Network
    Yao, Zhiting
    Chen, Xiyuan
    Journal of Advanced Transportation, 2022, 2022
  • [24] Efficient Lane Detection Technique Based on Lightweight Attention Deep Neural Network
    Yao, Zhiting
    Chen, Xiyuan
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
  • [25] An efficient intrusion detection model based on convolutional spiking neural network
    Zhen Wang
    Fuad A. Ghaleb
    Anazida Zainal
    Maheyzah Md Siraj
    Xing Lu
    Scientific Reports, 14
  • [26] An MLP-Based Player Detection and Tracking in Broadcast Soccer Video
    Heydari, Mohammad
    Moghadam, Amir Masoud Eftekhari
    2012 INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE (ICRAI), 2012, : 195 - 199
  • [27] An efficient intrusion detection model based on convolutional spiking neural network
    Wang, Zhen
    Ghaleb, Fuad A.
    Zainal, Anazida
    Siraj, Maheyzah Md
    Lu, Xing
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [28] Speech Enhancement Using MLP-Based Architecture With Convolutional Token Mixing Module and Squeeze-and-Excitation Network
    Song, Hyungchan
    Kim, Minseung
    Shin, Jong Won
    IEEE ACCESS, 2022, 10 : 119283 - 119289
  • [29] Lane Instance Segmentation Algorithm Based on Convolutional Neural Network
    Zhou Su
    Wu Di
    Jin Jie
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [30] An MLP-based representation of neural tensor networks for the RDF data models
    Abedini, Farhad
    Menhaj, Mohammad Bagher
    Keyvanpour, Mohammad Reza
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 2): : 1135 - 1144