Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module

被引:2
|
作者
Zhang, Han [1 ]
Gu, Yunchao [1 ]
Wang, Xinliang [1 ]
Pan, Junjun [1 ]
Wang, Minghui [1 ]
机构
[1] Beihang Univ, XueYuan Rd 37, Beijing, Peoples R China
来源
关键词
Autonomous driving; Lane detection; Transformer;
D O I
10.1007/978-3-031-19842-7_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lane detection requires adequate global information due to the simplicity of lane line features and changeable road scenes. In this paper, we propose a novel lane detection Transformer based on multiframe input to regress the parameters of lanes under a lane shape modeling. We design a Multi-frame Horizontal and Vertical Attention (MHVA) module to obtain more global features and use Visual Transformer (VT) module to get "lane tokens" with interaction information of lane instances. Extensive experiments on two public datasets show that our model can achieve state-of-art results on VIL-100 dataset and comparable performance on TuSimple dataset. In addition, our model runs at 46 fps on multi-frame data while using few parameters, indicating the feasibility and practicability in real-time self-driving applications of our proposed method.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] CSA-Lanenet: a contiguous spatial attention lane detection network with vision transformer modules
    Yang, Wei-Jong
    Ho, Li-Yang
    VISUAL COMPUTER, 2024,
  • [32] CSA-Lanenet: a contiguous spatial attention lane detection network with vision transformer modules
    Yang, Wei-Jong
    Ho, Li-Yang
    Visual Computer, 2024,
  • [33] Dual-Attention Transformer and Discriminative Flow for Industrial Visual Anomaly Detection
    Yao, Haiming
    Luo, Wei
    Yu, Wenyong
    Zhang, Xiaotian
    Qiang, Zhenfeng
    Luo, Donghao
    Shi, Hui
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6126 - 6140
  • [34] A novel transformer attention-based approach for sarcasm detection
    Khan, Shumaila
    Qasim, Iqbal
    Khan, Wahab
    Aurangzeb, Khursheed
    Khan, Javed Ali
    Anwar, Muhammad Shahid
    EXPERT SYSTEMS, 2025, 42 (01)
  • [35] A Transformer Architecture based mutual attention for Image Anomaly Detection
    Zhang, Mengting
    Tian, Xiuxia
    Virtual Reality and Intelligent Hardware, 2023, 5 (01): : 57 - 67
  • [36] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Zhuang, Xuqiang
    Liu, Fangai
    Hou, Jian
    Hao, Jianhua
    Cai, Xiaohong
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1943 - 1960
  • [37] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Xuqiang Zhuang
    Fangai Liu
    Jian Hou
    Jianhua Hao
    Xiaohong Cai
    Neural Processing Letters, 2022, 54 : 1943 - 1960
  • [38] Multi-frame feature-fusion-based model for violence detection
    Asad, Mujtaba
    Yang, Jie
    He, Jiang
    Shamsolmoali, Pourya
    He, Xiangjian
    VISUAL COMPUTER, 2021, 37 (06): : 1415 - 1431
  • [39] Greedy Integration Based Multi-Frame Detection Algorithm in Radar Systems
    Li, Wujun
    Yi, Wei
    Teh, Kah Chan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 5877 - 5891
  • [40] Multi-frame feature-fusion-based model for violence detection
    Mujtaba Asad
    Jie Yang
    Jiang He
    Pourya Shamsolmoali
    Xiangjian He
    The Visual Computer, 2021, 37 : 1415 - 1431