Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module

被引:2
|
作者
Zhang, Han [1 ]
Gu, Yunchao [1 ]
Wang, Xinliang [1 ]
Pan, Junjun [1 ]
Wang, Minghui [1 ]
机构
[1] Beihang Univ, XueYuan Rd 37, Beijing, Peoples R China
来源
关键词
Autonomous driving; Lane detection; Transformer;
D O I
10.1007/978-3-031-19842-7_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lane detection requires adequate global information due to the simplicity of lane line features and changeable road scenes. In this paper, we propose a novel lane detection Transformer based on multiframe input to regress the parameters of lanes under a lane shape modeling. We design a Multi-frame Horizontal and Vertical Attention (MHVA) module to obtain more global features and use Visual Transformer (VT) module to get "lane tokens" with interaction information of lane instances. Extensive experiments on two public datasets show that our model can achieve state-of-art results on VIL-100 dataset and comparable performance on TuSimple dataset. In addition, our model runs at 46 fps on multi-frame data while using few parameters, indicating the feasibility and practicability in real-time self-driving applications of our proposed method.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] Lane Detection Based on Multi-Frame Image Input
    Fan, Chao
    Chen, Fangfang
    Song, Yupei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (07)
  • [2] Self-supervised multi-frame depth estimation with visual-inertial pose transformer and monocular guidance
    Wang, Xiang
    Luo, Haonan
    Wang, Zihang
    Zheng, Jin
    Bai, Xiao
    INFORMATION FUSION, 2024, 108
  • [3] Multi-frame Network Feature Fusion Model andSelf-attention Mechanism for Vehicle Lane Line Detection
    Zhu, Guang
    Liu, Yajuan
    Wang, Jiyue
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (04)
  • [4] GSDC Transformer: An Efficient and Effective Cue Fusion for Monocular Multi-Frame Depth Estimation
    Fang, Naiyu
    Qiu, Lemiao
    Zhang, Shuyou
    Wang, Zili
    Zhou, Zheyuan
    Hu, Kerui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (03) : 2256 - 2263
  • [5] CFI-Former: Efficient lane detection by multi-granularity perceptual query attention transformer
    Gao, Rong
    Hu, Siqi
    Yan, Lingyu
    Zhang, Lefei
    Wu, Jia
    NEURAL NETWORKS, 2025, 187
  • [6] Image Captioning using Visual Attention and Detection Transformer Model
    Eluri, Yaswanth
    Vinutha, N.
    Jeevika, M.
    Sree, Sai Bhavya N.
    Abhiram, G. Surya
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [7] Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention
    Li, Yanhui
    Fang, Zhongchun
    Li, Hairong
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [8] Behavior detection and evaluation based on multi-frame MobileNet
    Linqi Liu
    Xiuhui Wang
    Qifu Bao
    Xuesheng Li
    Multimedia Tools and Applications, 2024, 83 : 15733 - 15750
  • [9] Behavior detection and evaluation based on multi-frame MobileNet
    Liu, Linqi
    Wang, Xiuhui
    Bao, Qifu
    Li, Xuesheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 15733 - 15750
  • [10] DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing
    Wasi, Abdul
    Shiney, O. Jeba
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 148 - 159