Research on Multiframe Lane Detection Method Using Swin Transformer Embedded with Attention

被引:0
|
作者
Li, Yanhui [1 ]
Fang, Zhongchun [2 ]
Li, Hairong [2 ]
机构
[1] Inner Mongolia Univ Sci & Technol, Sch Cyber Sci & Technol, Sch Digital & Intelligent Ind, Baotou 014000, Inner Mongolia, Peoples R China
[2] Inner Mongolia Univ Sci & Technol, Coll Innovat & Entrepreneurship Educ, Engn Training Ctr, Baotou 014000, Inner Mongolia, Peoples R China
关键词
Swin Transformer; coordinate attention mechanism; spatiotemporal long-short term memory; lane detection;
D O I
10.3788/LOP241332
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To reduce computational costs and efficiently complete lane detection tasks, this paper proposes a multiframe lane detection method using a Swin Transformer embedded with a coordinate attention mechanism for lane detection in continuous multiframe image sequences. In this approach, continuous multiframe image sequences are taken as inputs and the Swin Transformer encoder-decoder architecture is adopted to ensure consistent input and output image sizes. The coordinate attention mechanism is embedded in patch merging from the stage 3 fusion layer of the Swin Transformer model, enhancing the model's focus on long-distance dependencies and its ability to extract both global and local features of lane lines. Additionally, introducing spatiotemporal long-short term memory between the encoder and decoder boosts the model's ability to predict temporal sequence information, significantly improving the lane line detection accuracy. Extensive experiments conducts on the CULane, Tusimple, and VIL-100 datasets demonstrate that the proposed method provides a comprehensive advantage in handling continuous multiframe image sequences, delivering superior detection performance compared to existing studies.
引用
收藏
页数:10
相关论文
共 29 条
  • [1] Real time Detection of Lane Markers in Urban Streets
    Aly, Mohamed
    [J]. 2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, : 165 - 170
  • [2] [Anonymous], 2024, CAM (sic) ASPP, V38, P86
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] Target Classification of Hyperspectral Lidar Based on Optimization Selection of Spatial-Spectral Features
    Chen Bowen
    Shi Shuo
    Gong Wei
    Xu Qian
    Tang Xingtao
    Bi Sifu
    Chen Biwu
    [J]. ACTA OPTICA SINICA, 2023, 43 (12)
  • [5] You Only Look One-level Feature
    Chen, Qiang
    Wang, Yingming
    Yang, Tong
    Zhang, Xiangyu
    Cheng, Jian
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13034 - 13043
  • [6] Rethinking BiSeNet For Real-time Semantic Segmentation
    Fan, Mingyuan
    Lai, Shenqi
    Huang, Junshi
    Wei, Xiaoming
    Chai, Zhenhua
    Luo, Junfeng
    Wei, Xiaolin
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9711 - 9720
  • [7] EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection
    Ghafoorian, Mohsen
    Nugteren, Cedric
    Baka, Nora
    Booij, Olaf
    Hofmann, Michael
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 256 - 272
  • [8] Deformable Gabor Feature Networks for Biomedical Image Classification
    Gong, Xuan
    Xia, Xin
    Zhu, Wentao
    Zhang, Baochang
    Doermann, David
    Zhuo, Li'an
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 4003 - 4011
  • [9] Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization
    Gwilliam, Matthew
    Teuscher, Adam
    Anderson, Connor
    Farrell, Ryan
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3308 - 3317
  • [10] Monte Carlo Scene Search for 3D Scene Understanding
    Hampali, Shreyas
    Stekovic, Sinisa
    Sarkar, Sayan Deb
    Kumar, Chetan S.
    Fraundorfer, Friedrich
    Lepetit, Vincent
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13799 - 13808