Transformable Dilated Convolution by Distance for LiDAR Semantic Segmentation

被引:3
|
作者
Lee, Jae-Seol [1 ]
Park, Tae-Hyoung [2 ]
机构
[1] Chungbuk Natl Univ, Ind AI Res Ctr, Cheongju 28644, Chungcheongbuk, South Korea
[2] Chungbuk Natl Univ, Dept Intelligent Syst & Robot, Cheongju 28644, Chungcheongbuk, South Korea
关键词
Autonomous vehicles; convolutional neural network; LiDAR; point cloud; semantic segmentation; spherical coordinate transformation;
D O I
10.1109/ACCESS.2022.3225556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
LiDAR semantic segmentation is essential in autonomous vehicle safety. A rotating 3D LiDAR projects more laser points onto nearby objects and fewer points onto farther objects. Therefore, when projecting points onto a 2D image, such as spherical coordinates, nearer objects appear larger than more distant objects. Recognizing a closer object requires a larger receptive field, whereas recognizing a nearer object requires a smaller receptive field. However, existing CNNs have always used the same receptive field, making it difficult to express objects of various sizes in a single-sized receptive field, restricting their performance in terms of the recognition of larger (or nearer) objects that require a larger receptive field. In response to these limitations, we propose a transformable dilated convolution (TD Conv) to adjust the convolution filter's size according to the input distance. Leveraging the distance information of LiDAR and dilated convolution, a large convolution was applied to nearby objects, and a small convolution was applied to farther objects. The proposed method yielded good performance when recognizing nearer objects or larger objects such as roads and buildings and showed similar performance to the conventional method for farther or smaller objects. To test the proposed method, we used the SemanticKITTI dataset.
引用
收藏
页码:125102 / 125111
页数:10
相关论文
共 50 条
  • [1] Gaussian Dilated Convolution for Semantic Image Segmentation
    Shen, Falong
    Zeng, Gang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 324 - 334
  • [2] Frequency-Adaptive Dilated Convolution for Semantic Segmentation
    Chen, Linwei
    Gu, Lin
    Zheng, Dezhi
    Fu, Ying
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 3414 - 3425
  • [3] MULTIPLE SKIP CONNECTIONS OF DILATED CONVOLUTION NETWORK FOR SEMANTIC SEGMENTATION
    Yamashita, Takayoshi
    Furukawa, Hironori
    Fujiyoshi, Hironobu
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1593 - 1597
  • [4] End-to-end dilated convolution network for document image semantic segmentation
    Xu Can-hui
    Shi Cao
    Chen Yi-nong
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2021, 28 (06) : 1765 - 1774
  • [5] MCDCNet: Mask Classification Combined with Adaptive Dilated Convolution for Image Semantic Segmentation
    Wei, Geng
    Wang, Junbo
    Shi, Bingxian
    Zhu, Xiaolin
    Cao, Bo
    Liu, Tong
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [6] LiDAR Point Clouds Semantic Segmentation in Autonomous Driving Based on Asymmetrical Convolution
    Sun, Xiang
    Song, Shaojing
    Miao, Zhiqing
    Tang, Pan
    Ai, Luxia
    ELECTRONICS, 2023, 12 (24)
  • [7] DENSE CONVOLUTION FOR SEMANTIC SEGMENTATION
    Han, Chaoyi
    Tao, Xiaoming
    Duan, Yiping
    Lu, Jianhua
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2222 - 2226
  • [8] Understanding Convolution for Semantic Segmentation
    Wang, Panqu
    Chen, Pengfei
    Yuan, Ye
    Liu, Ding
    Huang, Zehua
    Hou, Xiaodi
    Cottrell, Garrison
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1451 - 1460
  • [9] Semantic Segmentation by Multi-Scale Feature Extraction Based on Grouped Dilated Convolution Module
    Kim, Dong Seop
    Kim, Yu Hwan
    Park, Kang Ryoung
    MATHEMATICS, 2021, 9 (09)
  • [10] Hybrid Dilated Convolution Network Using Attentive Kernels for Real-Time Semantic Segmentation
    He, Jiankai
    Jiang, Bin
    Yang, Chao
    Tu, Wenxuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2020, 2020, 12305 : 129 - 141