PVI-Net: Point-Voxel-Image Fusion for Semantic Segmentation of Point Clouds in Large-Scale Autonomous Driving Scenarios

被引:2
|
作者
Wang, Zongshun [1 ]
Li, Ce [1 ]
Ma, Jialin [1 ]
Feng, Zhiqiang [1 ]
Xiao, Limei [1 ]
机构
[1] Lanzhou Univ Technol, Sch Elect Engn & Informat Engn, Lanzhou 730050, Peoples R China
基金
中国国家自然科学基金;
关键词
semantic segmentation; multi-perspective; cross-attention; LiDAR point clouds;
D O I
10.3390/info15030148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, we introduce a novel framework for the semantic segmentation of point clouds in autonomous driving scenarios, termed PVI-Net. This framework uniquely integrates three different data perspectives-point clouds, voxels, and distance maps-executing feature extraction through three parallel branches. Throughout this process, we ingeniously design a point cloud-voxel cross-attention mechanism and a multi-perspective feature fusion strategy for point images. These strategies facilitate information interaction across different feature dimensions of perspectives, thereby optimizing the fusion of information from various viewpoints and significantly enhancing the overall performance of the model. The network employs a U-Net structure and residual connections, effectively merging and encoding information to improve the precision and efficiency of semantic segmentation. We validated the performance of PVI-Net on the SemanticKITTI and nuScenes datasets. The results demonstrate that PVI-Net surpasses most of the previous methods in various performance metrics.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Semantic segmentation of large-scale point clouds by integrating attention mechanisms and transformer models
    Yuan, Tiebiao
    Yu, Yangyang
    Wang, Xiaolong
    IMAGE AND VISION COMPUTING, 2024, 146
  • [22] FAR-Net: Semantic Segmentation of Large-Scale Point Clouds Based on Feature Aggregation and Recoding for Aerial Computing
    Zhang, Jianlong
    Chen, Huangwei
    Wang, Bin
    Fang, Guangzu
    Zhou, Yang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5217 - 5227
  • [23] CSFNet: Cross-Modal Semantic Focus Network for Semantic Segmentation of Large-Scale Point Clouds
    Luo, Yang
    Han, Ting
    Liu, Yujun
    Su, Jinhe
    Chen, Yiping
    Li, Jinyuan
    Wu, Yundong
    Cai, Guorong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [24] Fusion of images and point clouds for the semantic segmentation of large-scale 3D scenes based on deep learning
    Zhang, Rui
    Li, Guangyun
    Li, Minglei
    Wang, Li
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 143 : 85 - 96
  • [25] GAF-Net: Geometric Contextual Feature Aggregation and Adaptive Fusion for Large-Scale Point Cloud Semantic Segmentation
    Zhou, Ce
    Ling, Qiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [26] Active Spatio-Fine Enhancement Network for Semantic Segmentation of Large-Scale Point Clouds
    Chen, Xijiang
    Wang, Zihao
    Zhao, Bufan
    Qin, Mengjiao
    Han, Xianquan
    Ozdemir, Emirhan
    IEEE Sensors Journal, 2024, 24 (22): : 37358 - 37379
  • [27] Active Spatio-Fine Enhancement Network for Semantic Segmentation of Large-Scale Point Clouds
    Chen, Xijiang
    Wang, Zihao
    Zhao, Bufan
    Qin, Mengjiao
    Han, Xianquan
    Ozdemir, Emirhan
    IEEE SENSORS JOURNAL, 2024, 24 (22) : 37358 - 37379
  • [28] DSC-Net: learning discriminative spatial contextual features for semantic segmentation of large-scale ancient architecture point clouds
    Zhao, Jianghong
    Liu, Rui
    Hua, Xinnan
    Yu, Haiquan
    Zhao, Jifu
    Wang, Xin
    Yang, Jia
    HERITAGE SCIENCE, 2024, 12 (01):
  • [29] DLA-Net: Learning dual local attention features for semantic segmentation of large-scale building facade point clouds
    Su, Yanfei
    Liu, Weiquan
    Yuan, Zhimin
    Cheng, Ming
    Zhang, Zhihong
    Shen, Xuelun
    Wang, Cheng
    PATTERN RECOGNITION, 2022, 123
  • [30] Weakly Supervised Semantic Segmentation for Large-Scale Point Cloud
    Zhang, Yachao
    Li, Zonghao
    Xie, Yuan
    Qu, Yanyun
    Li, Cuihua
    Mei, Tao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3421 - 3429