CV-MOS: A Cross-View Model for Motion Segmentation

Authors
Tang, Xiaoyu [1, 2]
Chen, Zeyu [1, 2]
Cheng, Jintao [1, 2]
Chen, Xieyuanli [3]
Wu, Jin [4]
Xue, Bohuan [5]
Affiliations
[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China
[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;
DOI
10.1109/TIM.2024.3458036
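Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809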
Abstract
In autonomous driving, accurately distinguishing between static and moving objects is crucial. In the moving object segmentation (MOS) task, effectively leveraging the motion information of objects is a primary challenge in improving the recognition of moving objects. Previous methods used either range view (RV) or bird's eye view (BEV) residual maps to capture motion information. Unlike these approaches, we propose combining RV and BEV residual maps to jointly exploit more of the available motion information. To this end, we introduce CV-MOS, a cross-view model for moving object segmentation. As a novelty, we decouple spatial-temporal information by capturing motion from BEV and RV residual maps and generating semantic features from range images, which serve as moving object guidance for the motion branch. This direct solution makes full use of range images and of RV and BEV residual maps, significantly enhancing the performance of the LiDAR-based MOS task. Our method achieved leading IoU scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset, respectively. In particular, CV-MOS demonstrates state-of-the-art (SOTA) performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.
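To make the idea of RV and BEV residual maps concrete, the following is a minimal sketch of how two consecutive LiDAR scans might be projected into each view and differenced. It is not the CV-MOS implementation. The function names, grid sizes, field-of-view values, the normalized range difference used for RV, and the occupancy-count difference used for BEV are all illustrative assumptions, and the scans are assumed to be already aligned to a common coordinate frame.

```python
import math

def range_view(points, height=4, width=8, fov_up_deg=3.0, fov_down_deg=-25.0):
    """Spherical projection; each pixel keeps the closest range (0.0 = empty).
    Image size and vertical field of view are assumed values."""
    fov_down = math.radians(fov_down_deg)
    fov = math.radians(fov_up_deg) - fov_down
    image = [[0.0] * width for _ in range(height)]
    for x, y, z in points:
        r = math.sqrt(x * x + y * y + z * z)
        if r == 0.0:
            continue
        yaw = math.atan2(y, x)
        pitch = math.asin(z / r)
        u = int(0.5 * (1.0 - yaw / math.pi) * width)        # column from azimuth
        v = int((1.0 - (pitch - fov_down) / fov) * height)  # row from elevation
        u = min(max(u, 0), width - 1)
        v = min(max(v, 0), height - 1)
        if image[v][u] == 0.0 or r < image[v][u]:
            image[v][u] = r
    return image

def bird_eye_view(points, size=8, extent=8.0):
    """Top-down projection; each cell counts the points that fall into it.
    Covers [-extent, extent) in x and y (assumed Cartesian grid)."""
    cell = 2.0 * extent / size
    grid = [[0] * size for _ in range(size)]
    for x, y, _z in points:
        col = int((x + extent) // cell)
        row = int((y + extent) // cell)
        if 0 <= row < size and 0 <= col < size:
            grid[row][col] += 1
    return grid

def rv_residual(cur, prev):
    """Normalized range difference where both frames have a valid pixel."""
    return [[abs(c - p) / c if c > 0.0 and p > 0.0 else 0.0
             for c, p in zip(row_c, row_p)]
            for row_c, row_p in zip(cur, prev)]

def bev_residual(cur, prev):
    """Absolute difference of per-cell occupancy counts."""
    return [[abs(c - p) for c, p in zip(row_c, row_p)]
            for row_c, row_p in zip(cur, prev)]

# Hypothetical toy scans: one point moves toward the sensor, one stays put.
prev_scan = [(5.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
cur_scan = [(2.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
rv_map = rv_residual(range_view(cur_scan), range_view(prev_scan))
bev_map = bev_residual(bird_eye_view(cur_scan), bird_eye_view(prev_scan))
```

In this toy case the moving point stays in the same RV pixel but changes range, while in BEV it changes cell. This illustrates why the two views can carry complementary motion cues.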
Pages: 10