CV-MOS: A Cross-View Model for Motion Segmentation

Cited by: 0
Authors
Tang, Xiaoyu [1, 2]
Chen, Zeyu [1, 2]
Cheng, Jintao [1, 2]
Chen, Xieyuanli [3]
Wu, Jin [4]
Xue, Bohuan [5]
Affiliations
[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China
[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;
DOI
10.1109/TIM.2024.3458036
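Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract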
In autonomous driving, accurately distinguishing between static and moving objects is crucial. In the moving object segmentation (MOS) task, effectively leveraging the motion information of objects is a primary challenge in improving the recognition of moving objects. Previous methods used either range view (RV) or bird's eye view (BEV) residual maps to capture motion information. In contrast, we propose combining RV and BEV residual maps to jointly exploit more of the available motion information. We therefore introduce CV-MOS, a cross-view model for moving object segmentation. Its novelty is that we decouple spatial-temporal information by capturing motion from BEV and RV residual maps and generating semantic features from range images, which serve as moving object guidance for the motion branch. This direct solution makes full use of range images and of RV and BEV residual maps, significantly enhancing performance on the LiDAR-based MOS task. Our method achieves leading IoU scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset, respectively. CV-MOS also demonstrates state-of-the-art (SOTA) performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.
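The abstract refers to RV residual maps without defining them. The following is a minimal sketch of one common way such a map can be built: project two LiDAR scans to range images and take the normalized per-pixel range difference. It is not taken from the CV-MOS repository. The function names, the sensor field of view, the image size, and the normalization by the current range are all assumptions made for illustration, and the previous scan is assumed to be already transformed into the current scan's coordinate frame.

```python
import numpy as np


def range_projection(points, height=64, width=2048,
                     fov_up_deg=3.0, fov_down_deg=-25.0):
    """Project an (N, 3) point cloud to an (H, W) range image.

    Empty pixels are 0. The field-of-view defaults are assumed values,
    not parameters confirmed by the paper.
    """
    fov_up = np.radians(fov_up_deg)
    fov_down = np.radians(fov_down_deg)
    fov = fov_up - fov_down

    depth = np.linalg.norm(points, axis=1)
    keep = depth > 0
    points, depth = points[keep], depth[keep]

    yaw = np.arctan2(points[:, 1], points[:, 0])
    pitch = np.arcsin(points[:, 2] / depth)

    # Map angles to pixel coordinates (column u, row v).
    u = 0.5 * (1.0 - yaw / np.pi) * width
    v = (1.0 - (pitch - fov_down) / fov) * height
    u = np.clip(np.floor(u), 0, width - 1).astype(np.int64)
    v = np.clip(np.floor(v), 0, height - 1).astype(np.int64)

    # Write far points first so the nearest point wins in each pixel.
    order = np.argsort(depth)[::-1]
    image = np.zeros((height, width), dtype=np.float32)
    image[v[order], u[order]] = depth[order]
    return image


def range_residual(current, previous):
    """Normalized absolute range difference where both pixels are valid."""
    valid = (current > 0) & (previous > 0)
    residual = np.zeros_like(current)
    residual[valid] = np.abs(current[valid] - previous[valid]) / current[valid]
    return residual


# Hypothetical two-scan example: one point moves from 12 m to 10 m ahead.
previous_scan = np.array([[12.0, 0.0, 0.0]])
current_scan = np.array([[10.0, 0.0, 0.0]])
residual = range_residual(range_projection(current_scan),
                          range_projection(previous_scan))
```

In this sketch, static structure yields residuals near zero after ego-motion compensation, while a moving object leaves a nonzero response at its pixels. A BEV residual map would apply the same differencing idea to a top-down grid and is not shown here.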
Pages: 10