CV-MOS: A Cross-View Model for Motion Segmentation

Cited by: 0
Authors
Tang, Xiaoyu [1, 2]
Chen, Zeyu [1, 2]
Cheng, Jintao [1, 2]
Chen, Xieyuanli [3]
Wu, Jin [4]
Xue, Bohuan [5]
Affiliations
[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China
[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;
DOI
10.1109/TIM.2024.3458036
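Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification codes
0808; 0809;
Abstract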
In autonomous driving, accurately distinguishing between static and moving objects is crucial. In the moving object segmentation (MOS) task, effectively leveraging the motion information of objects is a primary challenge in improving the recognition of moving objects. Previous methods used either range view (RV) or bird's eye view (BEV) residual maps to capture motion information. Unlike these approaches, we propose combining RV and BEV residual maps to jointly exploit more of the available motion information. We therefore introduce CV-MOS, a cross-view model for moving object segmentation. Notably, we decouple spatial-temporal information by capturing motion from BEV and RV residual maps and generating semantic features from range images, which serve as moving object guidance for the motion branch. This direct solution makes full use of range images and of RV and BEV residual maps, significantly enhancing performance on the LiDAR-based MOS task. Our method achieves leading IoU scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset, respectively. CV-MOS also demonstrates state-of-the-art (SOTA) performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.
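The abstract does not define how the RV and BEV residual maps are computed. As a rough illustration only, the sketch below projects two LiDAR scans into a small range image and a small BEV height map and takes per-pixel differences. The function names, grid sizes, field-of-view values, the normalized range difference, and the height-based BEV map are all assumptions made for this example and are not taken from the CV-MOS implementation. It also assumes both scans are already expressed in the same coordinate frame (ego-motion compensated).

```python
import math

def range_image(points, height=4, width=8, fov_up_deg=3.0, fov_down_deg=-25.0):
    """Spherical projection (hypothetical sizes): pixel = range of nearest point, 0.0 if empty."""
    fov_down = math.radians(fov_down_deg)
    fov = math.radians(fov_up_deg) - fov_down
    image = [[0.0] * width for _ in range(height)]
    for x, y, z in points:
        r = math.sqrt(x * x + y * y + z * z)
        if r == 0.0:
            continue
        yaw = math.atan2(y, x)
        pitch = math.asin(z / r)
        u = int(0.5 * (1.0 - yaw / math.pi) * width)         # column from azimuth
        v = int((1.0 - (pitch - fov_down) / fov) * height)   # row from elevation
        u = min(max(u, 0), width - 1)
        v = min(max(v, 0), height - 1)
        if image[v][u] == 0.0 or r < image[v][u]:
            image[v][u] = r
    return image

def bev_height_map(points, grid=8, extent=10.0, z_min=-3.0):
    """Top-down grid (hypothetical layout): cell = max height above z_min, 0.0 if empty."""
    cell = 2.0 * extent / grid
    image = [[0.0] * grid for _ in range(grid)]
    for x, y, z in points:
        col = math.floor((x + extent) / cell)
        row = math.floor((y + extent) / cell)
        h = z - z_min
        if not (0 <= col < grid and 0 <= row < grid) or h <= 0.0:
            continue
        image[row][col] = max(image[row][col], h)
    return image

def rv_residual(cur, prev):
    """Assumed RV residual: normalized range difference where both pixels are valid."""
    return [[abs(c - p) / c if c > 0.0 and p > 0.0 else 0.0
             for c, p in zip(row_c, row_p)]
            for row_c, row_p in zip(cur, prev)]

def bev_residual(cur, prev):
    """Assumed BEV residual: absolute height difference, empty cells counted as 0."""
    return [[abs(c - p) for c, p in zip(row_c, row_p)]
            for row_c, row_p in zip(cur, prev)]

# Toy scans: the first point moves 1 m toward the sensor, the second is static.
prev_scan = [(5.0, 0.0, 0.0), (0.0, 5.0, -0.5)]
cur_scan = [(4.0, 0.0, 0.0), (0.0, 5.0, -0.5)]

rv_res = rv_residual(range_image(cur_scan), range_image(prev_scan))
bev_res = bev_residual(bev_height_map(cur_scan), bev_height_map(prev_scan))
```

In this toy case the moving point changes range within one RV pixel and changes cell in the BEV grid, so it produces a nonzero response in both views, while the static point produces none. That complementary behaviour is the intuition behind using the two residual maps together.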
Pages: 10
Related papers
50 in total
  • [21] Multiview Co-segmentation for Wide Baseline Images using Cross-view Supervision
    Yao, Yuan
    Park, Hyun Soo
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1931 - 1940
  • [22] Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation
    Wang, Zicheng
    Zhao, Zhen
    Xing, Xiaoxia
    Xu, Dong
    Kong, Xiangyu
    Zhou, Luping
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19585 - 19595
  • [23] CrossMatch: Cross-View Matching for Semi-Supervised Remote Sensing Image Segmentation
    Liu, Ruizhong
    Luo, Tingzhang
    Huang, Shaoguang
    Wu, Yuwei
    Jiang, Zhen
    Zhang, Hongyan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [24] Cross-View Label Transfer in Knee MR Segmentation Using Iterative Context Learning
    Li, Tong
    Xuan, Kai
    Xue, Zhong
    Chen, Lei
    Zhang, Lichi
    Qian, Dahong
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND DISTRIBUTED AND COLLABORATIVE LEARNING, DART 2020, DCL 2020, 2020, 12444 : 96 - 105
  • [25] A Cross-View Model for Tourism Demand Forecasting with Artificial Intelligence Method
    Han, Siming
    Guo, Yanhui
    Cao, Han
    Feng, Qian
    Li, Yifei
    DATA SCIENCE, PT 1, 2017, 727 : 573 - 582
  • [26] Cross-View Panorama Image Synthesis
    Wu, Songsong
    Tang, Hao
    Jing, Xiao-Yuan
    Zhao, Haifeng
    Qian, Jianjun
    Sebe, Nicu
    Yan, Yan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3546 - 3559
  • [27] Bridge point cloud semantic segmentation based on view consensus and cross-view self-prompt fusion
    Zeng, Yan
    Huang, Feng
    Xiong, Guikai
    Ma, Xiaoxiao
    Peng, Yingchuan
    Yang, Wenshu
    Liu, Jiepeng
    AUTOMATION IN CONSTRUCTION, 2025, 171
  • [28] Cross-View Fusion for Multi-View Clustering
    Huang, Zhijie
    Huang, Binqiang
    Zheng, Qinghai
    Yu, Yuanlong
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 621 - 625
  • [29] An Asymmetric Distance Model for Cross-View Feature Mapping in Person Reidentification
    Chen, Ying-Cong
    Zheng, Wei-Shi
    Lai, Jian-Huang
    Yuen, Pong C.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (08) : 1661 - 1675
  • [30] X-Align++: cross-modal cross-view alignment for Bird's-eye-view segmentation
    Borse, Shubhankar
    Klingner, Marvin
    Ravi, Varun
    Cai, Hong
    Almuzairee, Abdulaziz
    Yogamani, Senthil
    Porikli, Fatih
    MACHINE VISION AND APPLICATIONS, 2023, 34 (04)