3D Point-Voxel Correlation Fields for Scene Flow Estimation

被引:4
|
作者
Wang, Ziyi [1 ]
Wei, Yi [1 ]
Rao, Yongming [1 ]
Zhou, Jie [1 ]
Lu, Jiwen [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
关键词
Correlation; Point cloud compression; Three-dimensional displays; Estimation; Feature extraction; Deformation; Deep learning; Point cloud; scene flow estimation; point-voxel correlation fields; deformations; SPARSE;
D O I
10.1109/TPAMI.2023.3294355
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose Point-Voxel Correlation Fields to explore relations between two consecutive point clouds and estimate scene flow that represents 3D motions. Most existing works only consider local correlations, which are able to handle small movements but fail when there are large displacements. Therefore, it is essential to introduce all-pair correlation volumes that are free from local neighbor restrictions and cover both short- and long-term dependencies. However, it is challenging to efficiently extract correlation features from all-pairs fields in the 3D space, given the irregular and unordered nature of point clouds. To tackle this problem, we present point-voxel correlation fields, proposing distinct point and voxel branches to inquire about local and long-range correlations from all-pair fields respectively. To exploit point-based correlations, we adopt the K-Nearest Neighbors search that preserves fine-grained information in the local region, which guarantees the scene flow estimation precision. By voxelizing point clouds in a multi-scale manner, we construct pyramid correlation voxels to model long-range correspondences, which are utilized to handle fast-moving objects. Integrating these two types of correlations, we propose Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) architecture that employs an iterative scheme to estimate scene flow from point clouds. To adapt to different flow scope conditions and obtain more fine-grained results, we further propose Deformable PV-RAFT (DPV-RAFT), where the Spatial Deformation deforms the voxelized neighborhood, and the Temporal Deformation controls the iterative update process. We evaluate the proposed method on the FlyingThings3D and KITTI Scene Flow 2015 datasets and experimental results show that we outperform state-of-the-art methods by remarkable margins.
引用
收藏
页码:13621 / 13635
页数:15
相关论文
共 50 条
  • [1] PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds
    Wei, Yi
    Wang, Ziyi
    Rao, Yongming
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6950 - 6959
  • [2] Point-Voxel Fusion for Multimodal 3D Detection
    Wang, Ke
    Zhang, Zhichuang
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1716 - 1719
  • [3] Point-Voxel CNN for Efficient 3D Deep Learning
    Liu, Zhijian
    Tang, Haotian
    Lin, Yujun
    Han, Song
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Point-voxel dual stream transformer for 3d point cloud learning
    Zhao, Tianmeng
    Zeng, Hui
    Zhang, Baoqing
    Fan, Bin
    Li, Chen
    VISUAL COMPUTER, 2024, 40 (08): : 5323 - 5339
  • [5] 3D Shape Generation and Completion through Point-Voxel Diffusion
    Zhou, Linqi
    Du, Yilun
    Wu, Jiajun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5806 - 5815
  • [6] PVNAS: 3D Neural Architecture Search With Point-Voxel Convolution
    Liu, Zhijian
    Tang, Haotian
    Zhao, Shengyu
    Shao, Kevin
    Han, Song
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8552 - 8568
  • [7] PVDECONV: POINT-VOXEL DECONVOLUTION FOR AUTOENCODING CAD CONSTRUCTION IN 3D
    Cherenkova, Kseniya
    Aouada, Djamila
    Gusev, Gleb
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2741 - 2745
  • [8] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
    Fan, Baojie
    Zhang, Kexin
    Tian, Jiandong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
  • [9] Point-Voxel Based Geometry-Adaptive Network for 3D Point Cloud Analysis
    Zhao, Tian-Meng
    Zeng, Hui
    Zhang, Bao-Qing
    Liu, Hong-Min
    Fan, Bin
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (05) : 1167 - 1179
  • [10] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
    Yu, Bingxin
    Wang, Lu
    He, Yuhong
    Wang, Xiaoyang
    Cheng, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176