3D Point-Voxel Correlation Fields for Scene Flow Estimation

Cited by: 4
Authors
Wang, Ziyi [1]
Wei, Yi [1]
Rao, Yongming [1]
Zhou, Jie [1]
Lu, Jiwen [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
Keywords
Correlation; Point cloud compression; Three-dimensional displays; Estimation; Feature extraction; Deformation; Deep learning; Point cloud; scene flow estimation; point-voxel correlation fields; deformations; SPARSE;
DOI
10.1109/TPAMI.2023.3294355
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose Point-Voxel Correlation Fields to explore relations between two consecutive point clouds and estimate scene flow that represents 3D motions. Most existing works only consider local correlations, which are able to handle small movements but fail when there are large displacements. Therefore, it is essential to introduce all-pair correlation volumes that are free from local neighbor restrictions and cover both short- and long-term dependencies. However, it is challenging to efficiently extract correlation features from all-pairs fields in the 3D space, given the irregular and unordered nature of point clouds. To tackle this problem, we present point-voxel correlation fields, proposing distinct point and voxel branches to inquire about local and long-range correlations from all-pair fields respectively. To exploit point-based correlations, we adopt the K-Nearest Neighbors search that preserves fine-grained information in the local region, which guarantees the scene flow estimation precision. By voxelizing point clouds in a multi-scale manner, we construct pyramid correlation voxels to model long-range correspondences, which are utilized to handle fast-moving objects. Integrating these two types of correlations, we propose Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) architecture that employs an iterative scheme to estimate scene flow from point clouds. To adapt to different flow scope conditions and obtain more fine-grained results, we further propose Deformable PV-RAFT (DPV-RAFT), where the Spatial Deformation deforms the voxelized neighborhood, and the Temporal Deformation controls the iterative update process. We evaluate the proposed method on the FlyingThings3D and KITTI Scene Flow 2015 datasets and experimental results show that we outperform state-of-the-art methods by remarkable margins.
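The two lookups described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the scaled dot-product correlation, the mean pooling inside each voxel, the 3x3x3 grid, and the doubling of the voxel size per pyramid level are all assumptions made for illustration. The query positions are assumed to be the first point cloud translated by the current flow estimate.

```python
import numpy as np

def all_pair_correlation(feat1, feat2):
    """Correlation between every point of cloud 1 and every point of cloud 2.
    Assumption: scaled dot product of per-point features. Shape (N1, N2)."""
    return feat1 @ feat2.T / np.sqrt(feat1.shape[1])

def knn_point_lookup(corr, query_xyz, xyz2, k):
    """Point branch (sketch): for each query, gather the correlations of its
    k nearest neighbors in cloud 2, plus the relative offsets to them."""
    diff = xyz2[None, :, :] - query_xyz[:, None, :]      # (N1, N2, 3)
    dist2 = (diff ** 2).sum(axis=-1)                     # (N1, N2)
    idx = np.argsort(dist2, axis=1)[:, :k]               # (N1, k)
    rows = np.arange(corr.shape[0])[:, None]
    return corr[rows, idx], diff[rows, idx]              # (N1, k), (N1, k, 3)

def voxel_lookup(corr, query_xyz, xyz2, voxel_size, grid=3):
    """Voxel branch (sketch): average the correlations that fall into each
    cell of a grid^3 cube centered on the query. Empty cells stay 0."""
    n1 = corr.shape[0]
    rel = xyz2[None, :, :] - query_xyz[:, None, :]       # (N1, N2, 3)
    cell = np.floor(rel / voxel_size + grid / 2).astype(int)
    inside = np.all((cell >= 0) & (cell < grid), axis=-1)
    flat = (cell[..., 0] * grid + cell[..., 1]) * grid + cell[..., 2]
    rows = np.broadcast_to(np.arange(n1)[:, None], flat.shape)
    total = np.zeros((n1, grid ** 3))
    count = np.zeros((n1, grid ** 3))
    np.add.at(total, (rows[inside], flat[inside]), corr[inside])
    np.add.at(count, (rows[inside], flat[inside]), 1.0)
    return total / np.maximum(count, 1.0)

def pyramid_voxel_lookup(corr, query_xyz, xyz2, base_size, levels=3):
    """Multi-scale voxel features: the voxel size doubles at each level
    (an assumption), so coarser levels cover larger displacements."""
    feats = [voxel_lookup(corr, query_xyz, xyz2, base_size * 2 ** level)
             for level in range(levels)]
    return np.concatenate(feats, axis=1)                 # (N1, levels * grid^3)
```

In an iterative scheme of the kind the abstract describes, both lookups would be repeated at every step with `query_xyz` set to the first cloud plus the latest flow estimate, and the gathered features would feed the update unit. The point branch keeps fine detail near the query, while the coarser voxel levels reach far-away points at low cost.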
Pages: 13621-13635
Page count: 15