3D Point-Voxel Correlation Fields for Scene Flow Estimation

Cited by: 4

Authors:

Wang, Ziyi [1]

Wei, Yi [1]

Rao, Yongming [1]

Zhou, Jie [1]

Lu, Jiwen [1]

Affiliations:

[1] Tsinghua University, Department of Automation, Beijing 100084, People's Republic of China

Source:

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2023 / Vol. 45 / No. 11

Keywords:

Correlation; Point cloud compression; Three-dimensional displays; Estimation; Feature extraction; Deformation; Deep learning; Point cloud; scene flow estimation; point-voxel correlation fields; deformations; SPARSE;

DOI:

10.1109/TPAMI.2023.3294355

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we propose Point-Voxel Correlation Fields to explore relations between two consecutive point clouds and estimate scene flow that represents 3D motions. Most existing works only consider local correlations, which are able to handle small movements but fail when there are large displacements. Therefore, it is essential to introduce all-pair correlation volumes that are free from local neighbor restrictions and cover both short- and long-term dependencies. However, it is challenging to efficiently extract correlation features from all-pairs fields in the 3D space, given the irregular and unordered nature of point clouds. To tackle this problem, we present point-voxel correlation fields, proposing distinct point and voxel branches to inquire about local and long-range correlations from all-pair fields respectively. To exploit point-based correlations, we adopt the K-Nearest Neighbors search that preserves fine-grained information in the local region, which guarantees the scene flow estimation precision. By voxelizing point clouds in a multi-scale manner, we construct pyramid correlation voxels to model long-range correspondences, which are utilized to handle fast-moving objects. Integrating these two types of correlations, we propose Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) architecture that employs an iterative scheme to estimate scene flow from point clouds. To adapt to different flow scope conditions and obtain more fine-grained results, we further propose Deformable PV-RAFT (DPV-RAFT), where the Spatial Deformation deforms the voxelized neighborhood, and the Temporal Deformation controls the iterative update process. We evaluate the proposed method on the FlyingThings3D and KITTI Scene Flow 2015 datasets and experimental results show that we outperform state-of-the-art methods by remarkable margins.
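The two correlation lookups described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify the feature extractor, the correlation normalisation, the voxel grid size, or the pooling rule, so the dot-product correlation scaled by sqrt(D), the 3x3x3 grid, the mean pooling, and the doubling of the voxel size per pyramid level are all assumptions chosen for illustration. The function names (`all_pair_correlation`, `point_branch`, `voxel_branch`) are hypothetical.

```python
import numpy as np


def all_pair_correlation(feat1, feat2):
    """Correlate every point of cloud 1 with every point of cloud 2.

    feat1: (N1, D), feat2: (N2, D) -> (N1, N2).
    The 1/sqrt(D) scaling is an assumption, not taken from the abstract.
    """
    d = feat1.shape[1]
    return feat1 @ feat2.T / np.sqrt(d)


def point_branch(corr, query, pts2, k):
    """Local lookup: correlation values of the K nearest cloud-2 points.

    corr: (N1, N2), query: (N1, 3), pts2: (N2, 3).
    Returns the gathered correlations (N1, k) and relative offsets (N1, k, 3).
    """
    dist2 = ((query[:, None, :] - pts2[None, :, :]) ** 2).sum(-1)  # (N1, N2)
    idx = np.argsort(dist2, axis=1)[:, :k]                         # (N1, k)
    knn_corr = np.take_along_axis(corr, idx, axis=1)
    offsets = pts2[idx] - query[:, None, :]
    return knn_corr, offsets


def voxel_branch(corr, query, pts2, base_size, num_levels, grid=3):
    """Long-range lookup: mean correlation per voxel around each query point.

    A grid x grid x grid neighborhood is centered on every query point and
    the voxel edge length doubles at each level (an assumed pyramid schedule).
    Returns an array of shape (N1, num_levels * grid**3).
    """
    n1 = corr.shape[0]
    half = grid // 2
    levels = []
    for level in range(num_levels):
        size = base_size * (2 ** level)
        rel = (pts2[None, :, :] - query[:, None, :]) / size      # (N1, N2, 3)
        cell = np.floor(rel + 0.5).astype(int) + half            # nearest cell index
        inside = ((cell >= 0) & (cell < grid)).all(-1)           # (N1, N2)
        flat = (cell[..., 0] * grid + cell[..., 1]) * grid + cell[..., 2]
        rows = np.broadcast_to(np.arange(n1)[:, None], flat.shape)
        total = np.zeros((n1, grid ** 3))
        count = np.zeros((n1, grid ** 3))
        np.add.at(total, (rows[inside], flat[inside]), corr[inside])
        np.add.at(count, (rows[inside], flat[inside]), 1.0)
        levels.append(total / np.maximum(count, 1.0))            # empty voxels stay 0
    return np.concatenate(levels, axis=1)


# Usage on random data: in an iterative scheme the query positions would be
# the source points shifted by the current flow estimate (zero at the start).
rng = np.random.default_rng(0)
pts1 = rng.normal(size=(128, 3))
pts2 = pts1 + 0.1
feat1 = rng.normal(size=(128, 16))
feat2 = rng.normal(size=(128, 16))

corr = all_pair_correlation(feat1, feat2)
flow = np.zeros_like(pts1)
query = pts1 + flow
knn_corr, offsets = point_branch(corr, query, pts2, k=8)
vox_corr = voxel_branch(corr, query, pts2, base_size=0.25, num_levels=3)
```

In this sketch the correlation volume is computed once and both branches only index into it, which is what makes repeated lookups around an updated query position cheap. The K-nearest-neighbor branch keeps exact point offsets for fine detail, while the coarser pyramid levels reach points far from the query at the cost of averaging.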


Pages: 13621-13635

Page count: 15

