3D Point-Voxel Correlation Fields for Scene Flow Estimation

被引：4

作者：

Wang, Ziyi ^{[1
]}

Wei, Yi ^{[1
]}

Rao, Yongming ^{[1
]}

Zhou, Jie ^{[1
]}

Lu, Jiwen ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 11期

关键词：

Correlation; Point cloud compression; Three-dimensional displays; Estimation; Feature extraction; Deformation; Deep learning; Point cloud; scene flow estimation; point-voxel correlation fields; deformations; SPARSE;

D O I：

10.1109/TPAMI.2023.3294355

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose Point-Voxel Correlation Fields to explore relations between two consecutive point clouds and estimate scene flow that represents 3D motions. Most existing works only consider local correlations, which are able to handle small movements but fail when there are large displacements. Therefore, it is essential to introduce all-pair correlation volumes that are free from local neighbor restrictions and cover both short- and long-term dependencies. However, it is challenging to efficiently extract correlation features from all-pairs fields in the 3D space, given the irregular and unordered nature of point clouds. To tackle this problem, we present point-voxel correlation fields, proposing distinct point and voxel branches to inquire about local and long-range correlations from all-pair fields respectively. To exploit point-based correlations, we adopt the K-Nearest Neighbors search that preserves fine-grained information in the local region, which guarantees the scene flow estimation precision. By voxelizing point clouds in a multi-scale manner, we construct pyramid correlation voxels to model long-range correspondences, which are utilized to handle fast-moving objects. Integrating these two types of correlations, we propose Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) architecture that employs an iterative scheme to estimate scene flow from point clouds. To adapt to different flow scope conditions and obtain more fine-grained results, we further propose Deformable PV-RAFT (DPV-RAFT), where the Spatial Deformation deforms the voxelized neighborhood, and the Temporal Deformation controls the iterative update process. We evaluate the proposed method on the FlyingThings3D and KITTI Scene Flow 2015 datasets and experimental results show that we outperform state-of-the-art methods by remarkable margins.

引用

页码：13621 / 13635

页数：15

共 50 条

[1] PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds
Wei, Yi
Wang, Ziyi
Rao, Yongming
Lu, Jiwen
Zhou, Jie
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6950 - 6959
[2] Point-Voxel Fusion for Multimodal 3D Detection
Wang, Ke
Zhang, Zhichuang
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1716 - 1719
[3] Point-Voxel CNN for Efficient 3D Deep Learning
Liu, Zhijian
Tang, Haotian
Lin, Yujun
Han, Song
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[4] Point-voxel dual stream transformer for 3d point cloud learning
Zhao, Tianmeng
Zeng, Hui
Zhang, Baoqing
Fan, Bin
Li, Chen
VISUAL COMPUTER, 2024, 40 (08): : 5323 - 5339
[5] 3D Shape Generation and Completion through Point-Voxel Diffusion
Zhou, Linqi
Du, Yilun
Wu, Jiajun
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5806 - 5815
[6] PVNAS: 3D Neural Architecture Search With Point-Voxel Convolution
Liu, Zhijian
Tang, Haotian
Zhao, Shengyu
Shao, Kevin
Han, Song
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8552 - 8568
[7] PVDECONV: POINT-VOXEL DECONVOLUTION FOR AUTOENCODING CAD CONSTRUCTION IN 3D
Cherenkova, Kseniya
Aouada, Djamila
Gusev, Gleb
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2741 - 2745
[8] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
Fan, Baojie
Zhang, Kexin
Tian, Jiandong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
[9] Point-Voxel Based Geometry-Adaptive Network for 3D Point Cloud Analysis
Zhao, Tian-Meng
Zeng, Hui
Zhang, Bao-Qing
Liu, Hong-Min
Fan, Bin
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (05) : 1167 - 1179
[10] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
Yu, Bingxin
Wang, Lu
He, Yuhong
Wang, Xiaoyang
Cheng, Jun
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176

← 1 2 3 4 5 →