3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction

被引：4

作者：

Tiong, Leslie Ching Ow ^{[1
]}

Sigmund, Dick ^{[2
]}

Teoh, Andrew Beng Jin ^{[3
]}

机构：

[1] Korea Inst Sci & Technol, Computat Sci Res Ctr, 5 Hwarang Ro 14 Gil, Seoul 02792, South Korea

[2] AIDOT Inc, 128 Beobwon Ro, Seoul 05854, South Korea

[3] Yonsei Univ, Sch Elect & Elect Engn, Seoul 120749, South Korea

来源：

COMPUTER VISION - ACCV 2022, PT I | 2023年 / 13841卷

关键词：

Multi-view 3D reconstruction; Coarse-to-fine transformer; Multi-scale attention;

D O I：

10.1007/978-3-031-26319-4_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism to explore the multi-view features and exploit their relations for reinforcing the encoding-decoding modules. This paper proposes a new model, namely 3D coarse-to-fine transformer (3D-C2FT), by introducing a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective voxel-based 3D objects. C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse to fine-grained manner. The proposed model is evaluated by ShapeNet and Multi-view Real-life voxel-based datasets. Experimental results show that 3D-C2FT achieves notable results and outperforms several competing models on these datasets.

引用

页码：211 / 227

页数：17

共 50 条

[21] Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction
Moliner, Olivier
Huang, Sangxia
Astrom, Kalle
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
[22] A Real World Dataset for Multi-view 3D Reconstruction
Shrestha, Rakesh
Hu, Siqi
Gou, Minghao
Liu, Ziyuan
Tan, Ping
COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 56 - 73
[23] Underwater 3D reconstruction based on multi-view stereo
Gu, Feifei
Zhao, Juan
Xu, Pei
Huang, Shulan
Zhang, Gaopeng
Song, Zhan
OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
[24] MVLayoutNet: 3D Layout Reconstruction with Multi-view Panoramas
Hu, Zhihua
Duan, Bo
Zhang, Yanfeng
Sun, Mingwei
Huang, Jingwei
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1289 - 1298
[25] OPTIMIZING CAMERA POSITIONS FOR MULTI-VIEW 3D RECONSTRUCTION
Qian, Ningqing
Lo, Chao-Yang
2015 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2015,
[26] 3D BIOLOGICAL CELL RECONSTRUCTION WITH MULTI-VIEW GEOMETRY
Lei, Yang
Shkolnikov, Viktor
Xin, Daisy
2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 495 - 498
[27] Multi-view 3D Reconstruction by Fusing Polarization Information
Hu, Gaomei
Zhao, Haimeng
Hu, Qirun
Zhu, Jianfang
Yang, Peng
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 181 - 195
[28] 3D ear reconstruction attempts: Using multi-view
Liu, Heng
Yan, Jingqi
Zhang, David
INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 578 - 583
[29] Multi-view 3D Reconstruction with Self-attention
Qian, Qiuting
2021 14TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2021), 2021, : 20 - 26
[30] Cross-view Transformer for enhanced multi-view 3D reconstructionCross-view Transformer for enhanced multi-view 3D reconstructionW. Shi et al.
Wuzhen Shi
Aixue Yin
Yingxiang Li
Bo Qian
The Visual Computer, 2025, 41 (7) : 4865 - 4877

← 1 2 3 4 5 →