3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction

被引：4

作者：

Tiong, Leslie Ching Ow ^{[1
]}

Sigmund, Dick ^{[2
]}

Teoh, Andrew Beng Jin ^{[3
]}

机构：

[1] Korea Inst Sci & Technol, Computat Sci Res Ctr, 5 Hwarang Ro 14 Gil, Seoul 02792, South Korea

[2] AIDOT Inc, 128 Beobwon Ro, Seoul 05854, South Korea

[3] Yonsei Univ, Sch Elect & Elect Engn, Seoul 120749, South Korea

来源：

COMPUTER VISION - ACCV 2022, PT I | 2023年 / 13841卷

关键词：

Multi-view 3D reconstruction; Coarse-to-fine transformer; Multi-scale attention;

D O I：

10.1007/978-3-031-26319-4_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism to explore the multi-view features and exploit their relations for reinforcing the encoding-decoding modules. This paper proposes a new model, namely 3D coarse-to-fine transformer (3D-C2FT), by introducing a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective voxel-based 3D objects. C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse to fine-grained manner. The proposed model is evaluated by ShapeNet and Multi-view Real-life voxel-based datasets. Experimental results show that 3D-C2FT achieves notable results and outperforms several competing models on these datasets.

引用

页码：211 / 227

页数：17

共 50 条

[31] Multi-view convolutional vision transformer for 3D object recognition
Li, Jie
Liu, Zhao
Li, Li
Lin, Junqin
Yao, Jian
Tu, Jingmin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
[32] Coarse-to-fine multiview 3d face reconstruction using multiple geometrical features
Dai, Peng
Wang, Xue
Zhang, Weihang
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (01) : 939 - 966
[33] 3D mouse brain reconstruction from histology using a coarse-to-fine approach
Yushkevich, Paul A.
Avants, Brian B.
Ng, Lydia
Hawrylycz, Michael
Burstein, Pablo D.
Zhang, Hui
Gee, James C.
BIOMEDICAL IMAGE REGISTRATION, PROCEEDINGS, 2006, 4057 : 230 - 237
[34] Multi-View Depth Completion with Coarse-to-Fine Networks
Songsong, Yu
Wang, Haiting
Wang, Lijun
Wang, Yifan
Lu, Huchuan
SSRN,
[35] 3D Reconstruction of Aircraft Structures via 2D Multi-view Images
Zhang, Tianyou
Fan, Runze
Zhang, Yu
Feng, Guangkun
Wei, Zhenzhong
TENTH INTERNATIONAL SYMPOSIUM ON PRECISION MECHANICAL MEASUREMENTS, 2021, 12059
[36] Multi-view 3D reconstruction of seedling using 2D image contour
Chen, Qingguang
Huang, Shentao
Liu, Shuang
Zhong, Mingwei
Zhang, Guohao
Song, Liang
Zhang, Xinghao
Zhang, Jingcheng
Wu, Kaihua
Ye, Ziran
Kong, Dedong
BIOSYSTEMS ENGINEERING, 2024, 243 : 130 - 147
[37] Coarse-to-fine multiview 3d face reconstruction using multiple geometrical features
Peng Dai
Xue Wang
Weihang Zhang
Multimedia Tools and Applications, 2018, 77 : 939 - 966
[38] Coarse-to-fine stereo vision with accurate 3D boundaries
Sizintsev, Mikhail
Wildes, Richard P.
IMAGE AND VISION COMPUTING, 2010, 28 (03) : 352 - 366
[39] A coarse-to-fine keypoint detection method for 3D model
1600, International Frequency Sensor Association, 46 Thorny Vineway, Toronto, ON M2J 4J2, Canada (160):
[40] Coarse-to-fine fusion for language grounding in 3D navigation
Nguyen, Thanh Tin
Vo, Anh H.
Choi, Soo-Mi
Kim, Yong-Guk
KNOWLEDGE-BASED SYSTEMS, 2023, 277

← 1 2 3 4 5 →