3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction

被引:4
|
作者
Tiong, Leslie Ching Ow [1 ]
Sigmund, Dick [2 ]
Teoh, Andrew Beng Jin [3 ]
机构
[1] Korea Inst Sci & Technol, Computat Sci Res Ctr, 5 Hwarang Ro 14 Gil, Seoul 02792, South Korea
[2] AIDOT Inc, 128 Beobwon Ro, Seoul 05854, South Korea
[3] Yonsei Univ, Sch Elect & Elect Engn, Seoul 120749, South Korea
来源
关键词
Multi-view 3D reconstruction; Coarse-to-fine transformer; Multi-scale attention;
D O I
10.1007/978-3-031-26319-4_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism to explore the multi-view features and exploit their relations for reinforcing the encoding-decoding modules. This paper proposes a new model, namely 3D coarse-to-fine transformer (3D-C2FT), by introducing a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective voxel-based 3D objects. C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse to fine-grained manner. The proposed model is evaluated by ShapeNet and Multi-view Real-life voxel-based datasets. Experimental results show that 3D-C2FT achieves notable results and outperforms several competing models on these datasets.
引用
收藏
页码:211 / 227
页数:17
相关论文
共 50 条
  • [21] Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction
    Moliner, Olivier
    Huang, Sangxia
    Astrom, Kalle
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [22] A Real World Dataset for Multi-view 3D Reconstruction
    Shrestha, Rakesh
    Hu, Siqi
    Gou, Minghao
    Liu, Ziyuan
    Tan, Ping
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 56 - 73
  • [23] Underwater 3D reconstruction based on multi-view stereo
    Gu, Feifei
    Zhao, Juan
    Xu, Pei
    Huang, Shulan
    Zhang, Gaopeng
    Song, Zhan
    OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
  • [24] MVLayoutNet: 3D Layout Reconstruction with Multi-view Panoramas
    Hu, Zhihua
    Duan, Bo
    Zhang, Yanfeng
    Sun, Mingwei
    Huang, Jingwei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1289 - 1298
  • [25] OPTIMIZING CAMERA POSITIONS FOR MULTI-VIEW 3D RECONSTRUCTION
    Qian, Ningqing
    Lo, Chao-Yang
    2015 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2015,
  • [26] 3D BIOLOGICAL CELL RECONSTRUCTION WITH MULTI-VIEW GEOMETRY
    Lei, Yang
    Shkolnikov, Viktor
    Xin, Daisy
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 495 - 498
  • [27] Multi-view 3D Reconstruction by Fusing Polarization Information
    Hu, Gaomei
    Zhao, Haimeng
    Hu, Qirun
    Zhu, Jianfang
    Yang, Peng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 181 - 195
  • [28] 3D ear reconstruction attempts: Using multi-view
    Liu, Heng
    Yan, Jingqi
    Zhang, David
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 578 - 583
  • [29] Multi-view 3D Reconstruction with Self-attention
    Qian, Qiuting
    2021 14TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2021), 2021, : 20 - 26
  • [30] Cross-view Transformer for enhanced multi-view 3D reconstructionCross-view Transformer for enhanced multi-view 3D reconstructionW. Shi et al.
    Wuzhen Shi
    Aixue Yin
    Yingxiang Li
    Bo Qian
    The Visual Computer, 2025, 41 (7) : 4865 - 4877