3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction

Cited by: 4
Authors
Tiong, Leslie Ching Ow [1]
Sigmund, Dick [2]
Teoh, Andrew Beng Jin [3]
Affiliations
[1] Korea Inst Sci & Technol, Computat Sci Res Ctr, 5 Hwarang Ro 14 Gil, Seoul 02792, South Korea
[2] AIDOT Inc, 128 Beobwon Ro, Seoul 05854, South Korea
[3] Yonsei Univ, Sch Elect & Elect Engn, Seoul 120749, South Korea
Keywords
Multi-view 3D reconstruction; Coarse-to-fine transformer; Multi-scale attention
DOI
10.1007/978-3-031-26319-4_13
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism that explores multi-view features and exploits their relations to reinforce the encoding-decoding modules. This paper proposes a new model, the 3D coarse-to-fine transformer (3D-C2FT), which introduces a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective voxel-based 3D objects. The C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse-to-fine manner. The proposed model is evaluated on the ShapeNet and Multi-view Real-life voxel-based datasets. Experimental results show that 3D-C2FT outperforms several competing models on these datasets.
Pages: 211-227
Page count: 17