Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Authors
Yue, Yuanwen [1]
Das, Anurag [2]
Engelmann, Francis [1, 3]
Tang, Siyu [1]
Lenssen, Jan Eric [2]
Affiliations
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[3] Google, Zurich, Switzerland
Source
COMPUTER VISION - ECCV 2024, PT II | 2025 / Vol. 15060
Keywords
Representation learning; Foundation models; Gaussian splatting; Scene understanding;
DOI
10.1007/978-3-031-72627-9_4
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Current visual foundation models are trained purely on unstructured 2D data, limiting their understanding of the 3D structure of objects and scenes. In this work, we show that fine-tuning on 3D-aware data improves the quality of emerging semantic features. We design a method to lift semantic 2D features into an efficient 3D Gaussian representation, which allows us to re-render them for arbitrary views. Using the rendered 3D-aware features, we design a fine-tuning strategy to transfer such 3D awareness into a 2D foundation model. We demonstrate that models fine-tuned in that way produce features that readily improve downstream task performance in semantic segmentation and depth estimation through simple linear probing. Notably, though fine-tuned on a single indoor dataset, the improvement is transferable to a variety of indoor datasets and out-of-domain datasets. We hope our study encourages the community to consider injecting 3D awareness when training 2D foundation models. Project page: https://ywyue.github.io/FiT3D.
Pages: 57-74
Page count: 18