Unsupervised Deep Single-Image Intrinsic Decomposition using Illumination-Varying Image Sequences

被引:39
|
作者
Lettry, L. [1 ]
Vanhoey, K. [1 ,2 ]
Van Gool, L. [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Unity Technol, San Francisco, CA USA
[3] Katholieke Univ Leuven, PSI ESAT, Leuven, Belgium
基金
瑞士国家科学基金会;
关键词
D O I
10.1111/cgf.13578
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Machine learning based Single Image Intrinsic Decomposition (SIID) methods decompose a captured scene into its albedo and shading images by using the knowledge of a large set of known and realistic ground truth decompositions. Collecting and annotating such a dataset is an approach that cannot scale to sufficient variety and realism. We free ourselves from this limitation by training on unannotated images. Our method leverages the observation that two images of the same scene but with different lighting provide useful information on their intrinsic properties: by definition, albedo is invariant to lighting conditions, and cross-combining the estimated albedo of a first image with the estimated shading of a second one should lead back to the second one's input image. We transcribe this relationship into a siamese training scheme for a deep convolutional neural network that decomposes a single image into albedo and shading. The siamese setting allows us to introduce a new loss function including such cross-combinations, and to train solely on (time-lapse) images, discarding the need for any ground truth annotations. As a result, our method has the good properties of i) taking advantage of the time-varying information of image sequences in the (pre-computed) training step, ii) not requiring ground truth data to train on, and iii) being able to decompose single images of unseen scenes at runtime. To demonstrate and evaluate our work, we additionally propose a new rendered dataset containing illumination-varying scenes and a set of quantitative metrics to evaluate SIID algorithms. Despite its unsupervised nature, our results compete with state of the art methods, including supervised and non data-driven methods.
引用
收藏
页码:409 / 419
页数:11
相关论文
共 50 条
  • [31] Intrinsic Omnidirectional Image Decomposition With Illumination Pre-Extraction
    Xu, Rong-Kai
    Zhang, Lei
    Zhang, Fang-Lue
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 4416 - 4428
  • [32] Single-Image Defogging Algorithm Based on Deep Learning
    Zhao Jiantang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (11)
  • [33] ConceptExpress: Harnessing Diffusion Models for Single-Image Unsupervised Concept Extraction
    Hao, Shaozhe
    Han, Kai
    Lv, Zhengyao
    Zhao, Shihao
    Wong, Kwan-Yee K.
    COMPUTER VISION - ECCV 2024, PT LIX, 2025, 15117 : 215 - 233
  • [34] Intrinsic Image Decomposition Using Paradigms
    Forsyth, David
    Rock, Jason J.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7624 - 7637
  • [35] Intrinsic decomposition from a single spectral image
    Chen, Xi
    Zhu, Weixin
    Zhao, Yang
    Yu, Yao
    Zhou, Yu
    Yue, Tao
    Du, Sidan
    Cao, Xun
    APPLIED OPTICS, 2017, 56 (20) : 5676 - 5684
  • [36] DerainAttentionGAN: unsupervised single-image deraining using attention-guided generative adversarial networks
    Guo, ZhaoKang
    Hou, Mingzheng
    Sima, Mingjun
    Feng, ZiLiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (01) : 185 - 192
  • [37] Decomposing intrinsic images from a single fabric image using an unsupervised method
    Xiang, Jun
    Wang, Jingan
    Pan, Ruru
    Gao, Weidong
    JOURNAL OF THE TEXTILE INSTITUTE, 2024, 115 (01) : 87 - 96
  • [38] DerainAttentionGAN: unsupervised single-image deraining using attention-guided generative adversarial networks
    ZhaoKang Guo
    Mingzheng Hou
    Mingjun Sima
    ZiLiang Feng
    Signal, Image and Video Processing, 2022, 16 : 185 - 192
  • [39] DROP-DIP: A SINGLE-IMAGE DENOISING METHOD BASED ON DEEP IMAGE PRIOR
    Zhang, Xueding
    LI, Zhemin
    Wang, Hongxia
    JOURNAL OF NONLINEAR AND VARIATIONAL ANALYSIS, 2023, 7 (04): : 505 - 526
  • [40] Leveraging Multi-View Image Sets for Unsupervised Intrinsic Image Decomposition and Highlight Separation
    Yi, Renjiao
    Tan, Ping
    Lin, Stephen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12685 - 12692