The Potential of Diffusion-Based Near-Infrared Image Colorization

被引:1
|
作者
Borstelmann, Ayk [1 ]
Haucke, Timm [1 ,2 ]
Steinhage, Volker [1 ]
机构
[1] Univ Bonn, Inst Comp Sci 4, Friedrich Hirzebruch Allee 8, D-53115 Bonn, Germany
[2] MIT, Comp Sci & Artificial Intelligence Lab, 32 Vassar St, Cambridge, MA 02139 USA
关键词
near-infrared; diffusion models; camera trapping; unpaired dataset; neural networks; machine learning;
D O I
10.3390/s24051565
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Camera traps, an invaluable tool for biodiversity monitoring, capture wildlife activities day and night. In low-light conditions, near-infrared (NIR) imaging is commonly employed to capture images without disturbing animals. However, the reflection properties of NIR light differ from those of visible light in terms of chrominance and luminance, creating a notable gap in human perception. Thus, the objective is to enrich near-infrared images with colors, thereby bridging this domain gap. Conventional colorization techniques are ineffective due to the difference between NIR and visible light. Moreover, regular supervised learning methods cannot be applied because paired training data are rare. Solutions to such unpaired image-to-image translation problems currently commonly involve generative adversarial networks (GANs), but recently, diffusion models gained attention for their superior performance in various tasks. In response to this, we present a novel framework utilizing diffusion models for the colorization of NIR images. This framework allows efficient implementation of various methods for colorizing NIR images. We show NIR colorization is primarily controlled by the translation of the near-infrared intensities to those of visible light. The experimental evaluation of three implementations with increasing complexity shows that even a simple implementation inspired by visible-near-infrared (VIS-NIR) fusion rivals GANs. Moreover, we show that the third implementation is capable of outperforming GANs. With our study, we introduce an intersection field joining the research areas of diffusion models, NIR colorization, and VIS-NIR fusion.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Integrated Near-Infrared Spectral Sensor based on Near-Infrared Detector Arrays
    Hakkel, Kaylee D.
    Petruzzella, Maurangelo
    Ou, Fang
    van Klinken, Anne
    Pagliano, Francesco
    Liu, Tianran
    van Veldhoven, Rene P. J.
    Fiore, Andrea
    2021 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2021,
  • [32] NEAR-INFRARED IMAGE GUIDED REFLECTION REMOVAL
    Hong, Yuchen
    Lyu, Youwei
    Li, Si
    Shi, Boxin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [33] NEAR-INFRARED GUIDED COLOR IMAGE DEHAZING
    Feng, Chen
    Zhuo, Shaojie
    Zhang, Xiaopeng
    Shen, Liang
    Suesstrunk, Sabine
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2363 - 2367
  • [34] An algorithm of image dehazing using near-infrared
    Cheng, Peng
    Lan, Shi-Yong
    Li, Xiao-Feng
    Li, Xin-Sheng
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2013, 45 (SUPPL2): : 155 - 159
  • [35] Near-Infrared Image Filtering for Pedestrian Surveillance
    Rodhouse, Kathryn N.
    Watkins, Steve E.
    NONDESTRUCTIVE CHARACTERIZATION FOR COMPOSITE MATERIALS, AEROSPACE ENGINEERING, CIVIL INFRASTRUCTURE, AND HOMELAND SECURITY 2012, 2012, 8347
  • [36] NEAR-INFRARED VIDEO IMAGE-ANALYSIS
    ROBERT, P
    DEVAUX, MF
    BERTRAND, D
    SCIENCES DES ALIMENTS, 1991, 11 (04) : 565 - 574
  • [37] COLOR IMAGE DEHAZING USING THE NEAR-INFRARED
    Schaul, Lex
    Fredembach, Clement
    Suesstrunk, Sabine
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1629 - 1632
  • [38] Learning Sparse Masks for Diffusion-Based Image Inpainting
    Alt, Tobias
    Peter, Pascal
    Weickert, Joachim
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022), 2022, 13256 : 528 - 539
  • [39] Diffusion-based image denoising combining curvelet and wavelet
    Ashamol, V. G.
    Sreelekha, G.
    Sathidevi, P. S.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 169 - 172
  • [40] Text-image Alignment for Diffusion-based Perception
    Kondapanenil, Neehar
    Marksl, Markus
    Knott, Manuel
    Guimaraes, Rogerio
    Perona, Pietro
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13883 - 13893