Misalignment-Robust Joint Filter for Cross-Modal Image Pairs

Cited by: 22
Authors
Shibata, Takashi [1]
Tanaka, Masayuki [2]
Okutomi, Masatoshi [2]
Affiliations
[1] NEC Corp Ltd, Tokyo, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
DOI
10.1109/ICCV.2017.357
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Although several powerful joint filters for cross-modal image pairs have been proposed, existing joint filters generate severe artifacts when the target and guidance images are misaligned. Our goal is to generate an artifact-free output image even from misaligned target and guidance images. We propose a novel misalignment-robust joint filter based on weight-volume-based image composition and a joint-filter cost volume. The proposed method first generates a set of translated guidance images. Next, the joint-filter cost volume and a set of filtered images are computed from the target image and the set of translated guidance images. Then, a weight volume is obtained from the joint-filter cost volume while taking spatial smoothness and label sparseness into account. The final output image is composed by fusing the set of filtered images using the weight volume. The key idea is to generate the final output image directly from the set of filtered images by weighted averaging, with the weight volume obtained from the joint-filter cost volume. The proposed framework is widely applicable and can incorporate any kind of joint filter. Experimental results show that the proposed method is effective for various applications, including image denoising, image up-sampling, haze removal, and depth map interpolation.
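The pipeline described in the abstract (translated guidances, per-translation filtering, cost volume, weight volume, weighted fusion) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the cost or the weight optimization, so the sketch assumes a toy joint bilateral filter, a box-smoothed absolute residual as the cost, and a low-temperature softmax as a crude stand-in for the smoothness and sparseness terms. All function names and parameters (`translate`, `joint_filter`, `robust_joint_filter`, `tau`, `smooth_radius`) are hypothetical.

```python
import numpy as np

def translate(img, dy, dx):
    """Circularly shift an image (wrap-around borders are a simplification)."""
    return np.roll(img, shift=(dy, dx), axis=(0, 1))

def box_mean(img, radius):
    """Mean over a (2*radius+1)^2 window with circular borders."""
    acc = np.zeros_like(img, dtype=float)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            acc += translate(img, dy, dx)
    return acc / (2 * radius + 1) ** 2

def joint_filter(target, guide, radius=2, sigma_r=0.1):
    """Toy joint bilateral filter: box spatial window, Gaussian range
    kernel computed on the guide. Any joint filter could be used here."""
    num = np.zeros_like(target, dtype=float)
    den = np.zeros_like(target, dtype=float)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            diff = translate(guide, dy, dx) - guide
            w = np.exp(-(diff ** 2) / (2 * sigma_r ** 2))
            num += w * translate(target, dy, dx)
            den += w
    return num / den  # den >= 1 because the zero offset has weight 1

def robust_joint_filter(target, guide, max_shift=3, tau=0.01, smooth_radius=1):
    """Fuse filtered images over a set of guide translations."""
    shifts = [(dy, dx)
              for dy in range(-max_shift, max_shift + 1)
              for dx in range(-max_shift, max_shift + 1)]
    # One filtered image per translated guidance.
    filtered = np.stack([joint_filter(target, translate(guide, dy, dx))
                         for dy, dx in shifts])
    # Assumed cost: spatially smoothed absolute residual to the target.
    cost = np.stack([box_mean(np.abs(f - target), smooth_radius)
                     for f in filtered])
    # Assumed weight volume: per-pixel softmax over labels; a small tau
    # concentrates the weight on few labels.
    logits = -(cost - cost.min(axis=0, keepdims=True)) / tau
    weights = np.exp(logits)
    weights /= weights.sum(axis=0, keepdims=True)
    # Final image: weighted average of the filtered images.
    return (weights * filtered).sum(axis=0), weights

# Synthetic example: a step edge, with the guide shifted 3 pixels sideways.
target = np.zeros((32, 32))
target[:, 16:] = 1.0
guide = translate(target, 0, 3)

naive = joint_filter(target, guide)
robust, weights = robust_joint_filter(target, guide)
print("naive  mean abs error:", np.abs(naive - target).mean())
print("robust mean abs error:", np.abs(robust - target).mean())
```

In this toy setting the misaligned guide blurs the target edge under the plain joint filter, while the fused result draws almost all of its weight from the translations that realign the guide. The softmax is only a placeholder; the method in the abstract obtains the weight volume by explicitly considering spatial smoothness and label sparseness.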
Pages: 3315 - 3324
Page count: 10
Related papers
50 in total
  • [1] Cross-modal misalignment-robust feature fusion for crowd counting
    Kong, Weihang
    Yu, Zepeng
    Li, He
    Zhang, Junge
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [2] Misalignment-Robust Pedestrian Detection Framework for Visible and Far-Infrared Image Pairs
    Shibata, Takashi
    Sawada, Azusa
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2019
  • [3] Misalignment-Robust Frequency Distribution Loss for Image Transformation
    Ni, Zhangkai
    Wu, Juncheng
    Wang, Zian
    Yang, Wenhan
    Wang, Hanli
    Ma, Lin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024: 2910 - 2919
  • [4] Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
    Han, Haochen
    Zheng, Qinghua
    Dai, Guang
    Luo, Minnan
    Wang, Jingdong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 26669 - 26678
  • [5] Misalignment-robust, edge-based image fusion method
    Xi, Cai
    Wei, Zhao
    OPTICAL ENGINEERING, 2012, 51 (07)
  • [6] Cross-Modal Retrieval With Partially Mismatched Pairs
    Hu, Peng
    Huang, Zhenyu
    Peng, Dezhong
    Wang, Xu
    Peng, Xi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9595 - 9610
  • [7] Cross-Modal Joint Prediction and Alignment for Composed Query Image Retrieval
    Yang, Yuchen
    Wang, Min
    Zhou, Wengang
    Li, Houqiang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 3303 - 3311
  • [8] Joint feature approach for image-text cross-modal retrieval
    Gao, Dihui
    Sheng, Lijie
    Xu, Xiaodong
    Miao, Qiguang
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (04): 128 - 138
  • [9] Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning
    Liu, Zhiyue
    Liu, Jinyuan
    Ma, Fanrong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024: 3864 - 3872
  • [10] Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
    Mithun, Niluthpol Chowdhury
    Panda, Rameswar
    Papalexakis, Evangelos E.
    Roy-Chowdhury, Amit K.
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018: 1856 - 1864