Source-Free Image-Text Matching via Uncertainty-Aware Learning

被引:0
|
作者
Tian, Mengxiao [1 ,2 ]
Yang, Shuo [3 ]
Wu, Xinxiao [1 ,2 ]
Jia, Yunde [3 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[2] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China
[3] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China
关键词
Adaptation models; Uncertainty; Noise measurement; Data models; Training; Noise; Visualization; Measurement uncertainty; Computational modeling; Testing; Image-text matching; source-free adaptation; uncertainty-aware learning;
D O I
10.1109/LSP.2024.3488521
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
When applying a trained image-text matching model to a new scenario, the performance may largely degrade due to domain shift, which makes it impractical in real-world applications. In this paper, we make the first attempt on adapting the image-text matching model well-trained on a labeled source domain to an unlabeled target domain in the absence of source data, namely, source-free image-text matching. This task is challenging since it has no direct access to the source data when learning to reduce the doma in shift. To address this challenge, we propose a simple yet effective method that introduces uncertainty-aware learning to generate high-quality pseudo-pairs of image and text for target adaptation. Specifically, starting with using the pre-trained source model to retrieve several top-ranked image-text pairs from the target domain as pseudo-pairs, we then model uncertainty of each pseudo-pair by calculating the variance of retrieved texts (resp. images) given the paired image (resp. text) as query, and finally incorporate the uncertainty into an objective function to down-weight noisy pseudo-pairs for better training, thereby enhancing adaptation. This uncertainty-aware training approach can be generally applied on all existing models. Extensive experiments on the COCO and Flickr30K datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页码:3059 / 3063
页数:5
相关论文
共 50 条
  • [1] Uncertainty-Aware Source-Free Domain Adaptive Semantic Segmentation
    Lu, Zhihe
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    Hospedales, Timothy M. M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4664 - 4676
  • [2] Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer
    Ai, Yuang
    Zhou, Xiaoqiang
    Huang, Huaibo
    Zhang, Lei
    He, Ran
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8142 - 8152
  • [3] Cross-Modal Remote Sensing Image-Text Retrieval via Context and Uncertainty-Aware Prompt
    Wang, Yijing
    Tang, Xu
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [4] Uncertainty-aware pseudo-label filtering for source-free unsupervised domain adaptation
    Chen, Xi
    Yang, Haosen
    Zhang, Huicong
    Yao, Hongxun
    Zhu, Xiatian
    NEUROCOMPUTING, 2024, 575
  • [5] UPL-SFDA: Uncertainty-Aware Pseudo Label Guided Source-Free Domain Adaptation for Medical Image Segmentation
    Wu, Jianghao
    Wang, Guotai
    Gu, Ran
    Lu, Tao
    Chen, Yinan
    Zhu, Wentao
    Vercauteren, Tom
    Ourselin, Sebastien
    Zhang, Shaoting
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3932 - 3943
  • [6] A NEIGHBOR-AWARE APPROACH FOR IMAGE-TEXT MATCHING
    Liu, Chunxiao
    Mao, Zhendong
    Zang, Wenyu
    Wang, Bin
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3970 - 3974
  • [7] Reference-Aware Adaptive Network for Image-Text Matching
    Xiong, Guoxin
    Meng, Meng
    Zhang, Tianzhu
    Zhang, Dongming
    Zhang, Yongdong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9678 - 9691
  • [8] Knowledge Aware Semantic Concept Expansion for Image-Text Matching
    Shi, Botian
    Ji, Lei
    Lu, Pan
    Niu, Zhendong
    Duan, Nan
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5182 - 5189
  • [9] Rare-aware attention network for image-text matching
    Wang, Yan
    Su, Yuting
    Li, Wenhui
    Sun, Zhengya
    Wei, Zhiqiang
    Nie, Jie
    Li, Xuanya
    Liu, An-An
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [10] Negative-Aware Attention Framework for Image-Text Matching
    Zhang, Kun
    Mao, Zhendong
    Wang, Quan
    Zhang, Yongdong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15640 - 15649