Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval

被引:0
|
作者
Wang, Wenjie [1 ]
Shi, Yufeng [1 ]
Chen, Shiming [1 ]
Peng, Qinmu [1 ]
Zheng, Feng [3 ]
You, Xinge [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Shenzhen Res Inst, Wuhan, Peoples R China
[3] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot sketch-based image retrieval (ZS-SBIR), which aims to retrieve photos with sketches under the zero-shot scenario, has shown extraordinary talents in real-world applications. Most existing methods leverage language models to generate class-prototypes and use them to arrange the locations of all categories in the common space for photos and sketches. Although great progress has been made, few of them consider whether such pre-defined prototypes are necessary for ZS-SBIR, where locations of unseen class samples in the embedding space are actually determined by visual appearance and a visual embedding actually performs better. To this end, we propose a novel Norm-guided Adaptive Visual Embedding (NAVE) model, for adaptively building the common space based on visual similarity instead of language-based pre-defined prototypes. To further enhance the representation quality of unseen classes for both photo and sketch modality, modality norm discrepancy and noisy label regularizer are jointly employed to measure and repair the modality bias of the learned common embedding. Experiments on two challenging datasets demonstrate the superiority of our NAVE over state-of-the-art competitors.
引用
收藏
页码:1106 / 1112
页数:7
相关论文
共 50 条
  • [41] Zero-Shot Sketch-Based Remote-Sensing Image Retrieval Based on Multi-Level and Attention-Guided Tokenization
    Yang, Bo
    Wang, Chen
    Ma, Xiaoshuang
    Song, Beiping
    Liu, Zhuang
    Sun, Fangde
    REMOTE SENSING, 2024, 16 (10)
  • [42] Progressive Domain-Independent Feature Decomposition Network for Zero-Shot Sketch-Based Image Retrieval
    Xu, Xinxun
    Yang, Muli
    Yang, Yanhua
    Wang, Hao
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 984 - 990
  • [43] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Jiao, Shichao
    Han, Xie
    Xiong, Fengguang
    Yang, Xiaowen
    Han, Huiyan
    He, Ligang
    Kuang, Liqun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 13469 - 13483
  • [44] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Shichao Jiao
    Xie Han
    Fengguang Xiong
    Xiaowen Yang
    Huiyan Han
    Ligang He
    Liqun Kuang
    Neural Computing and Applications, 2022, 34 : 13469 - 13483
  • [45] Cross-Modal Visual Correspondences Learning Without External Semantic Information for Zero-Shot Sketch-Based Image Retrieval
    Gao, Zhijie
    Wang, Kai
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 342 - 353
  • [46] A Zero-Shot Sketch-Based Intermodal Object Retrieval Scheme for Remote Sensing Images
    Chaudhuri, Ushasi
    Banerjee, Biplab
    Bhattacharya, Avik
    Datcu, Mihai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [47] Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval
    Pandey, Anubha
    Mishra, Ashish
    Verma, Vinay Kumar
    Mittal, Anurag
    Murthy, Hema A.
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2529 - 2538
  • [48] Zero-Shot Sketch Based Image Retrieval Using Graph Transformer
    Gupta, Sumrit
    Chaudhuri, Ushasi
    Banerjee, Biplab
    Kumar, Saurabh
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1685 - 1691
  • [49] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Ming Zhu
    Chen Zhao
    Nian Wang
    Feiyang Gu
    Yu Liu
    Xin Li
    The Visual Computer, 2024, 40 : 3091 - 3101
  • [50] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Zhu, Ming
    Zhao, Chen
    Wang, Nian
    Gu, Feiyang
    Liu, Yu
    Li, Xin
    VISUAL COMPUTER, 2024, 40 (05): : 3091 - 3101