StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval

被引:45
|
作者
Sain, Aneeshan [1 ,2 ]
Bhunia, Ayan Kumar [1 ]
Yang, Yongxin [1 ,2 ]
Xiang, Tao [1 ,2 ]
Song, Yi-Zhe [1 ,2 ]
机构
[1] Univ Surrey, CVSSP, SketchX, Guildford, Surrey, England
[2] iFlyTek Surrey Joint Res Ctr Artificial Intellige, Guildford, Surrey, England
关键词
D O I
10.1109/CVPR46437.2021.00840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based image retrieval (SBIR) is a cross-modal matching problem which is typically solved by learning a joint embedding space where the semantic content shared between photo and sketch modalities are preserved. However, a fundamental challenge in SBIR has been largely ignored so far, that is, sketches are drawn by humans and considerable style variations exist amongst different users. An effective SBIR model needs to explicitly account for this style diversity, crucially, to generalise to unseen user styles. To this end, a novel style-agnostic SBIR model is proposed. Different from existing models, a cross-modal variational autoencoder (VAE) is employed to explicitly disentangle each sketch into a semantic content part shared with the corresponding photo, and a style part unique to the sketcher. Importantly, to make our model dynamically adaptable to any unseen user styles, we propose to metatrain our cross-modal VAE by adding two style-adaptive components: a set of feature transformation layers to its encoder and a regulariser to the disentangled semantic content latent code. With this meta-learning framework, our model can not only disentangle the cross-modal shared semantic content for SBIR, but can adapt the disentanglement to any unseen user style as well, making the SBIR model truly style-agnostic. Extensive experiments show that our style-agnostic model yields state-of-the-art performance for both category-level and instance-level SBIR.
引用
收藏
页码:8500 / 8509
页数:10
相关论文
共 50 条
  • [1] Towards Human Performance on Sketch-Based Image Retrieval
    Seddati, Omar
    Dupont, Stephane
    Mahmoudi, Said
    Dutoit, Thierry
    19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 77 - 83
  • [2] A survey of sketch-based image retrieval
    Yi Li
    Wenzhao Li
    Machine Vision and Applications, 2018, 29 : 1083 - 1100
  • [3] SKETCH-BASED AERIAL IMAGE RETRIEVAL
    Jiang, Tianbi
    Xia, Gui-Song
    Lu, Qikai
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3690 - 3694
  • [4] A survey of sketch-based image retrieval
    Li, Yi
    Li, Wenzhao
    MACHINE VISION AND APPLICATIONS, 2018, 29 (07) : 1083 - 1100
  • [5] Sketch-based Image Retrieval using Sketch Tokens
    Wang, Shu
    Miao, Zhenjiang
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 396 - 400
  • [6] Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
    Lin, Fengyin
    Li, Mingkang
    Li, Da
    Hospedales, Timothy
    Song, Yi-Zhe
    Qi, Yonggang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23349 - 23358
  • [7] Sketch-based image retrieval using keyshapes
    Jose M. Saavedra
    Benjamin Bustos
    Multimedia Tools and Applications, 2014, 73 : 2033 - 2062
  • [8] Sketch-based image retrieval using keyshapes
    Saavedra, Jose M.
    Bustos, Benjamin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (03) : 2033 - 2062
  • [9] Sketch-based Image Retrieval by Using Saliency
    Pan, Da
    Shi, Ping
    Li, Cuiying
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 825 - 829
  • [10] Quadruplet Networks for Sketch-Based Image Retrieval
    Seddati, Omar
    Dupont, Stephane
    Mahmoudi, Said
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 189 - 196