Text-Based Face Retrieval: Methods and Challenges

Authors
Deng, Yuchuan [1]
Zhao, Qijun [1]
Hu, Zhanpeng [1]
Xu, Zixiang [1]
Affiliation
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
Source
Keywords
Text-based Face Retrieval; Visual-Language Pre-training
DOI
10.1007/978-981-99-8565-4_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Previous research on face retrieval has concentrated on image-based queries. In this paper, we focus on retrieving faces from a database using text queries, a task with significant potential for practical applications in public security and multimedia. Our approach employs a vision-language pre-training model as the backbone, incorporating contrastive learning, image-text matching learning, and masked language modeling tasks. Furthermore, it employs a coarse-to-fine retrieval strategy to enhance the accuracy of text-based face retrieval. We present the CelebA-Text-Identity dataset, comprising 202,599 facial images of 10,178 unique identities, each paired with an accompanying textual description. The experimental results on CelebA-Text-Identity demonstrate the inherent challenges of text-based face retrieval. We expect that our proposed benchmark will encourage the advancement of biometric retrieval techniques and expand the range of applications for text-image retrieval technology.
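The abstract names a coarse-to-fine retrieval strategy but does not spell out its steps. The following is a minimal sketch of one common reading, not the authors' implementation: a cheap embedding similarity shortlists the top-k gallery faces, and a costlier matching score reranks only that shortlist. The function names, the 2-D toy embeddings, and the `itm_scores` table are all hypothetical stand-ins for a real text/image encoder and an image-text matching head.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def coarse_to_fine(
    query_vec: Vector,
    gallery_vecs: List[Vector],
    fine_score: Callable[[int], float],
    k: int = 3,
) -> List[Tuple[int, float]]:
    """Return up to k (gallery_index, fine_score) pairs, best first."""
    # Coarse stage: rank the whole gallery by embedding similarity, keep top-k.
    shortlist = sorted(
        range(len(gallery_vecs)),
        key=lambda i: cosine(query_vec, gallery_vecs[i]),
        reverse=True,
    )[:k]
    # Fine stage: rescore only the shortlist with the costlier matcher.
    reranked = [(i, fine_score(i)) for i in shortlist]
    reranked.sort(key=lambda pair: pair[1], reverse=True)
    return reranked


# Toy data (assumed): 2-D embeddings for four gallery faces and one text query.
gallery = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0), (0.7, 0.7)]
query = (1.0, 0.05)
# Assumed fine-grained scores, standing in for an image-text matching head.
itm_scores = {0: 0.40, 1: 0.95, 2: 0.99, 3: 0.10}

result = coarse_to_fine(query, gallery, lambda i: itm_scores[i], k=2)
print(result)
```

In this toy run the coarse stage keeps faces 0 and 1, and the fine stage swaps their order. Face 2 has the highest fine score but never reaches the second stage, which shows the trade-off in this design: the shortlist size k bounds both the reranking cost and the best achievable recall.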
Pages: 150-159
Number of pages: 10