Text-Based Face Retrieval: Methods and Challenges

Authors
Deng, Yuchuan [1]
Zhao, Qijun [1]
Hu, Zhanpeng [1]
Xu, Zixiang [1]
Affiliations
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
Source
Keywords
Text-based Face Retrieval; Visual-Language Pre-training
DOI
10.1007/978-981-99-8565-4_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Previous research on face retrieval has concentrated on image-based queries. In this paper, we focus on retrieving faces from a database using text queries, a task with significant potential for practical applications in public security and multimedia. Our approach employs a vision-language pre-training model as the backbone, incorporating contrastive learning, image-text matching, and masked language modeling tasks. It further employs a coarse-to-fine retrieval strategy to improve the accuracy of text-based face retrieval. We present the CelebA-Text-Identity dataset, comprising 202,599 facial images of 10,178 unique identities, each paired with a textual description. Our experimental results on CelebA-Text-Identity demonstrate the inherent challenges of text-based face retrieval. We expect that the proposed benchmark will encourage the advancement of biometric retrieval techniques and expand the range of applications for text-image retrieval technology.
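The abstract names a coarse-to-fine retrieval strategy without giving its details. The sketch below illustrates one common reading of that idea, and is an assumption rather than the authors' method: a cheap embedding similarity shortlists candidates, and a more expensive matching score reranks only the shortlist. The function names (`cosine`, `coarse_to_fine`), the `top_k` parameter, and the toy embeddings and scores are all hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def coarse_to_fine(
    text_emb: Sequence[float],
    face_embs: Sequence[Sequence[float]],
    fine_score: Callable[[int], float],
    top_k: int = 2,
) -> List[Tuple[int, float]]:
    """Return (face index, fine score) pairs for the shortlist, best first."""
    # Coarse stage: rank every face by embedding similarity to the text query.
    ranked = sorted(
        range(len(face_embs)),
        key=lambda i: cosine(text_emb, face_embs[i]),
        reverse=True,
    )
    shortlist = ranked[:top_k]
    # Fine stage: rerank only the shortlist with the costlier matching score
    # (hypothetical stand-in for an image-text matching head).
    scored = [(i, fine_score(i)) for i in shortlist]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data, invented for illustration only.
text_emb = [1.0, 0.0]
face_embs = [[0.9, 0.1], [0.0, 1.0], [0.8, 0.3]]
fine_scores = {0: 0.4, 1: 0.9, 2: 0.7}

result = coarse_to_fine(text_emb, face_embs, lambda i: fine_scores[i], top_k=2)
print(result)
```

In this toy run, face 1 never reaches the fine stage even though its fine score is the highest, which shows the trade-off the shortlist size controls: a smaller `top_k` saves computation but can discard candidates the finer model would have preferred.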
Pages: 150-159
Number of pages: 10