An Overview of Text-based Person Search: Recent Advances and Future Directions

Cited by: 0
Authors
Niu K. [1]
Liu Y. [1]
Long Y. [1]
Huang Y. [3]
Wang L. [3]
Zhang Y. [1]
Affiliations
[1] Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing
Funding
National Natural Science Foundation of China
Keywords
Benchmark testing; cross-modal retrieval; feature extraction; pedestrians; semantic alignments; semantics; task analysis; text-based person search; training; video surveillance; visualization
DOI
10.1109/TCSVT.2024.3376373
Abstract
Owing to its practical significance in smart video surveillance systems, Text-Based Person Search (TBPS), which retrieves images of a pedestrian of interest given a natural language description, has recently become a research hotspot. To help researchers quickly grasp developments in this important task, we comprehensively summarize recent research advances in TBPS from two perspectives, i.e., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, FE mainly consists of pre-processing approaches and end-to-end frameworks, while SA can be broadly divided into cross-modal attention mechanisms, non-attention alignments, training objectives, and generative approaches. We then describe four widely used benchmarks and the evaluation criterion for TBPS, and provide comparisons and analyses of state-of-the-art (SOTA) solutions on these large-scale benchmarks. Finally, we point out several future research directions that remain to be addressed and that would greatly facilitate the practical application of TBPS.
Pages: 1 / 1