An Overview of Text-based Person Search: Recent Advances and Future Directions

Authors
Niu K. [1]
Liu Y. [1]
Long Y. [1]
Huang Y. [3]
Wang L. [3]
Zhang Y. [1]
Affiliations
[1] Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing
Funding
National Natural Science Foundation of China
Keywords
Benchmark testing; cross-modal retrieval; feature extraction; Pedestrians; semantic alignments; Semantics; Task analysis; Text-based person search; Training; video surveillance; Visualization
DOI
10.1109/TCSVT.2024.3376373
Abstract
Owing to its practical significance in smart video surveillance systems, Text-Based Person Search (TBPS), which retrieves images of a pedestrian of interest given a natural-language description, has recently become a research hotspot. To help researchers quickly grasp the developments in this task, we comprehensively summarize recent research advances in TBPS from two perspectives, i.e., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, FE mainly consists of pre-processing approaches and end-to-end frameworks, and SA can be broadly divided into cross-modal attention mechanisms, non-attention alignments, training objectives, and generative approaches. We then describe four widely used benchmarks and the evaluation criterion for TBPS, and provide comparisons and analyses of state-of-the-art (SOTA) solutions on these large-scale benchmarks. Finally, we point out future research directions that still need to be addressed to facilitate the practical application of TBPS.
Pages: 1 / 1