An Overview of Text-based Person Search: Recent Advances and Future Directions

被引:0
|
作者
Niu K. [1 ]
Liu Y. [1 ]
Long Y. [1 ]
Huang Y. [3 ]
Wang L. [3 ]
Zhang Y. [1 ]
机构
[1] Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing
基金
中国国家自然科学基金;
关键词
Benchmark testing; cross-modal retrieval; feature extraction; Feature extraction; Pedestrians; semantic alignments; Semantics; Task analysis; Text-based person search; Training; video surveillance; Visualization;
D O I
10.1109/TCSVT.2024.3376373
中图分类号
学科分类号
摘要
Due to the practical significance in smart video surveillance systems, Text-Based Person Search (TBPS) has been one of the research hotspots recently, which refers to searching for the interested pedestrian images given natural language sentences. To help researchers quickly grasp the developments of this important task, we comprehensively summarize the recent research advances of TBPS from two perspectives, <italic>i.e</italic>., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, the FE mainly consists of pre-processing approaches and end-to-end frameworks, and the SA could be briefly divided into cross-modal attention mechanism, non-attention alignments, training objectives, and generative approaches. Afterwards, we elaborate four widely-used benchmarks and also the evaluation criterion for TBPS. And comparisons and analyses among the state-of-the-art (SOTA) solutions are provided based on these large-scale benchmarks. At last, we point out some future research directions that need to be further addressed, which will greatly facilitate the practical applications of TBPS. IEEE
引用
收藏
页码:1 / 1
相关论文
共 50 条
  • [1] An Empirical Study of CLIP for Text-Based Person Search
    Cao, Min
    Bai, Yang
    Zeng, Ziyin
    Ye, Mang
    Zhang, Min
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 465 - 473
  • [2] On Text-based Personality Computing: Challenges and Future Directions
    Fang, Qixiang
    Giachanou, Anastasia
    Bagheri, Ayoub
    Boeschoten, Laura
    Van Kesteren, Erik-Jan
    Kamalabad, Mandi Shafiee
    Oberski, Daniel L.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10861 - 10879
  • [3] Diverse Person: Customize Your Own Dataset for Text-Based Person Search
    Song, Zifan
    Hu, Guosheng
    Zhao, Cairong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4943 - 4951
  • [4] Local-enhanced representation for text-based person search
    Zhang, Guoqing
    Chen, Yuhao
    Zheng, Yuhui
    Martin, Gaven
    Wang, Ruili
    PATTERN RECOGNITION, 2025, 161
  • [5] Text-based Person Search via Virtual Attribute Learning
    Wang C.-J.
    Su J.-W.
    Luo Z.-M.
    Cao D.-L.
    Lin Y.-J.
    Li S.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2035 - 2050
  • [6] Hierarchical Gumbel Attention Network for Text-based Person Search
    Zheng, Kecheng
    Liu, Wu
    Liu, Jiawei
    Zha, Zheng-Jun
    Mei, Tao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3441 - 3449
  • [7] An Adaptive Correlation Filtering Method for Text-Based Person Search
    Sun, Mengyang
    Suo, Wei
    Wang, Peng
    Niu, Kai
    Liu, Le
    Lin, Guosheng
    Zhang, Yanning
    Wu, Qi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4440 - 4455
  • [8] Noise correspondence with evidence learning for text-based person search
    Xie, Yihan
    Zhang, Baohua
    Li, Yang
    Shan, Chongrui
    Wang, Shun
    Zhang, Jiale
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05):
  • [9] Sponsored Search Auctions: Recent Advances and Future Directions
    Qin, Tao
    Chen, Wei
    Liu, Tie-Yan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 5 (04)
  • [10] Conditional Feature Learning Based Transformer for Text-Based Person Search
    Gao, Chenyang
    Cai, Guanyu
    Jiang, Xinyang
    Zheng, Feng
    Zhang, Jun
    Gong, Yifei
    Lin, Fangzhou
    Sun, Xing
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6097 - 6108