A review on video person re-identification based on deep learning

Cited by: 3
Authors
Ma, Haifei [1, 3]
Zhang, Canlong [1, 2]
Zhang, Yifeng [1 ]
Li, Zhixin [1, 2]
Wang, Zhiwen [4 ]
Wei, Chunrong [1 ]
Affiliations
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Peoples R China
[3] Guangdong Univ Sci & Technol, Dongguan, Peoples R China
[4] Guangxi Univ Sci & Technol, Sch Elect Engn, Liuzhou, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Video-based person ReID; Temporal learning; Literature survey and perspectives; Attention mechanism; Convolutional neural network; UNSUPERVISED DOMAIN ADAPTATION; NEURAL-NETWORK;
DOI
10.1016/j.neucom.2024.128479
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Person re-identification (ReID) is an essential technology for matching a person across non-overlapping cameras. It has attracted increasing attention in recent years because of its wide range of real-world applications, such as security surveillance and criminal investigation. Unlike other person ReID tasks, video-based ReID uses a video clip as the retrieval input, which can yield better ReID performance because video carries rich information on appearance, motion cues, and pose variations over time. Over the last few years, many deep learning-based video person ReID methods have been proposed to address challenges such as illumination variation, complex backgrounds, and occlusion. To provide a more comprehensive and readable review of existing video-based person ReID methods, we propose a novel taxonomy that views existing methods from four perspectives: data, algorithms, computing power, and applications. Specifically, we first introduce popular datasets and evaluation criteria used for video-based person ReID. Next, from the perspective of limited data and scarce annotation, we introduce data augmentation and unsupervised learning for ReID. From the algorithm perspective, we review supervised methods, including spatial feature learning, temporal feature learning, and spatio-temporal feature learning, and then discuss and systematically compare these approaches. From the perspective of complex open-world applications, we summarize domain adaptation and multimodal ReID. From the perspective of insufficient GPU computing power, we discuss modality-agnostic unified large-scale ReID and techniques for making it lightweight. Finally, we discuss open problems and potential research directions for the community.
Pages: 17
Related papers
50 in total
  • [41] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
    Zhang, Wei
    Yu, Xiaodong
    He, Xuanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
  • [42] Sequences consistency feature learning for video-based person re-identification
    Zhao, Kai
    Cheng, Deqiang
    Kou, Qiqi
    Li, Jiahan
    Liu, Ruihang
    ELECTRONICS LETTERS, 2022, 58 (04) : 142 - 144
  • [43] Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification
    Zhang, Wei
    He, Xuanyu
    Lu, Weizhi
    Qiao, Hong
    Li, Yibin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) : 3847 - 3852
  • [44] Person Re-identification by Video Ranking
    Wang, Taiqing
    Gong, Shaogang
    Zhu, Xiatian
    Wang, Shengjin
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 688 - 703
  • [45] Video-based person re-identification based on regularised hull distance learning
    Xu, Xiaoyue
    Chen, Ying
    IET COMPUTER VISION, 2019, 13 (04) : 385 - 394
  • [46] Anchor Association Learning for Unsupervised Video Person Re-Identification
    Zeng, Shujun
    Wang, Xueping
    Liu, Min
    Liu, Qing
    Wang, Yaonan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1013 - 1024
  • [47] Learning Intra-Video Difference for Person Re-Identification
    Zhang, Wei
    Li, Yimeng
    Lu, Weizhi
    Xu, Xinshun
    Liu, Zhaowei
    Ji, Xiangyang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 3028 - 3036
  • [48] Adaptation and Re-Identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-Identification
    Li, Yu-Jhe
    Yang, Fu-En
    Liu, Yen-Cheng
    Yeh, Yu-Ying
    Du, Xiaofei
    Wang, Yu-Chiang Frank
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 285 - 291
  • [49] Adaptive Graph Representation Learning for Video Person Re-Identification
    Wu, Yiming
    Bourahla, Omar El Farouk
    Li, Xi
    Wu, Fei
    Tian, Qi
    Zhou, Xue
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 8821 - 8830
  • [50] Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework
    Ouyang, Deqiang
    Shao, Jie
    Zhang, Yonghui
    Yang, Yang
    Shen, Heng Tao
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1562 - 1570