Cross-Modality Person Re-Identification with Generative Adversarial Training

被引:0
|
作者
Dai, Pingyang [1 ,2 ]
Ji, Rongrong [1 ,2 ]
Wang, Haibin [1 ,2 ]
Wu, Qiong [2 ]
Huang, Yuyu [1 ,2 ]
机构
[1] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China
[2] Xiamen Univ, Sch Informat Sci & Engn, Xiamen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is an important task in video surveillance which automatically searches and identifies people across different cameras. Despite the extensive Re-ID progress in RGB cameras, few works have studied the Re-ID between infrared and RGB images, which is essentially a cross-modality problem and widely encountered in real-world scenarios. The key challenge lies in two folds, i.e., the lack of discriminative information to re-identify the same person between RGB and infrared modalities, and the difficulty to learn a robust metric for such a large-scale cross-modality retrieval. In this paper, we tackle the above two challenges by proposing a novel cross-modality generative adversarial network (termed cmGAN). To handle the lack of insufficient discriminative information, we design a cutting-edge generative adversarial training based discriminator to learn discriminative feature representation from different modalities. To handle the issue of largescale cross-modality metric learning, we integrate both identification loss and cross-modality triplet loss, which minimize inter-class ambiguity while maximizing cross-modality similarity among instances. The entire cmGAN can be trained in an end-to-end manner by using standard deep neural network framework. We have quantized the performance of our work in the newly-released SYSU RGB-IR Re-ID benchmark, and have reported superior performance, i.e., Cumulative Match Characteristic curve (CMC) and Mean Average Precision (MAP), over the state-of-the-art works [Wu et al., 2017], at least 12.17% and 11.85% respectively.
引用
收藏
页码:677 / 683
页数:7
相关论文
共 50 条
  • [41] Two-stage Metric Learning for Cross-Modality Person Re-Identification
    Wang, Jiabao
    Jiao, ShanShan
    Li, Yang
    Miao, Zhuang
    PROCEEDINGS OF 2020 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2020), 2020, : 28 - 32
  • [42] RGB-IR Person Re-identification by Cross-Modality Similarity Preservation
    Wu, Ancong
    Zheng, Wei-Shi
    Gong, Shaogang
    Lai, Jianhuang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1765 - 1785
  • [43] Dual-alignment Feature Embedding for Cross-modality Person Re-identification
    Hao, Yi
    Wang, Nannan
    Gao, Xinbo
    Li, Jie
    Wang, Xiaoyu
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 57 - 65
  • [44] Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification
    Jiang, Yan
    Cheng, Xu
    Yu, Hao
    Liu, Xingyu
    Chen, Haoyu
    Zhao, Guoying
    COMPUTER VISION - ECCV 2024, PT LXXII, 2025, 15130 : 289 - 306
  • [45] Visible-infrared cross-modality person re-identification based on whole-individual training
    Sun, Jia
    Li, Yanfeng
    Chen, Houjin
    Peng, Yahui
    Zhu, Xiaodi
    Zhu, Jinlei
    NEUROCOMPUTING, 2021, 440 : 1 - 11
  • [46] The Multi-Layer Constrained Loss for Cross-Modality Person Re-Identification
    Sun, Zhanrui
    Zhu, Yongxin
    Song, Shijin
    Hou, Junjie
    Du, Sen
    Song, Yuefeng
    2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
  • [47] Cross-modality consistency learning for visible-infrared person re-identification
    Shao, Jie
    Tang, Lei
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [48] RGB-IR Person Re-identification by Cross-Modality Similarity Preservation
    Ancong Wu
    Wei-Shi Zheng
    Shaogang Gong
    Jianhuang Lai
    International Journal of Computer Vision, 2020, 128 : 1765 - 1785
  • [49] Bridge Gap in Pixel and Feature Level for Cross-Modality Person Re-Identification
    Ling, Yongguo
    Zhong, Zhun
    Luo, Zhiming
    Li, Shaozi
    Sebe, Nicu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5104 - 5117
  • [50] Cross-Modality Person Re-Identification Method with Joint-Modality Generation and Feature Enhancement
    Bi, Yihan
    Wang, Rong
    Zhou, Qianli
    Zeng, Zhaolong
    Lin, Ronghui
    Wang, Mingjie
    ENTROPY, 2024, 26 (08)