Cross-Modality Person Re-Identification with Generative Adversarial Training

被引:0
|
作者
Dai, Pingyang [1 ,2 ]
Ji, Rongrong [1 ,2 ]
Wang, Haibin [1 ,2 ]
Wu, Qiong [2 ]
Huang, Yuyu [1 ,2 ]
机构
[1] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China
[2] Xiamen Univ, Sch Informat Sci & Engn, Xiamen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is an important task in video surveillance which automatically searches and identifies people across different cameras. Despite the extensive Re-ID progress in RGB cameras, few works have studied the Re-ID between infrared and RGB images, which is essentially a cross-modality problem and widely encountered in real-world scenarios. The key challenge lies in two folds, i.e., the lack of discriminative information to re-identify the same person between RGB and infrared modalities, and the difficulty to learn a robust metric for such a large-scale cross-modality retrieval. In this paper, we tackle the above two challenges by proposing a novel cross-modality generative adversarial network (termed cmGAN). To handle the lack of insufficient discriminative information, we design a cutting-edge generative adversarial training based discriminator to learn discriminative feature representation from different modalities. To handle the issue of largescale cross-modality metric learning, we integrate both identification loss and cross-modality triplet loss, which minimize inter-class ambiguity while maximizing cross-modality similarity among instances. The entire cmGAN can be trained in an end-to-end manner by using standard deep neural network framework. We have quantized the performance of our work in the newly-released SYSU RGB-IR Re-ID benchmark, and have reported superior performance, i.e., Cumulative Match Characteristic curve (CMC) and Mean Average Precision (MAP), over the state-of-the-art works [Wu et al., 2017], at least 12.17% and 11.85% respectively.
引用
收藏
页码:677 / 683
页数:7
相关论文
共 50 条
  • [21] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
    Jiang, Kongzhu
    Zhang, Tianzhu
    Liu, Xiang
    Qian, Bingqiao
    Zhang, Yongdong
    Wu, Feng
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
  • [22] Co-segmentation assisted cross-modality person re-identification
    Huang, Nianchang
    Xing, Baichao
    Zhang, Qiang
    Han, Jungong
    Huang, Jin
    INFORMATION FUSION, 2024, 104
  • [23] Cross-modality person re-identification using hybrid mutual learning
    Zhang, Zhong
    Dong, Qing
    Wang, Sen
    Liu, Shuang
    Xiao, Baihua
    Durrani, Tariq S.
    IET COMPUTER VISION, 2023, 17 (01) : 1 - 12
  • [24] Triplet interactive attention network for cross-modality person re-identification
    Zhang, Chenrui
    Chen, Ping
    Lei, Tao
    Meng, Hongying
    PATTERN RECOGNITION LETTERS, 2021, 152 : 202 - 209
  • [25] Deep feature learning with attributes for cross-modality person re-identification
    Zhang, Shikun
    Chen, Changhong
    Song, Wanru
    Gan, Zongliang
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (03)
  • [26] Leaning compact and representative features for cross-modality person re-identification
    Gao, Guangwei
    Shao, Hao
    Wu, Fei
    Yang, Meng
    Yu, Yi
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1649 - 1666
  • [27] Cross-modality person re-identification via modality-synergy alignment learning
    Lin, Yuju
    Wang, Banghai
    MACHINE VISION AND APPLICATIONS, 2024, 35 (06)
  • [28] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Li, Yidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444
  • [29] Rethinking Shared Features and Re-ranking for Cross-Modality Person Re-identification
    Jiang, Na
    Wang, Zhaofa
    Xu, Peng
    Wu, Xinyue
    Zhang, Lei
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 305 - 317
  • [30] Cross-modality average precision optimization for visible thermal person re-identification
    Ling, Yongguo
    Luo, Zhiming
    Lin, Dazhen
    Li, Shaozi
    Jiang, Min
    Sebe, Nicu
    Zhong, Zhun
    PATTERN RECOGNITION, 2025, 164