Robust Duality Learning for Unsupervised Visible-Infrared Person Re-Identification

被引:0
|
作者
Li, Yongxiang [1 ]
Sun, Yuan [1 ]
Qin, Yang [1 ]
Peng, Dezhong [1 ,2 ]
Peng, Xi [1 ]
Hu, Peng [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Natl Innovat New Vis UHD Video Technol Co, Chengdu 610095, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise measurement; Noise; Overfitting; Adaptation models; Training; Predictive models; Semantics; Interference; Robustness; Optimization; Unsupervised VI-ReID; pseudo-label noise; noise correspondence; cluster consistency;
D O I
10.1109/TIFS.2025.3536613
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Unsupervised visible-infrared person re-identification (UVI-ReID) aims at retrieving pedestrian images of the same individual across distinct modalities, presenting challenges due to the inherent heterogeneity gap and the absence of cost-prohibitive annotations. Although existing methods employ self-training with clustering-generated pseudo-labels to bridge this gap, they always implicitly assume that these pseudo-labels are predicted correctly. In practice, however, this presumption is impossible to satisfy due to the difficulty of training a perfect model let alone without any ground truths, resulting in pseudo-labeling errors. Based on the observation, this study introduces a new learning paradigm for UVI-ReID considering Pseudo-Label Noise (PLN), which encompasses three challenges: noise overfitting, error accumulation, and noisy cluster correspondence. To conquer these challenges, we propose a novel robust duality learning framework (RoDE) for UVI-ReID to mitigate the adverse impact of noisy pseudo-labels. Specifically, for noise overfitting, we propose a novel Robust Adaptive Learning mechanism (RAL) to dynamically prioritize clean samples while deprioritizing noisy ones, thus avoiding overemphasizing noise. To circumvent error accumulation of self-training, where the model tends to confirm its mistakes, RoDE alternately trains dual distinct models using pseudo-labels predicted by their counterparts, thereby maintaining diversity and avoiding collapse into noise. However, this will lead to cross-cluster misalignment between the two distinct models, not to mention the misalignment between different modalities, resulting in dual noisy cluster correspondence and thus difficult to optimize. To address this issue, a Cluster Consistency Matching mechanism (CCM) is presented to ensure reliable alignment across distinct modalities as well as across different models by leveraging cross-cluster similarities. Extensive experiments on three benchmark datasets demonstrate the effectiveness of the proposed RoDE.
引用
收藏
页码:1937 / 1948
页数:12
相关论文
共 50 条
  • [41] Cross-Modality Hierarchical Clustering and Refinement for Unsupervised Visible-Infrared Person Re-Identification
    Pang, Zhiqi
    Wang, Chunyu
    Zhao, Lingling
    Liu, Yang
    Sharma, Gaurav
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2706 - 2718
  • [42] Identity Feature Disentanglement for Visible-Infrared Person Re-Identification
    Chen, Xiumei
    Zheng, Xiangtao
    Lu, Xiaoqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [43] MIMR: Modality-Invariance Modeling and Refinement for unsupervised visible-infrared person re-identification*
    Pang, Zhiqi
    Wang, Chunyu
    Pan, Honghu
    Zhao, Lingling
    Wang, Junjie
    Guo, Maozu
    KNOWLEDGE-BASED SYSTEMS, 2024, 285
  • [44] Diverse-Feature Collaborative Progressive Learning for Visible-Infrared Person Re-Identification
    Chan, Sixian
    Meng, Weihao
    Bai, Cong
    Hu, Jie
    Chen, Shenyong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (05) : 7754 - 7763
  • [45] Margin-Based Modal Adaptive Learning for Visible-Infrared Person Re-Identification
    Zhao, Qianqian
    Wu, Hanxiao
    Zhu, Jianqing
    SENSORS, 2023, 23 (03)
  • [46] Cross-Modality Semantic Consistency Learning for Visible-Infrared Person Re-Identification
    Liu, Min
    Zhang, Zhu
    Bian, Yuan
    Wang, Xueping
    Sun, Yeqing
    Zhang, Baida
    Wang, Yaonan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 568 - 580
  • [47] Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification
    Li, Zhiyong
    Liu, Haojie
    Peng, Xiantao
    Jiang, Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3934 - 3947
  • [48] Dual-attentive cascade clustering learning for visible-infrared person re-identification
    Wang, Xianju
    Chen, Cuiqun
    Zhu, Yong
    Chen, Shuguang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19729 - 19746
  • [49] Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification
    Zhang, Yiyuan
    Zhao, Sanyuan
    Kang, Yuhao
    Shen, Jianbing
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 462 - 479
  • [50] Learning multi-granularity representation with transformer for visible-infrared person re-identification
    Feng, Yujian
    Chen, Feng
    Sun, Guozi
    Wu, Fei
    Ji, Yimu
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    PATTERN RECOGNITION, 2025, 164