Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification

Cited by: 18
Authors
Li, Ming [1]
Liu, Jun [2]
Zheng, Ce [3]
Huang, Xinming [4]
Zhang, Ziming [4]
Affiliations
[1] Natl Univ Singapore, Inst Data Sci, Singapore 119077, Singapore
[2] Singapore Univ Technol & Design, Informat Syst Technol & Design, Singapore 487372, Singapore
[3] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
[4] Worcester Polytech Inst, Dept Elect & Comp Engn, Worcester, MA 01609 USA
Keywords
Transformers; Correlation; Feature extraction; Visualization; Training; Benchmark testing; Task analysis; Correlation exploiting; multi-view learning; transformer; vehicle re-identification
DOI
10.1109/TMM.2021.3134839
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Image-based vehicle re-identification (ReID) has witnessed much progress in recent years. However, most existing works struggle to extract robust yet discriminative features from a single image to represent one vehicle instance. We argue that images taken from distinct viewpoints, e.g., front and back, have significantly different appearances and patterns for recognition. To identify each vehicle, these models have to capture consistent "ID codes" from entirely different views, which makes learning difficult. Additionally, we claim that part-level correspondences among views, i.e., various vehicle parts observed in the same image and the same part visible from different viewpoints, contribute to instance-level feature learning as well. Motivated by these observations, we propose to extract comprehensive vehicle instance representations from multiple views by modelling part-wise correlations. To this end, we present an efficient transformer-based framework that exploits both inner- and inter-view correlations for vehicle ReID. Specifically, we first adopt a convnet encoder to condense a series of patch embeddings from each view. Then our efficient transformer, which uses a distillation token and a noise token in addition to a regular classification token, is constructed to make these patch embeddings interact with each other regardless of whether they come from the same view or from different views. We conduct extensive experiments on widely used vehicle ReID benchmarks, and our approach achieves state-of-the-art performance, demonstrating the effectiveness of our method.
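The token layout described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the patch embeddings are random stand-ins for the convnet encoder output, the three special tokens are random vectors rather than learned ones, attention uses a single head with identity projections, and the "efficient" attention mechanism is not reproduced. All names (`build_sequence`, `self_attention`, `DIM`, and so on) are hypothetical. The only point shown is that patches from every view share one sequence, so each token can attend to patches from its own view and from other views.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product attention with identity projections
    (an assumption made to keep the sketch short)."""
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    weights = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in tokens]
        weights.append(softmax(scores))
    outputs = [
        [sum(w * v[d] for w, v in zip(row, tokens)) for d in range(dim)]
        for row in weights
    ]
    return outputs, weights


def build_sequence(views, special_tokens):
    """Prepend the special tokens, then flatten the patches of all views
    into one sequence so that views are mixed by attention."""
    sequence = [list(t) for t in special_tokens]
    for patches in views:
        sequence.extend(list(p) for p in patches)
    return sequence


rng = random.Random(0)
DIM = 8               # hypothetical embedding size
NUM_VIEWS = 2         # e.g. a front view and a back view
PATCHES_PER_VIEW = 4  # hypothetical number of patch embeddings per view

# Random stand-ins for the patch embeddings a convnet encoder would produce.
views = [
    [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(PATCHES_PER_VIEW)]
    for _ in range(NUM_VIEWS)
]
# Classification, distillation, and noise tokens (random here, learned in practice).
special = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(3)]

sequence = build_sequence(views, special)
outputs, weights = self_attention(sequence)

# The classification token's output serves as the instance-level feature here.
instance_feature = outputs[0]
```

In this sketch, row 0 of `weights` holds the classification token's attention over all 11 tokens, including the patches of both views, which is the inner- and inter-view interaction the abstract refers to.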
Pages: 919-929
Number of pages: 11