Relation-aware aggregation network with auxiliary guidance for text-based person search

被引:0
|
作者
Zeng, Pengpeng [1 ]
Jing, Shuaiqi [1 ]
Song, Jingkuan [1 ,2 ]
Fan, Kaixuan [1 ]
Li, Xiangpeng [1 ]
We, Liansuo [2 ]
Guo, Yuan [2 ]
机构
[1] University of Electronic Science and Technology of China, Chengdu, China
[2] Qiqihar University, Qiqihar, China
关键词
Machine learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel Relation-aware Aggregation Network with Auxiliary Guidance for text-based person search, namely RANAG. Existing works are still hard to capture the detailed appearance of a person and compute the similarity between images and texts. RANAN is designed to address the above problem from two aspects: relation-aware visual and additional auxiliary signal. Specifically, we introduce a Relation-aware Aggregation Network (RAN) that exploits the relation between the person and local objects. Then, we propose three auxiliary tasks to acquire additional knowledge of semantic representations. Each task has a respective objective: identifying the gender of the pedestrian in the image, distinguishing the images of the similar pedestrian, and aligning the semantic information between description and image. In addition, some data augmentation methods we explored can further improve the performance. Extensive experiments demonstrate that our model provides superior performance than the state-of-the-art methods on the CUHK-PEDES dataset. © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:1565 / 1582
相关论文
共 50 条
  • [21] Text-based Person Search without Parallel Image-Text Data
    Bai, Yang
    Wang, Jingyao
    Cao, Min
    Chen, Chen
    Cao, Ziqiang
    Nie, Liqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 757 - 767
  • [22] Text-Guided Visual Feature Refinement for Text-Based Person Search
    Gao, Liying
    Niu, Kai
    Ma, Zehong
    Jiao, Bingliang
    Tan, Tonghao
    Wang, Peng
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 118 - 126
  • [23] Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search
    Jing, Ya
    Si, Chenyang
    Wang, Junbo
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11189 - 11196
  • [24] VGSG: Vision-Guided Semantic-Group Network for Text-Based Person Search
    He, Shuting
    Luo, Hao
    Jiang, Wei
    Jiang, Xudong
    Ding, Henghui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 163 - 176
  • [25] Relation-aware non-local attention network for person re-identification
    Li, Sujuan
    Xie, Gengsheng
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [26] Person Re-Identification Using Local Relation-Aware Graph Convolutional Network
    Lian, Yu
    Huang, Wenmin
    Liu, Shuang
    Guo, Peng
    Zhang, Zhong
    Durrani, Tariq S.
    SENSORS, 2023, 23 (19)
  • [27] Asymmetric Cross-Scale Alignment for Text-Based Person Search
    Ji, Zhong
    Hu, Junhua
    Liu, Deyin
    Wu, Lin Yuanbo
    Zhao, Ye
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7699 - 7709
  • [28] Feature semantic alignment and information supplement for Text-based person search
    Zhou, Hang
    Li, Fan
    Tian, Xuening
    Huang, Yuling
    FRONTIERS IN PHYSICS, 2023, 11
  • [29] A Self-Supervised Learning Approach for Text-Based Person Search
    Ji Z.
    Hu J.
    Ding X.
    Li S.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (02): : 169 - 176
  • [30] A Simple and Robust Correlation Filtering Method for Text-Based Person Search
    Suo, Wei
    Sun, Mengyang
    Niu, Kai
    Gao, Yiqi
    Wang, Peng
    Zhang, Yanning
    Wu, Qi
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 726 - 742