Cross-scale global attention feature pyramid network for person search

被引:3
|
作者
Li, Yang [1 ,2 ]
Xu, Huahu [2 ,3 ]
Bian, Minjie [3 ]
Xiao, Junsheng [2 ]
机构
[1] Shanghai Jianqiao Univ, Sch Informat Technol, Shanghai 201306, Peoples R China
[2] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Informat Off, Shanghai 200444, Peoples R China
关键词
Person search; Global attention; Feature pyramid network; Multi-scale; Fine-grained;
D O I
10.1016/j.imavis.2021.104332
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search aims to locate the target person in real unconstrained scene images. It faces many challenges such as multi-scale and fine-grained. To address the challenges, a novel cross-scale global attention feature pyramid network (CSGAFPN) is proposed. Firstly, we design a novel multi-head global attention module (MHGAM), which adopts cosine similarity and sparse query location methods to effectively capture cross-scale long-distance dependence. Then, we design the CSGAFPN, which extends top-down feature pyramid network with bottom-up connections and embeds MHGAMs to the connections. CSGAFPN can capture cross-scale long-distance global correlation from multi-scale feature maps, selectively strengthen important features and restrain less important features. CSGAFPN is applied for both person detection and person re-identification (reID) sub-tasks of person search, it can well handle the multi-scale and fine-grained challenges, and significantly improve person search performance. Furthermore, the output multi-scale feature maps of CSGAFPN are processed by an adaptive feature aggregation with attention (AFAA) layer to further improve the performance. Numerous exper-iments with two public person search datasets, CUHK-SYSU and PRW, show our CSGAFPN based approach ac-quires better performance than other state-of-the-art (SOTA) person search approaches. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Infrared Pedestrian Detection Based on Cross-scale Feature Aggregation and Hierarchical Attention Mapping
    Hao Shuai
    Gao Shan
    Ma Xu
    An Beiyi
    He Tian
    Wen Hu
    Wang Feng
    ACTA PHOTONICA SINICA, 2022, 51 (06)
  • [22] Dual Attention Based Feature Pyramid Network
    Xing, Huijun
    Wang, Shuai
    Zheng, Dezhi
    Zhao, Xiaotong
    CHINA COMMUNICATIONS, 2020, 17 (08) : 242 - 252
  • [23] Pyramid Feature Attention Network for Saliency detection
    Zhao, Ting
    Wu, Xiangqian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3080 - 3089
  • [24] Dual Attention Based Feature Pyramid Network
    Huijun Xing
    Shuai Wang
    Dezhi Zheng
    Xiaotong Zhao
    中国通信, 2020, 17 (08) : 242 - 252
  • [25] LCFFNet: A Lightweight Cross-scale Feature Fusion Network for human pose estimation
    Zou, Xuelian
    Bi, Xiaojun
    NEURAL NETWORKS, 2025, 183
  • [26] CFFDist: Cross-Scale Feature Fusion Distillation Network for Industrial Anomaly Localization
    Zhi, Hui
    Qin, Hao
    Zhang, Lanning
    Guo, Jie
    Song, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [27] Multi-scale global context feature pyramid network for object detector
    Yunhao Li
    Mingwen Shao
    Bingbing Fan
    Wei Zhang
    Signal, Image and Video Processing, 2022, 16 : 705 - 713
  • [28] Multi-scale global context feature pyramid network for object detector
    Li, Yunhao
    Shao, Mingwen
    Fan, Bingbing
    Zhang, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (03) : 705 - 713
  • [29] Multi-scale feature fusion pyramid attention network for single image dehazing
    Liu, Jianlei
    Liu, Peng
    Zhang, Yuanke
    IET IMAGE PROCESSING, 2023, 17 (09) : 2726 - 2735
  • [30] Cross-Scale Correlation Stereo Network
    Yang, Chao
    Yao, Wenbin
    Li, Xiaoyong
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,