Inception Convolution and Feature Fusion for Person Search

Cited by: 4
Authors
Ouyang, Huan [1, 2]
Zeng, Jiexian [1, 3]
Leng, Lu [1, 2]
Affiliations
[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China
[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China
[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;
DOI
10.3390/s23041984
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302; 081704;
Abstract
With the rapid advancement of deep learning theory and hardware computing capacity, computer vision tasks such as object detection and instance segmentation have progressed dramatically in recent years. As a result, highly challenging integrated tasks such as person search can now develop quickly. Most efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, re-ID performance can be significantly degraded by the single-layer, low-resolution feature maps extracted during pedestrian detection, which occasionally miss detections. To address these issues, this paper proposes a person search method based on an inception convolution and feature fusion module (IC-FFM), using Seq-Net (Sequential End-to-end Network) as the baseline. First, we replace the general convolution in ResNet-50 with a new inception convolution module (ICM), allowing the convolution operation to allocate channels effectively and dynamically. Then, to improve the accuracy of information extraction, a feature fusion module (FFM) combines multi-level information from different convolution levels. Finally, bounding-box regression is performed using convolution and the double-head module (DHM), which considerably improves the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on the CUHK-SYSU and PRW datasets show that our method achieves higher accuracy than Seq-Net. In addition, our method is simple and can be easily integrated into existing two-stage frameworks.
Pages: 20