Inception Convolution and Feature Fusion for Person Search

被引:4
|
作者
Ouyang, Huan [1 ,2 ]
Zeng, Jiexian [1 ,3 ]
Leng, Lu [1 ,2 ]
机构
[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China
[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China
[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China
基金
中国国家自然科学基金;
关键词
person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;
D O I
10.3390/s23041984
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Inception Convolution with Efficient Dilation Search
    Liu, Jie
    Li, Chuming
    Liang, Feng
    Lin, Chen
    Sun, Ming
    Yan, Junjie
    Ouyang, Wanli
    Xu, Dong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11481 - 11490
  • [2] FACE FEATURE RECOVERY VIA TEMPORAL FUSION FOR PERSON SEARCH
    Fan, Cheng-Yu
    Liu, Chao-Peng
    Wang, Kuan-Chun
    Jhan, Jiun-Hao
    Wang, Yu-Chiang Frank
    Chen, Jun-Cheng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1893 - 1897
  • [3] Scale Voting With Pyramidal Feature Fusion Network for Person Search
    Hong, Zheran
    Liu, Bin
    Lu, Yan
    Yin, Guojun
    Yu, Nenghai
    IEEE ACCESS, 2019, 7 : 139692 - 139702
  • [4] Bottom-Up Foreground-Aware Feature Fusion for Person Search
    Yang, Wenjie
    Li, Dangwei
    Chen, Xiaotang
    Huang, Kaiqi
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3404 - 3412
  • [5] DHFF: ROBUST MULTI-SCALE PERSON SEARCH BY DYNAMIC HIERARCHICAL FEATURE FUSION
    Lu, Yan
    Hong, Zheran
    Liu, Bin
    Li, Weihai
    Yu, Nenghai
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3935 - 3939
  • [6] A DISCRIMINATIVELY LEARNED FEATURE EMBEDDING BASED ON MULTI-LOSS FUSION FOR PERSON SEARCH
    Liu, Hong
    Shi, Wei
    Huang, Weipeng
    Guan, Qiao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1668 - 1672
  • [7] Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search
    Yang, Wenjie
    Huang, Houjing
    Chen, Xiaotang
    Huang, Kaiqi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 262 - 274
  • [8] A Cascaded Inception of Inception Network with Attention Modulated Feature Fusion for Human Pose Estimation
    Liu, Wentao
    Chen, Jie
    Li, Cheng
    Qian, Chen
    Chu, Xiao
    Hu, Xiaolin
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7170 - 7177
  • [9] Boosting person ReID feature extraction via dynamic convolution
    Akbaba, Elif Ecem
    Gurkan, Filiz
    Gunsel, Bilge
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [10] Enhanced Deep Feature Representation for Person Search
    Yang, Jinfu
    Wang, Meijie
    Li, Mingai
    Zhang, Jingling
    COMPUTER VISION, PT III, 2017, 773 : 315 - 327