Inception Convolution and Feature Fusion for Person Search

被引：4

作者：

Ouyang, Huan ^{[1
,2
]}

Zeng, Jiexian ^{[1
,3
]}

Leng, Lu ^{[1
,2
]}

机构：

[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China

[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China

[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 04期

基金：

中国国家自然科学基金;

关键词：

person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;

D O I：

10.3390/s23041984

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.

引用

页数：20

共 50 条

[1] Inception Convolution with Efficient Dilation Search
Liu, Jie
Li, Chuming
Liang, Feng
Lin, Chen
Sun, Ming
Yan, Junjie
Ouyang, Wanli
Xu, Dong
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11481 - 11490
[2] FACE FEATURE RECOVERY VIA TEMPORAL FUSION FOR PERSON SEARCH
Fan, Cheng-Yu
Liu, Chao-Peng
Wang, Kuan-Chun
Jhan, Jiun-Hao
Wang, Yu-Chiang Frank
Chen, Jun-Cheng
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1893 - 1897
[3] Scale Voting With Pyramidal Feature Fusion Network for Person Search
Hong, Zheran
Liu, Bin
Lu, Yan
Yin, Guojun
Yu, Nenghai
IEEE ACCESS, 2019, 7 : 139692 - 139702
[4] Bottom-Up Foreground-Aware Feature Fusion for Person Search
Yang, Wenjie
Li, Dangwei
Chen, Xiaotang
Huang, Kaiqi
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3404 - 3412
[5] DHFF: ROBUST MULTI-SCALE PERSON SEARCH BY DYNAMIC HIERARCHICAL FEATURE FUSION
Lu, Yan
Hong, Zheran
Liu, Bin
Li, Weihai
Yu, Nenghai
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3935 - 3939
[6] A DISCRIMINATIVELY LEARNED FEATURE EMBEDDING BASED ON MULTI-LOSS FUSION FOR PERSON SEARCH
Liu, Hong
Shi, Wei
Huang, Weipeng
Guan, Qiao
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1668 - 1672
[7] Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search
Yang, Wenjie
Huang, Houjing
Chen, Xiaotang
Huang, Kaiqi
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 262 - 274
[8] A Cascaded Inception of Inception Network with Attention Modulated Feature Fusion for Human Pose Estimation
Liu, Wentao
Chen, Jie
Li, Cheng
Qian, Chen
Chu, Xiao
Hu, Xiaolin
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7170 - 7177
[9] Boosting person ReID feature extraction via dynamic convolution
Akbaba, Elif Ecem
Gurkan, Filiz
Gunsel, Bilge
PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
[10] Enhanced Deep Feature Representation for Person Search
Yang, Jinfu
Wang, Meijie
Li, Mingai
Zhang, Jingling
COMPUTER VISION, PT III, 2017, 773 : 315 - 327

← 1 2 3 4 5 →