Inception Convolution and Feature Fusion for Person Search

被引：4

作者：

Ouyang, Huan ^{[1
,2
]}

Zeng, Jiexian ^{[1
,3
]}

Leng, Lu ^{[1
,2
]}

机构：

[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China

[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China

[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 04期

基金：

中国国家自然科学基金;

关键词：

person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;

D O I：

10.3390/s23041984

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.

引用

页数：20

共 50 条

[31] Feature Fusion and Ellipse Segmentation for Person Re-identification
Qi, Meibin
Zeng, Junxian
Jiang, Jianguo
Chen, Cuiqun
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 50 - 61
[32] Feature-level data fusion for bimodal person recognition
Chibelushi, CC
Mason, JSD
Deravi, F
SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, VOL 1, 1997, (443): : 399 - 403
[33] Branch Feature Fusion Convolution Network for Remote Sensing Scene Classification
Shi, Cuiping
Wang, Tao
Wang, Liguo
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 5194 - 5210
[34] Hierarchical Feature Fusion With Mixed Convolution Attention for Single Image Dehazing
Zhang, Xiaoqin
Wang, Jinxin
Wang, Tao
Jiang, Runhua
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 510 - 522
[35] Feature Fusion Convolution-Aided Transformer for Automatic Modulation Recognition
Hu, Mutian
Ma, Jitong
Yang, Zhengyan
Wang, Jie
Lu, Jingjing
Wu, Zhanjun
IEEE COMMUNICATIONS LETTERS, 2023, 27 (10) : 2643 - 2647
[36] Data compensation and feature fusion for sketch based person retrieval
Ye, Yu
Chen, Jun
Sun, Zhihong
Mukherjee, Mithun
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
[37] Palm Vein Recognition Using Convolution Neural Network Based on Feature Fusion with HOG Feature
Kuang, Hailan
Zhong, Zhenhua
Liu, Xinhua
Ma, Xiaolin
2020 5TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2020), 2020, : 295 - 299
[38] Strong Feature Fusion Networks for Person Re-Identification
Liu Y.
Zhou C.
Li Z.
Li H.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (02): : 232 - 240
[39] Feature fusion over hyperbolic graph convolution networks for video summarisation
Wu, Guangli
Wang, Shengtao
Xu, Shipeng
IET COMPUTER VISION, 2024, 18 (01) : 150 - 164
[40] Person Re-identification Algorithm Based on the Fusion of Deep Feature and LOMO Feature
Liang, Zi
Chi, Jianning
Hua, Ruzhao
Zhang, Yue
Wang, Wenhao
ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179

← 1 2 3 4 5 →