Inception Convolution and Feature Fusion for Person Search

被引：4

作者：

Ouyang, Huan ^{[1
,2
]}

Zeng, Jiexian ^{[1
,3
]}

Leng, Lu ^{[1
,2
]}

机构：

[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China

[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China

[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 04期

基金：

中国国家自然科学基金;

关键词：

person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;

D O I：

10.3390/s23041984

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.

引用

页数：20

共 50 条

[21] Multi-feature fusion network for person reidentification
Wang, Xihe
Zhang, Yongjun
Xu, Yujie
Cui, Zhongwei
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
[22] Hybrid Feature Fusion for Person Recognition in Photo Albums
Li, Sheng
Huang, Likun
Zhang, Wei
Tang, Bing
MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
[23] Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search
You, Kaiyang
Chen, Wenjing
Wang, Chengji
Sun, Hao
Xie, Wei
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2230 - 2234
[24] Vehicle Reidentification Based on Convolution and Vision Transformer Feature Fusion
Gong, Rui
Zhang, Xue
Pan, Jianan
Guo, Jie
Nie, Xiushan
IEEE MULTIMEDIA, 2024, 31 (02) : 61 - 68
[25] An Efficient Group Convolution and Feature Fusion Method for Weed Detection
Chen, Chaowen
Zang, Ying
Jiao, Jinkang
Yan, Daoqing
Fan, Zhuorong
Cui, Zijian
Zhang, Minghua
AGRICULTURE-BASEL, 2025, 15 (01):
[26] CATrack: Convolution and Attention Feature Fusion for Visual Object Tracking
Zhang, Longkun
Wen, Jiajun
Dai, Zichen
Zhou, Rouyi
Lai, Zhihui
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 469 - 480
[27] Person Reidentification Based on Multiscale Convolutional Feature Fusion
Xu Longzhuang
Peng Li
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (14)
[28] INCEPTION OF THE RADIOLOGY FEATURE
BUONOCORE, E
CLEVELAND CLINIC QUARTERLY, 1982, 49 (04): : 269 - 269
[29] KIDBA-Net: A Multi-Feature Fusion Brain Tumor Segmentation Network Utilizing Kernel Inception Depthwise Convolution and Bi-Cross Attention
Min, Jie
Huang, Tongyuan
Huang, Boxiong
Hu, Chuanxin
Zhang, Zhixing
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (02)
[30] Prototype-Guided Saliency Feature Learning for Person Search
Kim, Hanjae
Joung, Sunghun
Kim, Ig-Jae
Sohn, Kwanghoon
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4863 - 4872

← 1 2 3 4 5 →