Inception Convolution and Feature Fusion for Person Search

被引:4
|
作者
Ouyang, Huan [1 ,2 ]
Zeng, Jiexian [1 ,3 ]
Leng, Lu [1 ,2 ]
机构
[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China
[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China
[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China
基金
中国国家自然科学基金;
关键词
person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;
D O I
10.3390/s23041984
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Multi-feature fusion network for person reidentification
    Wang, Xihe
    Zhang, Yongjun
    Xu, Yujie
    Cui, Zhongwei
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [22] Hybrid Feature Fusion for Person Recognition in Photo Albums
    Li, Sheng
    Huang, Likun
    Zhang, Wei
    Tang, Bing
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [23] Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search
    You, Kaiyang
    Chen, Wenjing
    Wang, Chengji
    Sun, Hao
    Xie, Wei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2230 - 2234
  • [24] Vehicle Reidentification Based on Convolution and Vision Transformer Feature Fusion
    Gong, Rui
    Zhang, Xue
    Pan, Jianan
    Guo, Jie
    Nie, Xiushan
    IEEE MULTIMEDIA, 2024, 31 (02) : 61 - 68
  • [25] An Efficient Group Convolution and Feature Fusion Method for Weed Detection
    Chen, Chaowen
    Zang, Ying
    Jiao, Jinkang
    Yan, Daoqing
    Fan, Zhuorong
    Cui, Zijian
    Zhang, Minghua
    AGRICULTURE-BASEL, 2025, 15 (01):
  • [26] CATrack: Convolution and Attention Feature Fusion for Visual Object Tracking
    Zhang, Longkun
    Wen, Jiajun
    Dai, Zichen
    Zhou, Rouyi
    Lai, Zhihui
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 469 - 480
  • [27] Person Reidentification Based on Multiscale Convolutional Feature Fusion
    Xu Longzhuang
    Peng Li
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (14)
  • [28] INCEPTION OF THE RADIOLOGY FEATURE
    BUONOCORE, E
    CLEVELAND CLINIC QUARTERLY, 1982, 49 (04): : 269 - 269
  • [29] KIDBA-Net: A Multi-Feature Fusion Brain Tumor Segmentation Network Utilizing Kernel Inception Depthwise Convolution and Bi-Cross Attention
    Min, Jie
    Huang, Tongyuan
    Huang, Boxiong
    Hu, Chuanxin
    Zhang, Zhixing
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (02)
  • [30] Prototype-Guided Saliency Feature Learning for Person Search
    Kim, Hanjae
    Joung, Sunghun
    Kim, Ig-Jae
    Sohn, Kwanghoon
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4863 - 4872