Inception Convolution and Feature Fusion for Person Search

被引:4
|
作者
Ouyang, Huan [1 ,2 ]
Zeng, Jiexian [1 ,3 ]
Leng, Lu [1 ,2 ]
机构
[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Peoples R China
[2] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recognit, Nanchang 330063, Peoples R China
[3] Nanchang Hangkong Univ, Sci & Technol Coll, Gongqingcheng 332020, Peoples R China
基金
中国国家自然科学基金;
关键词
person search; Faster R-CNN; inception convolution; feature fusion; region proposal network (RPN); double-head; efficient learning; REIDENTIFICATION; NETWORK;
D O I
10.3390/s23041984
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Multimodal Person Search Combining Information Fusion and Relevance Feedback
    Goldmann, Lutz
    Samour, Amjad
    Ebrahimi, Touradj
    Sikora, Thomas
    2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 506 - +
  • [42] Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search
    Liu, Jiawei
    Zha, Zheng-Jun
    Hong, Richang
    Wang, Meng
    Zhang, Yongdong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 665 - 673
  • [43] DAFCNN: A Dual-Channel Feature Extraction and Attention Feature Fusion Convolution Neural Network for SAR Image and MS Image Fusion
    Luo, Jiahao
    Zhou, Fang
    Yang, Jun
    Xing, Mengdao
    REMOTE SENSING, 2023, 15 (12)
  • [44] Asymmetric Convolution Networks Based on Multi-feature Fusion for Object Detection
    Yang, Zhenkun
    Ma, Xianghua
    An, Jing
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1355 - 1360
  • [45] Feature fusion temporal convolution: Wind power forecasting with light hyperparameter optimization
    Mansoor, Majad
    Tao, Gong
    Mirza, Adeel Feroz
    Irfan, Muhammad
    Chen, Wei
    ENERGY REPORTS, 2025, 13 : 2468 - 2481
  • [46] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
    Li, You
    Zhang, Qingxuan
    Zhang, Yulei
    2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [47] Point Cloud Classification Method Based on Graph Convolution and Multilayer Feature Fusion
    Sheng, Tian
    Anyang, Long
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (14)
  • [48] Feature Fusion Based on Graph Convolution Network for Modulation Classification in Underwater Communication
    Yao, Xiaohui
    Yang, Honghui
    Sheng, Meiping
    ENTROPY, 2023, 25 (07)
  • [49] Recognition of Maize Seedling and Weed Based on Light Weight Convolution and Feature Fusion
    Meng Q.
    Zhang M.
    Yang X.
    Liu Y.
    Zhang Z.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2020, 51 (12): : 238 - 245and303
  • [50] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Donghua Liu
    Guodong Wang
    Guangtao Zhai
    Multimedia Tools and Applications, 2022, 81 : 37939 - 37952