Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection

被引:0
|
作者
Li, You [1 ]
Zhang, Qingxuan [1 ]
Zhang, Yulei [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing, Peoples R China
关键词
D O I
10.1088/1742-6596/1267/1/012047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a simple but effective framework K-AFPN that incorporates feature pyramid method for small-size pedestrian detection, fully utilizing the lower-layer detail features and higher-layer semantic features. The method not only enhances the robustness of the features, but also improves the discrimination of the feature maps, achieving competitive accuracy. In addition, atrous convolution is used to optimize the network for high-resolution feature maps, avoiding information loss caused by frequent down or up sampling. On top of the backbone network, K-means algorithm is used to obtain optimal initial anchor base sizes, which reduces computational costs and improves location accuracy. Hence, our method pays more concentration on pedestrians, especially those of relatively small size. Comprehensive experimental results on two classic pedestrian benchmarks illustrate the effectiveness of the proposed approach.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] MLFF: A Object Detector based on a Multi-Layer Feature Fusion
    Peng, Panyu
    Liu, Yong
    Lv, Xingfeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [42] Pedestrian reidentification based on multiscale convolution feature fusion
    Kaiyang Liao
    Gang Huang
    Yuanlin Zheng
    Guangfeng Lin
    Congjun Cao
    Signal, Image and Video Processing, 2022, 16 : 1691 - 1699
  • [43] Pedestrian reidentification based on multiscale convolution feature fusion
    Liao, Kaiyang
    Huang, Gang
    Zheng, Yuanlin
    Lin, Guangfeng
    Cao, Congjun
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (06) : 1691 - 1699
  • [44] MMHFNet: Multi-modal and multi-layer hybrid fusion network for voice pathology detection
    Mohammed, Hussein M. A.
    Omeroglu, Asli Nur
    Oral, Emin Argun
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [45] Multi-view Stereo Vision Reconstruction Network with Fusion Attention Mechanism and Multi-layer Dynamic Deformable Convolution
    Sun, Kai
    Zhang, Cheng
    Zhan, Tian
    Su, Di
    Binggong Xuebao/Acta Armamentarii, 2024, 45 (10): : 3631 - 3641
  • [46] Multi-window Transformer parallel fusion feature pyramid network for pedestrian orientation detection
    Xiao Li
    Shexiang Ma
    Liqing Shan
    Xiao Li
    Multimedia Systems, 2023, 29 : 587 - 603
  • [47] Multimodal Sentiment Analysis Based on Attentional Temporal Convolutional Network and Multi-Layer Feature Fusion
    Cheng, Hongju
    Yang, Zizhen
    Zhang, Xiaoqi
    Yang, Yang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3149 - 3163
  • [48] Metro Pedestrian Detection Algorithm Based on Multi-scale Weighted Feature Fusion Network
    Dong Xiaowei
    Han Yue
    Zhang Zheng
    Qu Hongbin
    Gao Guofei
    Chen Mingdian
    Li Bo
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2113 - 2120
  • [49] Multi-window Transformer parallel fusion feature pyramid network for pedestrian orientation detection
    Li, Xiao
    Ma, Shexiang
    Shan, Liqing
    Li, Xiao
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 587 - 603
  • [50] Hyperspectral image classification based on a novel Lush multi-layer feature fusion bias network
    Shi, Cuiping
    Chen, Jiaxiang
    Wang, Liguo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247