SPCS: a spatial pyramid convolutional shuffle module for YOLO to detect occluded object

Cited by: 0
Authors
Xiang Li
Miao He
Yan Liu
Haibo Luo
Moran Ju
Affiliations
[1] Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences
[2] Shenyang Institute of Automation, Chinese Academy of Sciences
[3] Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences
[4] University of Chinese Academy of Sciences
[5] Space Star Technology Co., LTD
[6] College of Information Science and Technology, Dalian Maritime University
Source
Complex & Intelligent Systems, 2023, 9 (01) : 301 - 315
Keywords
Pedestrian detection; Occluded object detection; Feature extraction; Convolutional neural network
DOI
Not available
Abstract
In crowded scenes, heavily overlapped objects are hard to distinguish from one another: most of their pixels are shared, and the visible pixels of the occluded objects, which are used to represent their features, are limited. In this paper, a spatial pyramid convolutional shuffle (SPCS) module is proposed to extract refined information from the limited visible pixels of occluded objects and to generate distinguishable representations for heavily overlapped objects. We apply four convolutional kernels with different sizes and dilation rates at each location in the pyramid features and recombine their fused outputs into spatially adjacent positions using a pixel shuffle module. In this way, four distinguishable instance predictions, each corresponding to a different convolutional kernel, can be produced for each location in the pyramid feature. In addition, applying multiple convolutions with different kernel sizes and dilation rates at the same location generates refined information for the corresponding regions, which helps extract features for occluded objects from their limited visible pixels. Extensive experimental results demonstrate that the SPCS module effectively boosts performance in crowded human detection. A YOLO detector with the SPCS module achieves 94.11% AP, 41.75% MR, and 97.75% Recall on CrowdHuman, and 93.04% AP and 98.45% Recall on WiderPerson, which are the best results compared with previous state-of-the-art models.
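The recombination step described in the abstract can be illustrated with a minimal sketch. It covers only the pixel shuffle part: four same-sized branch outputs are interleaved so that each input location becomes a 2 x 2 block holding one value per branch. The function names, the upscale factor of 2, and the (kernel size, dilation) pairs are assumptions made for illustration; the abstract does not give the actual kernel settings or how the branch outputs are fused.

```python
def effective_kernel_size(kernel_size, dilation):
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def pixel_shuffle_2x(branches):
    """Interleave four H x W maps into one 2H x 2W map.

    branches[r * 2 + c] supplies the output pixel at offset (r, c) inside
    each 2 x 2 block, the usual channel-to-space layout of a pixel
    shuffle with upscale factor 2.
    """
    if len(branches) != 4:
        raise ValueError("expected exactly four branch outputs")
    height, width = len(branches[0]), len(branches[0][0])
    out = [[None] * (2 * width) for _ in range(2 * height)]
    for idx, branch in enumerate(branches):
        r, c = divmod(idx, 2)  # position of this branch inside the block
        for y in range(height):
            for x in range(width):
                out[2 * y + r][2 * x + c] = branch[y][x]
    return out


# Hypothetical (kernel_size, dilation) pairs, one per branch.
# The abstract does not list the real values.
BRANCH_CONFIGS = [(1, 1), (3, 1), (3, 2), (5, 1)]

# Stand-in branch outputs: branch k is a 2 x 2 map filled with k.
branches = [[[k, k], [k, k]] for k in range(4)]
for row in pixel_shuffle_2x(branches):
    print(row)
# [0, 1, 0, 1]
# [2, 3, 2, 3]
# [0, 1, 0, 1]
# [2, 3, 2, 3]
```

Each 2 x 2 block of the output holds one value from each branch, which is how a single feature location can yield four separate predictions. In a real detector the branch maps would come from convolutions over the pyramid features rather than constant fill values.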
Pages: 301 - 315
Number of pages: 14
Related papers
9 items in total
  • [1] SPCS: a spatial pyramid convolutional shuffle module for YOLO to detect occluded object
    Li, Xiang
    He, Miao
    Liu, Yan
    Luo, Haibo
    Ju, Moran
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 301 - 315
  • [2] YOLO Object Detection Algorithm with Hybrid Atrous Convolutional Pyramid
    Wang, Hui
    Wang, Zhiqiang
    Yu, Lijun
    He, Xinting
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 940 - 945
  • [3] Group and Shuffle Convolutional Neural Networks with Pyramid Pooling Module for Automated Pterygium Segmentation
    Abdani, Siti Raihanah
    Zulkifley, Mohd Asyraf
    Zulkifley, Nuraisyah Hani
    DIAGNOSTICS, 2021, 11 (06)
  • [4] DC-SPP-YOLO: Dense connection and spatial pyramid pooling based YOLO for object detection
    Huang, Zhanchao
    Wang, Jianlin
    Fu, Xuesong
    Yu, Tao
    Guo, Yongqi
    Wang, Rutong
    INFORMATION SCIENCES, 2020, 522 (522) : 241 - 258
  • [5] Residual-Shuffle Network with Spatial Pyramid Pooling Module for COVID-19 Screening
    Zulkifley, Mohd Asyraf
    Abdani, Siti Raihanah
    Zulkifley, Nuraisyah Hani
    Shahrimin, Mohamad Ibrani
    DIAGNOSTICS, 2021, 11 (08)
  • [6] WEAKLY SUPERVISED OBJECT LOCALIZATION WITH DEEP CONVOLUTIONAL NEURAL NETWORK BASED ON SPATIAL PYRAMID SALIENCY MAP
    Wan, Zhiqiang
    He, Haibo
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4177 - 4181
  • [7] YOLO-SSP: an object detection model based on pyramid spatial attention and improved downsampling strategy for remote sensing images
    Liu, Yongli
    Yang, Degang
    Song, Tingting
    Ye, Yichen
    Zhang, Xin
    VISUAL COMPUTER, 2025, 41 (03) : 1467 - 1484
  • [8] Improve YOLOv3 using dilated spatial pyramid module for multi-scale object detection
    Zhang, Xiaoguo
    Gao, Ye
    Wang, Huiqing
    Wang, Qing
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (04)
  • [9] A deep convolutional neural network for the automatic segmentation of glioblastoma brain tumor: Joint spatial pyramid module and attention mechanism network
    Liu, Hengxin
    Huang, Jingteng
    Li, Qiang
    Guan, Xin
    Tseng, Minglang
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 148