EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

被引:0
|
作者
Zhang, Wenbiao [1 ]
Chen, Gang [2 ,3 ]
Wang, Hongyan [1 ]
Yang, Lina [2 ]
Sun, Tao [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Informat Sci & Engn, Hangzhou 310018, Peoples R China
[2] Jiaxing Univ, Coll Informat Sci & Engn, Jiaxing 314001, Peoples R China
[3] Jiaxing Soy Intelligent Co Ltd, Jiaxing, Peoples R China
来源
关键词
3D object detection; PointPillars; CSPDarknet; SENet;
D O I
10.1186/s13634-024-01186-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As unmanned vehicle technology advances rapidly, obstacle recognition and target detection are crucial links, which directly affect the driving safety and efficiency of unmanned vehicles. In response to the inaccurate localization of small targets such as pedestrians in current object detection tasks and the problem of losing local features in the PointPillars, this paper proposes a three-dimensional object detection method based on improved PointPillars. Firstly, addressing the issue of lost spatial and local information in the PointPillars, the feature encoding part of the PointPillars is improved, and a new pillar feature enhancement extraction module, CSM-Module, is proposed. Channel encoding and spatial encoding are introduced in the new pillar feature enhancement extraction module, fully considering the spatial information and local detailed geometric information of each pillar, thereby enhancing the feature representation capability of each pillar. Secondly, based on the fusion of CSPDarknet and SENet, a new backbone network CSE-Net is designed in this paper, enabling the extraction of rich contextual semantic information and multi-scale global features, thereby enhancing the feature extraction capability. Our method achieves higher detection accuracy when validated on the KITTI dataset. Compared to the original network, the improved algorithm's average detection accuracy is increased by 3.42%, it shows that the method is reasonable and valuable.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] 3D Object Detection Based on Attention and Multi-Scale Feature Fusion
    Liu, Minghui
    Ma, Jinming
    Zheng, Qiuping
    Liu, Yuchen
    Shi, Gang
    SENSORS, 2022, 22 (10)
  • [2] Deep multi-scale and multi-modal fusion for 3D object detection
    Guo, Rui
    Li, Deng
    Han, Yahong
    PATTERN RECOGNITION LETTERS, 2021, 151 : 236 - 242
  • [3] 3D-MSFC: A 3D multi-scale features compression method for object detection☆
    Li, Zhengxin
    Tian, Chongzhen
    Yuan, Hui
    Lu, Xin
    Malekmohamadi, Hossein
    DISPLAYS, 2024, 85
  • [4] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [5] Interactive Multi-Scale Fusion of 2D and 3D Features for Multi-Object Vehicle Tracking
    Wang, Guangming
    Peng, Chensheng
    Gu, Yingying
    Zhang, Jinpeng
    Wang, Hesheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (10) : 10618 - 10627
  • [6] Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer
    Zhang, Han
    Li, Jun
    Tang, Rui
    Shi, Zhiping
    Bu, Aojie
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 38 - 43
  • [7] Enhanced frustrum multi-scale VoteNet for 3D object detection in cluttered indoor scene
    Zhang, Xuesong
    He, Yu
    Song, Cunli
    Zhuang, Yan
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [8] Pedestrian Detection Based on Multi-Scale Fusion Features
    Jiang, Hao
    Zhang, Chuang
    Wu, Ming
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 329 - 333
  • [9] Multi-Scale Keypoints Feature Fusion Network for 3D Object Detection from Point Clouds
    Zhang, Xu
    Bai, Linjuan
    Zhang, Zuyu
    Li, Yan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
  • [10] 3D pedestrian detection based on hybrid multi-scale cascade fusion network
    Chen, Yang
    Mu, Yan
    Ni, Rongrong
    Yang, Biao
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123