PDL3D: 3D Attention Module with Partial Dense Layer for Small-to-Medium Dataset on Object Detection

Authors
Wang, Kai-Yi [1 ]
Chen, Jen-Jee [1 ]
Kuo, Po-Tsun Paul [1 ,2 ]
Tseng, Yu-Ghee [1 ]
Affiliations
[1] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Hsinchu, Taiwan
[2] Advantech Co, AI Research Ctr, Taipei, Taiwan
Keywords
Deep learning; Attention; Defect detection
DOI
10.1109/APWCS61586.2024.10679319
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Deep learning typically requires a large amount of training data. In this paper, we propose the PDL3D module, which exploits both channel attention and spatial attention mechanisms to improve the performance of deep convolutional neural networks (CNNs). PDL3D is a generic module that can be inserted into any CNN architecture and trained end-to-end with the host network. Following the concept of MobileNet, PDL3D incurs lower computational complexity in spatial attention. We show that it helps in handling small-to-medium datasets by dividing MS COCO into smaller datasets, which we call mini COCO datasets, and validating PDL3D on them with extensive experiments. Finally, we test it on a real PCB (printed circuit board) dataset from the electronics industry. Our experiments show that training PDL3D with small-to-medium datasets achieves similar or better performance compared to training existing networks with large datasets. Several CNN backbones have been tested to validate our claims.
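The abstract does not say how PDL3D applies the MobileNet concept, so the following is only a rough sketch of the general idea it refers to: a depthwise separable convolution replaces one standard convolution with a per-channel (depthwise) convolution followed by a 1x1 (pointwise) convolution, which reduces the multiply-accumulate count. The function names and the layer sizes below are illustrative assumptions, not values from the paper.

```python
def standard_conv_cost(k, c_in, c_out, h, w):
    """Multiply-accumulates of a k x k standard convolution
    producing an h x w output map (stride and padding ignored)."""
    return k * k * c_in * c_out * h * w


def depthwise_separable_cost(k, c_in, c_out, h, w):
    """Multiply-accumulates of a MobileNet-style depthwise separable
    convolution: one k x k filter per input channel, then a 1 x 1
    convolution that mixes channels."""
    depthwise = k * k * c_in * h * w
    pointwise = c_in * c_out * h * w
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
k, c_in, c_out, h, w = 3, 64, 64, 32, 32

std = standard_conv_cost(k, c_in, c_out, h, w)
sep = depthwise_separable_cost(k, c_in, c_out, h, w)

# The ratio simplifies to 1/c_out + 1/k**2, independent of h, w, and c_in.
ratio = sep / std
print(std, sep, round(ratio, 4))
```

For a 3x3 kernel the ratio approaches 1/9 as the number of output channels grows, which is the kind of saving the abstract's reference to MobileNet suggests for the spatial attention branch.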
Pages: 6