PDL3D: 3D Attention Module with Partial Dense Layer for Small-to-Medium Dataset on Object Detection

被引:0
|
作者
Wang, Kai-Yi [1 ]
Chen, Jen-Jee [1 ]
Kuo, Po-Tsun Paul [1 ,2 ]
Tseng, Yu-Ghee [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Hsinchu, Taiwan
[2] Advantech Co, AI Reasearch Ctr, Taipei, Taiwan
关键词
Deep learning; Attention; Defect detection;
D O I
10.1109/APWCS61586.2024.10679319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning typically requires a large amount of training data. In this paper, we propose a PDL3D module, which exploits both channel attention and spatial attention mechanisms to improve the performance of deep convolutional neural networks (CNNs). PDL3D is a generic module that can be inserted into any CNN architecture and can be trained end-to-end with the inserted CNN architecture. Following the concept of MobileNet, PDL3D incurs less computation complexity in spatial attention. We prove it to be helpful in handing small to medium datasets by dividing MS COCO into smaller datasets, which we call mini coco datasets, and validating PDL3D on them with extensive experiments. Finally, we test it on a real PCB (Printed Circuit Board) dataset from electronic industry. Our experiments show that training PDL3D with small-to-medium datasets achieves similar or better performance compared to training existing networks with large datasets. Several CNN backbones have been tested to validate our claims.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Stereo 3D Object Detection Using a Feature Attention Module
    Zhao, Kexin
    Jiang, Rui
    He, Jun
    ALGORITHMS, 2023, 16 (12)
  • [2] ARM3D: Attention-based relation module for indoor 3D object detection
    Lan, Yuqing
    Duan, Yao
    Liu, Chenyi
    Zhu, Chenyang
    Xiong, Yueshan
    Huang, Hui
    Xu, Kai
    COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) : 395 - 414
  • [3] ARM3D: Attention-based relation module for indoor 3D object detection
    Yuqing Lan
    Yao Duan
    Chenyi Liu
    Chenyang Zhu
    Yueshan Xiong
    Hui Huang
    Kai Xu
    ComputationalVisualMedia, 2022, 8 (03) : 395 - 414
  • [4] ARM3D: Attention-based relation module for indoor 3D object detection
    Yuqing Lan
    Yao Duan
    Chenyi Liu
    Chenyang Zhu
    Yueshan Xiong
    Hui Huang
    Kai Xu
    Computational Visual Media, 2022, 8 : 395 - 414
  • [5] Sparse Dense Fusion for 3D Object Detection
    Gao, Yulu
    Sima, Chonghao
    Shi, Shaoshuai
    Di, Shangzhe
    Liu, Si
    Li, Hongyang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10939 - 10946
  • [6] Dense Point Diffusion for 3D Object Detection
    Liu, Xu
    Cao, Jiayan
    Bi, Qianqian
    Wang, Jian
    Shi, Boxin
    Wei, Yichen
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 762 - 770
  • [7] Dense Voxel Fusion for 3D Object Detection
    Mahmoud, Anas
    Hu, Jordan S. K.
    Waslander, Steven L.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672
  • [8] Dense projection fusion for 3D object detection
    Chen, Zhao
    Hu, Bin-Jie
    Luo, Chengxi
    Chen, Guohao
    Zhu, Haohui
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [9] Unsupervised Learning of 3D Object Reconstruction with Small Dataset
    Chen, Shan-Ling
    Shih, Kuang-Tsu
    Chen, Homer H.
    2021 4TH IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR 2021), 2021, : 54 - 59
  • [10] 3D Object Detection on large-scale dataset
    Zhao, Yan
    Zhu, Jihong
    Liang, Haoyu
    Chen, Lyujie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,