PDL3D: 3D Attention Module with Partial Dense Layer for Small-to-Medium Dataset on Object Detection

被引:0
|
作者
Wang, Kai-Yi [1 ]
Chen, Jen-Jee [1 ]
Kuo, Po-Tsun Paul [1 ,2 ]
Tseng, Yu-Ghee [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Hsinchu, Taiwan
[2] Advantech Co, AI Reasearch Ctr, Taipei, Taiwan
关键词
Deep learning; Attention; Defect detection;
D O I
10.1109/APWCS61586.2024.10679319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning typically requires a large amount of training data. In this paper, we propose a PDL3D module, which exploits both channel attention and spatial attention mechanisms to improve the performance of deep convolutional neural networks (CNNs). PDL3D is a generic module that can be inserted into any CNN architecture and can be trained end-to-end with the inserted CNN architecture. Following the concept of MobileNet, PDL3D incurs less computation complexity in spatial attention. We prove it to be helpful in handing small to medium datasets by dividing MS COCO into smaller datasets, which we call mini coco datasets, and validating PDL3D on them with extensive experiments. Finally, we test it on a real PCB (Printed Circuit Board) dataset from electronic industry. Our experiments show that training PDL3D with small-to-medium datasets achieves similar or better performance compared to training existing networks with large datasets. Several CNN backbones have been tested to validate our claims.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] ARPNET: attention region proposal network for 3D object detection
    Ye, Yangyang
    Zhang, Chi
    Hao, Xiaoli
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)
  • [32] Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation
    Tremblay, Jonathan
    To, Thang
    Birchfield, Stan
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2119 - 2122
  • [33] CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection
    Deng, Jinhao
    Ye, Wei
    Wu, Hai
    Huang, Xun
    Xia, Qiming
    Li, Xin
    Fang, Jin
    Li, Wei
    Wen, Chenglu
    Wang, Cheng
    COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 219 - 236
  • [34] Automotive Radar Dataset for Deep Learning Based 3D Object Detection
    Meyer, Michael
    Kuschk, Georg
    2019 16TH EUROPEAN RADAR CONFERENCE (EURAD), 2019, : 129 - 132
  • [35] 3D Object Detection with Pointformer
    Pan, Xuran
    Xia, Zhuofan
    Song, Shiji
    Li, Li Erran
    Huang, Gao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7459 - 7468
  • [36] A survey of 3D object detection
    Wei Liang
    Pengfei Xu
    Ling Guo
    Heng Bai
    Yang Zhou
    Feng Chen
    Multimedia Tools and Applications, 2021, 80 : 29617 - 29641
  • [37] A survey of 3D object detection
    Liang, Wei
    Xu, Pengfei
    Guo, Ling
    Bai, Heng
    Zhou, Yang
    Chen, Feng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29617 - 29641
  • [38] CDAF3D: Cross-Dimensional Attention Fusion for Indoor 3D Object Detection
    Wang, Shilin
    Huang, Hai
    Zhu, Yueyan
    Tang, Zhenqi
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 165 - 177
  • [39] Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
    Ye, Xiaoqing
    Shu, Mao
    Li, Hanyu
    Shi, Yifeng
    Li, Yingying
    Wang, Guangjie
    Tan, Xiao
    Ding, Errui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21309 - 21318
  • [40] 3D object detection algorithm fusing dense connectivity and Gaussian distance
    Cheng, Xin
    Liu, Sheng-Xian
    Zhou, Jing-Mei
    Zhou, Zhou
    Zhao, Xiang-Mo
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (12): : 3589 - 3600