PDL3D: 3D Attention Module with Partial Dense Layer for Small-to-Medium Dataset on Object Detection

被引:0
|
作者
Wang, Kai-Yi [1 ]
Chen, Jen-Jee [1 ]
Kuo, Po-Tsun Paul [1 ,2 ]
Tseng, Yu-Ghee [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Hsinchu, Taiwan
[2] Advantech Co, AI Reasearch Ctr, Taipei, Taiwan
关键词
Deep learning; Attention; Defect detection;
D O I
10.1109/APWCS61586.2024.10679319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning typically requires a large amount of training data. In this paper, we propose a PDL3D module, which exploits both channel attention and spatial attention mechanisms to improve the performance of deep convolutional neural networks (CNNs). PDL3D is a generic module that can be inserted into any CNN architecture and can be trained end-to-end with the inserted CNN architecture. Following the concept of MobileNet, PDL3D incurs less computation complexity in spatial attention. We prove it to be helpful in handing small to medium datasets by dividing MS COCO into smaller datasets, which we call mini coco datasets, and validating PDL3D on them with extensive experiments. Finally, we test it on a real PCB (Printed Circuit Board) dataset from electronic industry. Our experiments show that training PDL3D with small-to-medium datasets achieves similar or better performance compared to training existing networks with large datasets. Several CNN backbones have been tested to validate our claims.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] 3D Deformable Spatial Pyramid for Dense 3D Motion Flow of Deformable Object
    Hur, Junhwa
    Lim, Hwasup
    Ahn, Sang Chul
    ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT 1, 2014, 8887 : 118 - 127
  • [42] 3DRM: Pair-wise relation module for 3D object detection
    Lan, Yuqing
    Duan, Yao
    Shi, Yifei
    Huang, Hui
    Xu, Kai
    COMPUTERS & GRAPHICS-UK, 2021, 98 (98): : 58 - 70
  • [43] AEPF: Attention-Enabled Point Fusion for 3D Object Detection
    Sharma, Sachin
    Meyer, Richard T.
    Asher, Zachary D.
    SENSORS, 2024, 24 (17)
  • [44] EPNet with Self-Attention for Fast and Accurate 3D Object Detection
    Sakai, Yuto
    Nishikawa, Hiroki
    Kong, Xiangbo
    Tomiyama, Hiroyuki
    2024 INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS, AND COMMUNICATIONS, ITC-CSCC 2024, 2024,
  • [45] 3D Object Detection with LiDAR Based on Multi-Attention Mechanism
    Cao, Jie
    Peng, Yiqiang
    Fan, Likang
    Mo, Lingfan
    Wang, Longfei
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [46] DyFusion: Cross-Attention 3D Object Detection with Dynamic Fusion
    Bi, Jiangfeng
    Wei, Haiyue
    Zhang, Guoxin
    Yang, Kuihe
    Song, Ziying
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (02) : 106 - 112
  • [47] 3D Object Detection Method Combining on Graph Sampling and Graph Attention
    Li, Wenju
    Chu, Wanghui
    Cui, Liu
    Su, Pan
    Zhang, Gan
    Computer Engineering and Applications, 2023, 59 (09) : 237 - 244
  • [48] SCANET: SPATIAL-CHANNEL ATTENTION NETWORK FOR 3D OBJECT DETECTION
    Lu, Haihua
    Chen, Xuesong
    Zhang, Guiying
    Zhou, Qiuhao
    Ma, Yanbo
    Zhao, Yong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1992 - 1996
  • [49] AA3DNet: Attention Augmented Real Time 3D Object Detection
    Sagar, Abhinav
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 628 - 635
  • [50] SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data
    Hu, Yuan-Ting
    Wang, Jiahong
    Yeh, Raymond A.
    Schwing, Alexander G.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3359 - 3369