PDL3D: 3D Attention Module with Partial Dense Layer for Small-to-Medium Dataset on Object Detection

被引：0

作者：

Wang, Kai-Yi ^{[1
]}

Chen, Jen-Jee ^{[1
]}

Kuo, Po-Tsun Paul ^{[1
,2
]}

Tseng, Yu-Ghee ^{[1
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Hsinchu, Taiwan

[2] Advantech Co, AI Reasearch Ctr, Taipei, Taiwan

来源：

2024 IEEE VTS ASIA PACIFIC WIRELESS COMMUNICATIONS SYMPOSIUM, APWCS 2024 | 2024年

关键词：

Deep learning; Attention; Defect detection;

D O I：

10.1109/APWCS61586.2024.10679319

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning typically requires a large amount of training data. In this paper, we propose a PDL3D module, which exploits both channel attention and spatial attention mechanisms to improve the performance of deep convolutional neural networks (CNNs). PDL3D is a generic module that can be inserted into any CNN architecture and can be trained end-to-end with the inserted CNN architecture. Following the concept of MobileNet, PDL3D incurs less computation complexity in spatial attention. We prove it to be helpful in handing small to medium datasets by dividing MS COCO into smaller datasets, which we call mini coco datasets, and validating PDL3D on them with extensive experiments. Finally, we test it on a real PCB (Printed Circuit Board) dataset from electronic industry. Our experiments show that training PDL3D with small-to-medium datasets achieves similar or better performance compared to training existing networks with large datasets. Several CNN backbones have been tested to validate our claims.

引用

页数：6

共 50 条

[41] 3D Deformable Spatial Pyramid for Dense 3D Motion Flow of Deformable Object
Hur, Junhwa
Lim, Hwasup
Ahn, Sang Chul
ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT 1, 2014, 8887 : 118 - 127
[42] 3DRM: Pair-wise relation module for 3D object detection
Lan, Yuqing
Duan, Yao
Shi, Yifei
Huang, Hui
Xu, Kai
COMPUTERS & GRAPHICS-UK, 2021, 98 (98): : 58 - 70
[43] AEPF: Attention-Enabled Point Fusion for 3D Object Detection
Sharma, Sachin
Meyer, Richard T.
Asher, Zachary D.
SENSORS, 2024, 24 (17)
[44] EPNet with Self-Attention for Fast and Accurate 3D Object Detection
Sakai, Yuto
Nishikawa, Hiroki
Kong, Xiangbo
Tomiyama, Hiroyuki
2024 INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS, AND COMMUNICATIONS, ITC-CSCC 2024, 2024,
[45] 3D Object Detection with LiDAR Based on Multi-Attention Mechanism
Cao, Jie
Peng, Yiqiang
Fan, Likang
Mo, Lingfan
Wang, Longfei
LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
[46] DyFusion: Cross-Attention 3D Object Detection with Dynamic Fusion
Bi, Jiangfeng
Wei, Haiyue
Zhang, Guoxin
Yang, Kuihe
Song, Ziying
IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (02) : 106 - 112
[47] 3D Object Detection Method Combining on Graph Sampling and Graph Attention
Li, Wenju
Chu, Wanghui
Cui, Liu
Su, Pan
Zhang, Gan
Computer Engineering and Applications, 2023, 59 (09) : 237 - 244
[48] SCANET: SPATIAL-CHANNEL ATTENTION NETWORK FOR 3D OBJECT DETECTION
Lu, Haihua
Chen, Xuesong
Zhang, Guiying
Zhou, Qiuhao
Ma, Yanbo
Zhao, Yong
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1992 - 1996
[49] AA3DNet: Attention Augmented Real Time 3D Object Detection
Sagar, Abhinav
2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 628 - 635
[50] SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data
Hu, Yuan-Ting
Wang, Jiahong
Yeh, Raymond A.
Schwing, Alexander G.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3359 - 3369

← 1 2 3 4 5 →