Research of object detection method based on DCGAN data-set enhancement technique

Authors
Shi Dunhuang [1]
Yu Yanan [1, 2]
Li Huiping [1]
Affiliations
[1] Tianjin Univ Technol & Educ, Sch Informat Technol Engn, Tianjin 300222, Peoples R China
[2] Tianjin Univ, Key Lab Micro Optoelectro Mech Syst Technol, Minist Educ, Tianjin 300072, Peoples R China
Keywords
Deep learning; DCGAN; object detection; data enhancement
DOI
10.1117/12.2606531
Chinese Library Classification number
P1 [Astronomy]
Discipline classification code
0704
Abstract
With the rise of a new generation of artificial intelligence technology, object detection methods based on deep learning have achieved remarkable results. This paper compares the detection accuracy of three popular object detection algorithms: You Only Look Once (YOLO V3), Faster Region-based Convolutional Neural Network (Faster R-CNN), and Single Shot MultiBox Detector (SSD). To address the practical problem of detecting building block parts with irregular shapes and varying sizes, a method is proposed that combines deep convolutional generative adversarial networks (DCGAN) with a deep-learning-based object detection algorithm, in order to mitigate overfitting and weak generalization on limited datasets and to improve detection accuracy. Experimental results show that: (1) On public datasets, when the training data is reduced, the mean average precision (mAP) of all three algorithms decreases; SSD shows the smallest change, a decrease of 7.81%. (2) When the control variable method is used to train on the building block parts, the detection accuracy of all three algorithms is low if the training data is insufficient. (3) After SSD is combined with DCGAN and applied to the building block part detection task, the mAP improves from 79.63% to 83.32%, a clear gain in detection accuracy.
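As a small worked example of the last result, the reported mAP figures can be turned into an absolute and a relative gain. This is only an illustrative sketch: the two numbers come from the abstract, while the variable names and the assumption that the 79.63% figure is SSD without DCGAN augmentation are ours.

```python
# mAP values (%) as reported in the abstract; names are illustrative.
# Assumption: the first value is SSD before DCGAN-based augmentation.
map_before = 79.63
map_after = 83.32

# Absolute gain is measured in percentage points.
absolute_gain = map_after - map_before

# Relative gain expresses the same change as a share of the baseline.
relative_gain = absolute_gain / map_before * 100

print(f"absolute gain: {absolute_gain:.2f} percentage points")  # 3.69
print(f"relative gain: {relative_gain:.2f}%")                   # 4.63
```

The distinction matters when comparing with the 7.81% decrease quoted for SSD on public datasets, since the abstract does not state whether that figure is absolute or relative.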
Pages: 6