MCBAN: A Small Object Detection Multi-Convolutional Block Attention Network

被引:0
|
作者
Bhanbhro, Hina [1 ]
Hooi, Yew Kwang [1 ]
Zakaria, Mohammad Nordin Bin [1 ]
Kusakunniran, Worapan [2 ]
Amur, Zaira Hassan [1 ]
机构
[1] Univ Teknol PETRONAS, Comp Informat Sci Dept, Seri Iskandar 31750, Malaysia
[2] Mahidol Univ, Fac Informat & Commun Technol, Nakhon Pathom 73170, Thailand
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 81卷 / 02期
关键词
Multi-convolutional; channel attention; spatial attention; YOLO;
D O I
10.32604/cmc.2024.052138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection has made a significant leap forward in recent years. However, the detection of small objects continues to be a great difficulty for various reasons, such as they have a very small size and they are susceptible to missed detection due to background noise. Additionally, small object information is affected due to the downsampling operations. Deep learning-based detection methods have been utilized to address the challenge posed by small objects. In this work, we propose a novel method, the Multi-Convolutional Block Attention Network (MCBAN), to increase the detection accuracy of minute objects aiming to overcome the challenge of information loss during the downsampling process. The multi-convolutional attention block (MCAB); channel attention and spatial attention module (SAM) that make up MCAB, have been crafted to accomplish small object detection with higher precision. We have carried out the experiments on the Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) and Pattern Analysis, Statical Modeling and Computational Learning (PASCAL) Visual Object Classes (VOC) datasets and have followed a step-wise process to analyze the results. These experiment results demonstrate that significant gains in performance are achieved, such as 97.75% for KITTI and 88.97% for PASCAL VOC. The findings of this study assert quite unequivocally the fact that MCBAN is much more efficient in the small object detection domain as compared to other existing approaches.
引用
收藏
页码:2243 / 2259
页数:17
相关论文
共 50 条
  • [31] An Adaptive Attention Fusion Mechanism Convolutional Network for Object Detection in Remote Sensing Images
    Ye, Yuanxin
    Ren, Xiaoyue
    Zhu, Bai
    Tang, Tengfeng
    Tan, Xin
    Gui, Yang
    Yao, Qin
    REMOTE SENSING, 2022, 14 (03)
  • [32] Convolutional block attention based network for copy-move image forgery detection
    Sabeena, M.
    Abraham, Lizy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2383 - 2405
  • [33] Convolutional block attention based network for copy-move image forgery detection
    M. Sabeena
    Lizy Abraham
    Multimedia Tools and Applications, 2024, 83 : 2383 - 2405
  • [34] Tea bud detection based on multi-scale convolutional block attention module
    Gui, Jiangsheng
    Wu, Dongwei
    Xu, Huirong
    Chen, Jianneng
    Tong, Junhua
    JOURNAL OF FOOD PROCESS ENGINEERING, 2024, 47 (02)
  • [35] Undefined Object Detection Based on Convolutional Neural Network Using Small Dataset
    Zeng, Yi-Chong
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 21 - 22
  • [36] Lightweight multi-scale network for small object detection
    Li L.
    Li B.
    Zhou H.
    PeerJ Computer Science, 2022, 8
  • [37] Lightweight multi-scale network for small object detection
    Li, Li
    Li, Bingxue
    Zhou, Hongjuan
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [38] Remote Sensing Small Object Detection Network Based on Attention Mechanism and Multi-Scale Feature Fusion
    Qu, Junsuo
    Tang, Zongbing
    Zhang, Le
    Zhang, Yanghai
    Zhang, Zhenguo
    REMOTE SENSING, 2023, 15 (11)
  • [39] MULTI-SCALE GRAPH CONVOLUTIONAL INTERACTION NETWORK FOR SALIENT OBJECT DETECTION
    Che, Wenqi
    Sun, Luoyi
    Xie, Zhifeng
    Ding, Youdong
    Han, Kaili
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 679 - 683
  • [40] Salient Object Detection with Chained Multi-Scale Fully Convolutional Network
    Tang, Youbao
    Wu, Xiangqian
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 618 - 626