Attention-based scale sequence network for small object detection

Cited by: 2
Authors
Lee, Young-Woon [1]
Kim, Byung-Gyu [2]
Affiliations
[1] Sunmoon Univ, Dept Comp Engn, Asan, South Korea
[2] Sookmyung Womens Univ, Div Artificial Intelligence Engn, Seoul, South Korea
Keywords
Small object detection; Feature pyramid network; Scale sequence; Attention mechanism; Deep learning
DOI
10.1016/j.heliyon.2024.e32931
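Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]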
Discipline classification codes
07; 0710; 09
Abstract
Recent advances in deep learning continue to push the state of the art across computer vision, with object recognition receiving particular attention. Nevertheless, recognizing small objects remains challenging, even though it is of utmost importance in real-world applications such as searching for missing persons in aerial photography. The core structure of object recognition neural networks is the feature pyramid network (FPN), and You Only Look Once (YOLO) is the most widely used representative model following this structure. In this study, we propose an attention-based scale sequence network (ASSN) that improves on the scale sequence feature pyramid network (ssFPN), enhancing the performance of FPN-based detectors on small objects. ASSN is a lightweight attention module optimized for FPN-based detectors and can be applied to any model with a corresponding structure. Compared to the baselines (YOLOv7 and YOLOv8), the proposed ASSN improves average precision (AP) by up to 0.6% and AP for small objects (AP(s)) by up to 1.9%. Furthermore, ASSN outperforms ssFPN while being lighter and better optimized, which reduces computational complexity and improves processing speed. ASSN is open source, with implementations based on YOLO versions 7 and 8, available in our public repository: https://github.com/smu-ivpl/ASSN.git
Pages: 11