Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection

被引:98
|
作者
Zhu, Yousong [1 ,2 ]
Zhao, Chaoyang [1 ,2 ]
Guo, Haiyun [1 ,2 ]
Wang, Jinqiao [1 ,2 ]
Zhao, Xu [1 ,2 ]
Lu, Hanqing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; cascade attention; global structure; local parts;
D O I
10.1109/TIP.2018.2865280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The field of object detection has made great progress in recent years. Most of these improvements are derived from using a more sophisticated convolutional neural network. However, in the case of humans, the attention mechanism, global structure information, and local details of objects all play an important role for detecting an object. In this paper, we propose a novel fully convolutional network, named as Attention CoupleNet, to incorporate the attention-related information and global and local information of objects to improve the detection performance. Specifically, we first design a cascade attention structure to perceive the global scene of the image and generate class-agnostic attention maps. Then the attention maps are encoded into the network to acquire object-aware features. Next, we propose a unique fully convolutional coupling structure to couple global structure and local parts of the object to further formulate a discriminative feature representation. To fully explore the global and local properties, we also design different coupling strategies and normalization ways to make full use of the complementary advantages between the global and local information. Extensive experiments demonstrate the effectiveness of our approach. We achieve state-of-the-art results on all three challenging data sets, i. e., a mAP of 85.7% on VOC07, 84.3% on VOC12, and 35.4% on COCO. Codes are publicly available at https://github. com/tshizys/CoupleNet.
引用
收藏
页码:113 / 126
页数:14
相关论文
共 50 条
  • [1] A Fully Convolutional Network based on Spatial Attention for Saliency Object Detection
    Chen, Kai
    Wang, Yongxiong
    Hu, Chuanfei
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5707 - 5711
  • [2] Symmetric pyramid attention convolutional neural network for moving object detection
    Shaocheng Qu
    Hongrui Zhang
    Wenhui Wu
    Wenjun Xu
    Yifei Li
    Signal, Image and Video Processing, 2021, 15 : 1747 - 1755
  • [3] Symmetric pyramid attention convolutional neural network for moving object detection
    Qu, Shaocheng
    Zhang, Hongrui
    Wu, Wenhui
    Xu, Wenjun
    Li, Yifei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (08) : 1747 - 1755
  • [4] Co-Saliency Detection With Co-Attention Fully Convolutional Network
    Gao, Guangshuai
    Zhao, Wenting
    Liu, Qingjie
    Wang, Yunhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 877 - 889
  • [5] Fully convolutional attention network for biomedical image segmentation
    Cheng, Junlong
    Tian, Shengwei
    Yu, Long
    Lu, Hongchun
    Lv, Xiaoyi
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 107
  • [6] Fully convolutional network with attention modules for semantic segmentation
    Yunjia Huang
    Haixia Xu
    Signal, Image and Video Processing, 2021, 15 : 1031 - 1039
  • [7] Fully convolutional network with attention modules for semantic segmentation
    Huang, Yunjia
    Xu, Haixia
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (05) : 1031 - 1039
  • [8] MCBAN: A Small Object Detection Multi-Convolutional Block Attention Network
    Bhanbhro, Hina
    Hooi, Yew Kwang
    Zakaria, Mohammad Nordin Bin
    Kusakunniran, Worapan
    Amur, Zaira Hassan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (02): : 2243 - 2259
  • [9] Object detection based on polarization image fusion and grouped convolutional attention network
    Ailing Tan
    Tianan Guo
    Yong Zhao
    Yunxin Wang
    Xiaohang Li
    The Visual Computer, 2024, 40 : 3199 - 3215
  • [10] Object detection based on polarization image fusion and grouped convolutional attention network
    Tan, Ailing
    Guo, Tianan
    Zhao, Yong
    Wang, Yunxin
    Li, Xiaohang
    VISUAL COMPUTER, 2024, 40 (05): : 3199 - 3215