Image Annotation by Deep Neural Networks with Attention Shaping

Authors
Zheng, Kexin [1]
Lv, Shaohe [1]
Ma, Fang [1]
Chen, Fei [1]
Jin, Chi [2]
Dou, Yong [1]
Affiliations
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, 109 Deya Rd, Changsha 410073, Hunan, Peoples R China
[2] Univ South China, Sch Comp Sci & Technol, Changsha 410012, Hunan, Peoples R China
Keywords
computer vision; image annotation; DNN; attention mechanism
DOI
10.1117/12.2281747
Chinese Library Classification (CLC) number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
Image annotation is the task of assigning semantic labels to an image. Recently, deep neural networks with visual attention have been used successfully in many computer vision tasks. In this paper, we show that the conventional attention mechanism is easily misled by the salient class, i.e., across attention iterations the attended region always contains part of the image area describing the content of the salient class. To address this, we propose a novel attention shaping mechanism, which aims to maximize the non-overlapping area between consecutive attention processes by taking into account the history of previous attention vectors. Several weighting policies are studied to utilize the history information in different manners. On two benchmark datasets, PASCAL VOC2012 and MIRFlickr-25k, the average precision is improved by up to 10% in comparison with state-of-the-art annotation methods.
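
The abstract does not give the formulation of attention shaping, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes attention is a softmax over per-region scores and that history is used by subtracting a weighted sum of previous attention vectors from those scores. The function names (`history_weights`, `shaped_attention`, `overlap`), the three weighting policies, and the `strength` and `decay` parameters are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def history_weights(n, policy="uniform", decay=0.5):
    """Hypothetical weighting policies over n past attention vectors (oldest first)."""
    if n == 0:
        return []
    if policy == "uniform":   # every past step counts equally
        return [1.0 / n] * n
    if policy == "last":      # only the most recent step counts
        return [0.0] * (n - 1) + [1.0]
    if policy == "decay":     # recent steps count more than older ones
        raw = [decay ** (n - 1 - i) for i in range(n)]
        total = sum(raw)
        return [r / total for r in raw]
    raise ValueError(f"unknown policy: {policy}")


def shaped_attention(scores, history, policy="uniform", strength=5.0):
    """Assumed shaping rule: penalize regions that earlier steps already attended."""
    weights = history_weights(len(history), policy)
    penalty = [
        sum(w * past[i] for w, past in zip(weights, history))
        for i in range(len(scores))
    ]
    return softmax([s - strength * p for s, p in zip(scores, penalty)])


def overlap(a, b):
    """Shared attention mass between two attention vectors (1.0 if identical)."""
    return sum(min(x, y) for x, y in zip(a, b))


# Region 0 plays the role of the salient class: it has the highest raw score.
scores = [3.0, 1.0, 1.0, 1.0]
first = softmax(scores)                      # unshaped attention, step 1
second = shaped_attention(scores, [first])   # shaped attention, step 2
```

In this toy setting, unshaped attention would return the same vector at every step, so consecutive steps overlap completely. With the assumed penalty, the second step shifts most of its mass away from region 0, which lowers `overlap(first, second)`.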
Pages: 7