Image Annotation by Deep Neural Networks with Attention Shaping

被引:0
|
作者
Zheng, Kexin [1 ]
Lv, Shaohe [1 ]
Ma, Fang [1 ]
Chen, Fei [1 ]
Jin, Chi [2 ]
Dou, Yong [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, 109 Deya Rd, Changsha 410073, Hunan, Peoples R China
[2] Univ South China, Sch Comp Sci & Technol, Changsha 410012, Hunan, Peoples R China
关键词
computer vision; image annotation; DNN; attention mechanism;
D O I
10.1117/12.2281747
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Image annotation is a task of assigning semantic labels to an image. Recently, deep neural networks with visual attention have been utilized successfully in many computer vision tasks. In this paper, we show that conventional attention mechanism is easily misled by the salient class, i.e., the attended region always contains part of the image area describing the content of salient class at different attention iterations. To this end, we propose a novel attention shaping mechanism, which aims to maximize the non-overlapping area between consecutive attention processes by taking into account the history of previous attention vectors. Several weighting polices are studied to utilize the history information in different manners. In two benchmark datasets, i.e., PASCAL VOC2012 and MIRFlickr-25k, the average precision is improved by up to 10% in comparison with the state-of-the-art annotation methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Context-aware deep kernel networks for image annotation
    Jiu, Mingyuan
    Sahbi, Hichem
    NEUROCOMPUTING, 2022, 474 : 154 - 167
  • [22] Fast and accurate annotation of acoustic signals with deep neural networks
    Steinfath, Elsa
    Palacios-Munoz, Adrian
    Rottschaefer, Julian R.
    Yuezak, Deniz
    Clemens, Jan
    ELIFE, 2021, 10
  • [23] Hyperspectral Image Super-Resolution via Deep Spatiospectral Attention Convolutional Neural Networks
    Hu, Jin-Fan
    Huang, Ting-Zhu
    Deng, Liang-Jian
    Jiang, Tai-Xiang
    Vivone, Gemine
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7251 - 7265
  • [24] Visual vs internal attention mechanisms in deep neural networks for image classification and object detection
    Obeso, Abraham Montoya
    Benois-Pineau, Jenny
    Vazquez, Mireya Sarai Garcia
    Acosta, Alejandro Alvaro Ramirez
    PATTERN RECOGNITION, 2022, 123
  • [25] Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation
    Bensaci, Ramla
    Khaldi, Belal
    Aiadi, Oussama
    Benchabana, Ayoub
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [26] Attention deep residual networks for MR image analysis
    Mei, Mengqing
    He, Fazhi
    Xue, Shan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (18): : 12957 - 12966
  • [27] Attention deep residual networks for MR image analysis
    Mengqing Mei
    Fazhi He
    Shan Xue
    Neural Computing and Applications, 2023, 35 : 12957 - 12966
  • [28] Transferable graph neural networks with deep alignment attention
    Xie, Ying
    Xu, Rongbin
    Yang, Yun
    INFORMATION SCIENCES, 2023, 643
  • [29] Spatial Channel Attention for Deep Convolutional Neural Networks
    Liu, Tonglai
    Luo, Ronghai
    Xu, Longqin
    Feng, Dachun
    Cao, Liang
    Liu, Shuangyin
    Guo, Jianjun
    MATHEMATICS, 2022, 10 (10)
  • [30] Spatial Pyramid Attention for Deep Convolutional Neural Networks
    Ma, Xu
    Guo, Jingda
    Sansom, Andrew
    McGuire, Mara
    Kalaani, Andrew
    Chen, Qi
    Tang, Sihai
    Yang, Qing
    Fu, Song
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3048 - 3058