Recurrent Models of Visual Attention

Cited by: 0
Authors
Mnih, Volodymyr [1]
Heess, Nicolas [1]
Graves, Alex [1]
Kavukcuoglu, Koray [1]
Affiliations
[1] Google DeepMind, London, England
Keywords
EYE-MOVEMENTS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it performs can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn task-specific policies. We evaluate our model on several image classification tasks, where it significantly outperforms a convolutional neural network baseline on cluttered images, and on a dynamic visual control problem, where it learns to track a simple object without an explicit training signal for doing so.
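The abstract's claim that the computation "can be controlled independently of the input image size" can be illustrated with a small sketch of glimpse extraction. This is not the authors' implementation: the function name `extract_glimpse`, the parameters `size` and `scales`, the zero padding at image borders, and the use of average pooling for the coarser patches are all assumptions made for illustration. The sketch only shows that the array handed to the rest of the model has a fixed shape whatever the image dimensions are.

```python
import numpy as np


def extract_glimpse(image, center, size=8, scales=3):
    """Hypothetical glimpse sensor (illustrative only).

    Crops `scales` square windows centred on `center` from a 2-D image.
    Window k has width size * 2**k and is average-pooled down to
    size x size, so only the smallest window keeps full resolution.
    """
    cy, cx = center
    patches = []
    for k in range(scales):
        factor = 2 ** k
        width = size * factor
        half = width // 2
        # Assumption: zero-pad so windows near the border stay well defined.
        padded = np.pad(image, half, mode="constant")
        # After padding by `half`, the window centred on (cy, cx) in the
        # original image starts at index (cy, cx) in the padded image.
        window = padded[cy:cy + width, cx:cx + width]
        # Average-pool factor x factor blocks down to size x size.
        pooled = window.reshape(size, factor, size, factor).mean(axis=(1, 3))
        patches.append(pooled)
    return np.stack(patches)  # shape: (scales, size, size)


small = extract_glimpse(np.ones((28, 28)), center=(14, 14))
large = extract_glimpse(np.ones((256, 256)), center=(128, 128))
print(small.shape, large.shape)
```

Both calls return arrays of the same shape, so a network consuming the glimpse does the same amount of work per step on a 28 x 28 image as on a 256 x 256 one. Choosing where to look next is the non-differentiable part that the abstract says is trained with reinforcement learning; that step is not shown here.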
Pages: 9