Peripheral Vision Transformer

被引:0
|
作者
Min, Juhong [1 ]
Zhao, Yucheng [2 ,3 ]
Luo, Chong [2 ]
Cho, Minsu [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
[2] Microsoft Res Asia MSRA, Beijing, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human vision possesses a special type of visual processing systems called peripheral vision. Partitioning the entire visual field into multiple contour regions based on the distance to the center of our gaze, the peripheral vision provides us the ability to perceive various visual features at different regions. In this work, we take a biologically inspired approach and explore to model peripheral vision in deep neural networks for visual recognition. We propose to incorporate peripheral position encoding to the multi-head self-attention layers to let the network learn to partition the visual field into diverse peripheral regions given training data. We evaluate the proposed network, dubbed PerViT, on ImageNet-1K and systematically investigate the inner workings of the model for machine perception, showing that the network learns to perceive visual data similarly to the way that human vision does. The performance improvements in image classification over the baselines across different model sizes demonstrate the efficacy of the proposed method.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [42] Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization
    Huang, Huaibo
    Zhou, Xiaoqiang
    He, Ran
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [43] Survey of Vision Transformer in Low-Level Computer Vision
    Zhu, Kai
    Li, Li
    Zhang, Tong
    Jiang, Sheng
    Bie, Yiming
    Computer Engineering and Applications, 2024, 60 (04) : 39 - 56
  • [44] Peripheral vision (The 'Maias')
    Keenan, Margaret
    NEW YORK TIMES BOOK REVIEW, 2007, : 6 - 6
  • [45] VISION - PERIPHERAL AND FOVEAL
    Ferree, C. E.
    PSYCHOLOGICAL BULLETIN, 1913, 10 (03) : 95 - 101
  • [46] VIRTUES OF PERIPHERAL VISION
    BECHINGER, D
    KONGHL, G
    KORNHUBE.HH
    PFLUGERS ARCHIV-EUROPEAN JOURNAL OF PHYSIOLOGY, 1973, 339 : R89 - R89
  • [47] Central and peripheral vision
    Green, FWE
    NATURE, 1932, 129 : 943 - 943
  • [48] Underconfidence in peripheral vision
    Toscani, Matteo
    Mamassian, Pascal
    Valsecchi, Matteo
    JOURNAL OF VISION, 2021, 21 (06): : 1 - 14
  • [49] Peculiarities of peripheral vision
    Stevens, HC
    PSYCHOLOGICAL REVIEW, 1908, 15 (02) : 69 - 93
  • [50] Peripheral vision (The 'Maias')
    Dolan, Kathleen
    NEW YORK TIMES BOOK REVIEW, 2007, : 6 - 6