Improving Visual Representation Learning through Perceptual Understanding

被引:2
|
作者
Tukra, Samyakh [1 ]
Hoffman, Frederick [1 ]
Chatfield, Ken [1 ]
机构
[1] Tractable AI, London, England
关键词
D O I
10.1109/CVPR52729.2023.01392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an extension to masked autoencoders (MAE) which improves on the representations learnt by the model by explicitly encouraging the learning of higher scene-level features. We do this by: (i) the introduction of a perceptual similarity term between generated and real images (ii) incorporating several techniques from the adversarial training literature including multi-scale training and adaptive discriminator augmentation. The combination of these results in not only better pixel reconstruction but also representations which appear to capture better higher-level details within images. More consequentially, we show how our method, Perceptual MAE, leads to better performance when used for downstream tasks outperforming previous methods. We achieve 78.1% top-1 accuracy linear probing on ImageNet-1K and up to 88.1% when fine-tuning, with similar results for other downstream tasks, all without use of additional pre-trained models or data.
引用
收藏
页码:14486 / 14495
页数:10
相关论文
共 50 条
  • [41] Understanding brain plasticity in perceptual learning
    Anja Stemme
    Gustavo Deco
    Elmar Lang
    BMC Neuroscience, 10 (Suppl 1)
  • [42] Vector inversion is supported by a perceptual representation of visual space
    Bingley, Taryn
    Mason, Janell
    Heath, Matthew
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2009, 31 : S48 - S48
  • [43] Perceptual learning of visual letter recognition
    Huckauf, A
    PERCEPTION, 2002, 31 : 96 - 96
  • [44] Neuronal mechanisms of visual perceptual learning
    Kumano, Hironori
    Uka, Takanori
    BEHAVIOURAL BRAIN RESEARCH, 2013, 249 : 75 - 80
  • [45] Perceptual learning in visual conjunction search
    Lobley, K
    Walsh, V
    PERCEPTION, 1998, 27 (10) : 1245 - 1255
  • [46] Perceptual learning and the visual control of braking
    Fajen, Brett R.
    PERCEPTION & PSYCHOPHYSICS, 2008, 70 (06): : 1117 - 1129
  • [47] Advances in visual perceptual learning and plasticity
    Yuka Sasaki
    Jose E. Nanez
    Takeo Watanabe
    Nature Reviews Neuroscience, 2010, 11 : 53 - 60
  • [48] Current directions in visual perceptual learning
    Lu, Zhong-Lin
    Dosher, Barbara Anne
    NATURE REVIEWS PSYCHOLOGY, 2022, 1 (11): : 654 - 668
  • [49] Perceptual learning and the visual control of braking
    Brett R. Fajen
    Perception & Psychophysics, 2008, 70 : 1117 - 1129
  • [50] Stereoscopic Visual Perceptual Learning in Seniors
    Erbes, Sabine
    Michelson, Georg
    GERIATRICS, 2021, 6 (03)