Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [31] Top-down Visual Saliency Guided by Captions
    Ramanishka, Vasili
    Das, Abir
    Zhang, Jianming
    Saenko, Kate
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3135 - 3144
  • [32] Sources of Top-Down Control in Visual Search
    Weidner, Ralph
    Krummenacher, Joseph
    Reimann, Brit
    Mueller, Hermann J.
    Fink, Gereon R.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2009, 21 (11) : 2100 - 2113
  • [33] Top-down cortical influences in visual expectation
    Bressler, Steven L.
    Richter, Craig G.
    Chen, Yonghong
    Ding, Mingzhou
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 188 - +
  • [34] The limits of top-down control of visual attention
    Van der Stigchel, Stefan
    Belopolsky, Artem V.
    Peters, Judith C.
    Wijnen, Jasper G.
    Meeter, Martijn
    Theeuwes, Jan
    ACTA PSYCHOLOGICA, 2009, 132 (03) : 201 - 212
  • [35] Top-down mechanisms and conscious visual experience
    Grieco, A.
    Oliveira, A.
    PERCEPTION, 2012, 41 : 214 - 215
  • [36] On the limits of top-down control of visual selection
    Theeuwes, Jan
    Van der Burg, Erik
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2011, 73 (07) : 2092 - 2103
  • [37] Functional connectivity during top-down modulation of visual short-term memory representations
    Kuo, Bo-Cheng
    Yeh, Yei-Yu
    Chen, Anthony J. -W.
    D'Esposito, Mark
    NEUROPSYCHOLOGIA, 2011, 49 (06) : 1589 - 1596
  • [38] Top-down resolution of visual ambiguity - knowledge from the future or footprints from the past?
    Kornmeier, Juergen
    Bhatia, Kriti
    Joos, Ellen
    PLOS ONE, 2021, 16 (10):
  • [39] Top-down mass spectrometry on low-resolution instruments: Characterization of phosphopantetheinylated carrier domains in polyketide and non-ribosomal biosynthetic pathways
    Meluzzi, Dario
    Zheng, Wei Hao
    Hensler, Mary
    Nizet, Victor
    Dorrestein, Pieter C.
    BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2008, 18 (10) : 3107 - 3111
  • [40] The new generation of top-down design methodology
    Fujimoto, T
    Yamaguchi, M
    Yamanouchi, T
    Ohnishi, M
    Takahashi, M
    Kambe, T
    SHARP TECHNICAL JOURNAL, 1997, (67): : 25 - 30