Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [41] Top-down influences on stereoscopic depth-perception
    Bulthoff, I
    Bulthoff, H
    Sinha, P
    NATURE NEUROSCIENCE, 1998, 1 (03) : 254 - 257
  • [42] TOP-DOWN, FAST-SAME, AND ACOUSTIC PERCEPTION
    VERLEGER, R
    BEHAVIORAL AND BRAIN SCIENCES, 1990, 13 (02) : 257 - 257
  • [43] Brain mechanisms associated with top-down processes in perception
    Frith, C
    Dolan, RJ
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1997, 352 (1358) : 1221 - 1230
  • [44] Top-down influences on stereoscopic depth-perception
    Isabelle Bülthoff
    Heinrich Bülthoff
    Pawan Sinha
    Nature Neuroscience, 1998, 1 : 254 - 257
  • [45] Top-down influence of recognition on stereoscopic depth perception
    Bulthoff, I
    Sinha, P
    Bulthoff, HH
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1996, 37 (03) : 5168 - 5168
  • [46] Top-down modulation: The crossroads of perception, attention and memory
    Gazzaley, Adam
    HUMAN VISION AND ELECTRONIC IMAGING XV, 2010, 7527
  • [47] Top-down effect of body representation on pain perception
    Matsumuro, Miki
    Ma, Ning
    Miura, Yuki
    Shibata, Fumihisa
    Kimura, Asako
    PLOS ONE, 2022, 17 (05):
  • [48] A Top-Down Cortical Circuit for Accurate Sensory Perception
    Manita, Satoshi
    Suzuki, Takayuki
    Homma, Chihiro
    Matsumoto, Takashi
    Odagawa, Maya
    Yamada, Kazuyuki
    Ota, Keisuke
    Matsubara, Chie
    Inutsuka, Ayumu
    Sato, Masaaki
    Ohkura, Masamichi
    Yamanaka, Akihiro
    Yanagawa, Yuchio
    Nakai, Junichi
    Hayashi, Yasunori
    Larkum, Matthew E.
    Murayama, Masanori
    NEURON, 2015, 86 (05) : 1304 - 1316
  • [49] A Top-down Perception Approach for Vehicle Pose Estimation
    Bernay-Angeletti, Coralie
    Chabot, Florian
    Aynaud, Claude
    Aufrere, Romuald
    Chapuis, Roland
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 2240 - 2245
  • [50] FUNCTIONAL REPRESENTATIONS COMMON TO VISUAL-PERCEPTION AND IMAGINATION
    PODGORNY, P
    SHEPARD, RN
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1978, 4 (01) : 21 - 35