Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [21] A top-down model for language perception and production
    Ingvar, DH
    BASIC MECHANISMS IN COGNITION AND LANGUAGE: WITH SPECIAL REFERENCE TO PHONOLOGICAL PROBLEMS IN DYSLEXIA, 1998, 70 : 109 - 120
  • [22] TOP-DOWN ISOLATION IMPROVES POWER ICS
    GALLAGHER, RT
    ELECTRONICS, 1982, 55 (18): : 84 - &
  • [23] TOP-DOWN MODULATION OF ATTENTION AND PERCEPTION BY EMOTION
    Mohanty, Aprajita
    Sussman, Tamara J.
    Jin, Jingwen
    Szekely, Akos
    PSYCHOPHYSIOLOGY, 2014, 51 : S6 - S6
  • [24] Neural mechanisms of top-down divided and selective spatial attention in visual and auditory perception
    Zhongtian Guan
    Meng Lin
    Qiong Wu
    Jinglong Wu
    Kewei Chen
    Hongbin Han
    Dehua Chui
    Xu Zhang
    Chunlin Li
    BrainScienceAdvances, 2023, 9 (02) : 95 - 113
  • [25] Electrophysiological marker of bottom-up and top-down processing in multistable visual perception
    Strüber, D
    Isoglu-Alkaç, Ü
    JOURNAL OF PSYCHOPHYSIOLOGY, 2005, 19 (02) : 148 - 148
  • [26] Inhibitory microcircuits for top-down plasticity of sensory representations
    Katharina Anna Wilmes
    Claudia Clopath
    Nature Communications, 10
  • [27] Inhibitory microcircuits for top-down plasticity of sensory representations
    Wilmes, Katharina Anna
    Clopath, Claudia
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [28] Top-down cortical influences in visual expectation
    Bressler, Steven L.
    Richter, Craig G.
    Chen, Yonghong
    Ding, Mingzhou
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 384 - +
  • [29] The contribution of top-down predictions to visual recognition
    Bar, M
    PERCEPTION, 2005, 34 : 47 - 48
  • [30] On the limits of top-down control of visual selection
    Jan Theeuwes
    Erik Van der Burg
    Attention, Perception, & Psychophysics, 2011, 73 : 2092 - 2103