Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条