Unsupervised neural network models of the ventral visual stream

被引:171
|
作者
Zhuang, Chengxu [1 ]
Yan, Siming [2 ]
Nayebi, Aran [3 ]
Schrimpf, Martin [4 ]
Frank, Michael C. [1 ]
DiCarlo, James J. [4 ]
Yamins, Daniel L. K. [1 ,5 ,6 ]
机构
[1] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[3] Stanford Univ, Neurosci PhD Program, Stanford, CA 94305 USA
[4] MIT, Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Wu Tsai Neurosci Inst, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
ventral visual stream; deep neural networks; unsupervised algorithms; RECEPTIVE-FIELDS; AREA V4; RECOGNITION; INFANTS; INFORMATION; SELECTIVITY; FRAMEWORK; RESPONSES; FEATURES; PATHWAY;
D O I
10.1073/pnas.2014196118
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep neural networks currently provide the best quantitative models of the response patterns of neurons throughout the primate ventral visual stream. However, such networks have remained implausible as a model of the development of the ventral stream, in part because they are trained with supervised methods requiring many more labels than are accessible to infants during development. Here, we report that recent rapid progress in unsupervised learning has largely closed this gap. We find that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods and that the mapping of these neural network models' hidden layers is neuroanatomically consistent across the ventral stream. Strikingly, we find that these methods produce brainlike representations even when trained solely with real human child developmental data collected from head-mounted cameras, despite the fact that these datasets are noisy and limited. We also find that semisupervised deep contrastive embeddings can leverage small numbers of labeled examples to produce representations with substantially improved error-pattern consistency to human behavior. Taken together, these results illustrate a use of unsupervised learning to provide a quantitative model of a multiarea cortical brain system and present a strong candidate for a biologically plausible computational theory of primate sensory learning.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
    Khassanov, Yerbolat
    Chng, Eng Siong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3343 - 3347
  • [42] A ventral visual stream reading center independent of sensory modality and visual experience
    Reich, L.
    Striem-Amit, E.
    Szwed, M.
    Cohen, L.
    Amedi, A.
    JOURNAL OF MOLECULAR NEUROSCIENCE, 2012, 48 : S95 - S95
  • [43] Mapping visual symbols onto spoken language along the ventral visual stream
    Taylor, J. S. H.
    Davis, Matthew H.
    Rastle, Kathleen
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (36) : 17723 - 17728
  • [44] Integrative and distinctive coding of visual and conceptual object features in the ventral visual stream
    Martin, Chris B.
    Douglas, Danielle
    Newsome, Rachel N.
    Man, Louisa L. Y.
    Barense, Morgan D.
    ELIFE, 2018, 7
  • [45] Neural network models for DMT-induced visual hallucinations
    Schartner, Michael M.
    Timmermann, Christopher
    NEUROSCIENCE OF CONSCIOUSNESS, 2020, 6 (01)
  • [46] Neural network models as evidence for different types of visual representations
    Kosslyn, SM
    Chabris, CF
    Baker, DP
    COGNITIVE SCIENCE, 1995, 19 (04) : 575 - 579
  • [47] High-Level Visual Encoding Model Framework with Hierarchical Ventral Stream-Optimized Neural Networks
    Xiao, Wulue
    Li, Jingwei
    Zhang, Chi
    Wang, Linyuan
    Chen, Panpan
    Yu, Ziya
    Tong, Li
    Yan, Bin
    BRAIN SCIENCES, 2022, 12 (08)
  • [48] Neural coding in the dorsal visual stream
    Chinellato, Eris
    del Pobil, Angel P.
    FROM ANIMALS TO ANIMATS 10, PROCEEDINGS, 2008, 5040 : 230 - 239
  • [49] Color discrimination involves ventral and dorsal stream visual areas
    Claeys, KG
    Dupont, P
    Cornette, L
    Sunaert, S
    Van Hecke, P
    De Schutter, E
    Orban, GA
    CEREBRAL CORTEX, 2004, 14 (07) : 803 - 822
  • [50] Exploring unsupervised textual representations generated by neural language models in the context of automatic tweet stream summarization
    Dusart, Alexis
    Pinel-Sauvagnat, Karen
    Hubert, Gilles
    ONLINE SOCIAL NETWORKS AND MEDIA, 2023, 37