Mid-level visual features underlie the high-level categorical organization of the ventral stream

被引:136
|
作者
Long, Bria [1 ,2 ]
Yu, Chen-Ping [1 ,3 ]
Konkle, Talia [1 ]
机构
[1] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA
[2] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[3] Phiar Technol Inc, Palo Alto, CA 94303 USA
关键词
ventral stream organization; mid-level features; object recognition; fMRI; deep neural networks; FUSIFORM FACE AREA; NEURAL REPRESENTATIONS; CONGENITALLY BLIND; TEMPORAL CORTEX; OBJECT CATEGORY; SHAPE; SELECTIVITY; SIZE; PERCEPTION; SIMILARITY;
D O I
10.1073/pnas.1719616115
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human object-selective cortex shows a large-scale organization characterized by the high-level properties of both animacy and object size. To what extent are these neural responses explained by primitive perceptual features that distinguish animals from objects and big objects from small objects? To address this question, we used a texture synthesis algorithm to create a class of stimuli-texforms-which preserve some mid-level texture and form information from objects while rendering them unrecognizable. We found that unrecognizable texforms were sufficient to elicit the large-scale organizations of object-selective cortex along the entire ventral pathway. Further, the structure in the neural patterns elicited by texforms was well predicted by curvature features and by intermediate layers of a deep convolutional neural network, supporting the mid-level nature of the representations. These results provide clear evidence that a substantial portion of ventral stream organization can be accounted for by coarse texture and form information without requiring explicit recognition of intact objects.
引用
收藏
页码:E9015 / E9024
页数:10
相关论文
共 50 条
  • [41] Fused One-vs-All Mid-Level Features for Fine-Grained Visual Categorization
    Zhang, Xiaopeng
    Xiong, Hongkai
    Zhou, Wengang
    Tian, Qi
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 287 - 296
  • [42] The Functional Organization of High-Level Visual Cortex Determines the Representation of Complex Visual Stimuli
    Kliger, Libi
    Yovel, Galit
    JOURNAL OF NEUROSCIENCE, 2020, 40 (39): : 7545 - 7558
  • [43] Retinotopic organization of high-level aftereffects
    Attarian, S.
    Ezzati, A.
    Talehy-Moineddin, S.
    Sanayeid, M.
    PERCEPTION, 2009, 38 : 184 - 184
  • [44] Learning part-based mid-level representation for visual recognition
    Yuan, Baodi
    Tu, Jian
    Zhao, Rui-Wei
    Zheng, Yingbin
    Jiang, Yu-Gang
    NEUROCOMPUTING, 2018, 275 : 2126 - 2136
  • [45] Pedestrian re-identification based on fusing low-level and mid-level features
    Wang Li
    CHINESE OPTICS, 2016, 9 (05): : 540 - 546
  • [46] Improving Streaming Video Segmentation with Early and Mid-Level Visual Processing
    Tripathi, Subarna
    Hwang, Youngbae
    Belongie, Serge
    Truong Nguyen
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 477 - 484
  • [47] Crowd Behavior Analysis Using Local Mid-Level Visual Descriptors
    Fradi, Hajer
    Luvison, Bertrand
    Quoc Cuong Pham
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (03) : 589 - 602
  • [48] Visual Place Recognition by spatial matching of high-level CNN features
    Camara, Luis G.
    Preucil, Libor
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 133
  • [49] Learning Mid-Level Features from Object Hierarchy for Image Classification
    Albaradei, Somayah
    Wang, Yang
    Cao, Liangliang
    Li, Li-Jia
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 235 - 240
  • [50] EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis
    Du Tran
    Lorenzo Torresani
    International Journal of Computer Vision, 2016, 119 : 239 - 253