Mid-level visual features underlie the high-level categorical organization of the ventral stream

被引:136
|
作者
Long, Bria [1 ,2 ]
Yu, Chen-Ping [1 ,3 ]
Konkle, Talia [1 ]
机构
[1] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA
[2] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[3] Phiar Technol Inc, Palo Alto, CA 94303 USA
关键词
ventral stream organization; mid-level features; object recognition; fMRI; deep neural networks; FUSIFORM FACE AREA; NEURAL REPRESENTATIONS; CONGENITALLY BLIND; TEMPORAL CORTEX; OBJECT CATEGORY; SHAPE; SELECTIVITY; SIZE; PERCEPTION; SIMILARITY;
D O I
10.1073/pnas.1719616115
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human object-selective cortex shows a large-scale organization characterized by the high-level properties of both animacy and object size. To what extent are these neural responses explained by primitive perceptual features that distinguish animals from objects and big objects from small objects? To address this question, we used a texture synthesis algorithm to create a class of stimuli-texforms-which preserve some mid-level texture and form information from objects while rendering them unrecognizable. We found that unrecognizable texforms were sufficient to elicit the large-scale organizations of object-selective cortex along the entire ventral pathway. Further, the structure in the neural patterns elicited by texforms was well predicted by curvature features and by intermediate layers of a deep convolutional neural network, supporting the mid-level nature of the representations. These results provide clear evidence that a substantial portion of ventral stream organization can be accounted for by coarse texture and form information without requiring explicit recognition of intact objects.
引用
收藏
页码:E9015 / E9024
页数:10
相关论文
共 50 条
  • [21] High-Level Visual Features for Underwater Place Recognition
    Li, Jie
    Eustice, Ryan M.
    Johnson-Roberson, Matthew
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3652 - 3659
  • [22] Action recognition by learning mid-level motion features
    Fathi, Alireza
    Mori, Greg
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3064 - 3071
  • [23] Mid-level motion features for the recognition of biological movements
    Sigala, RA
    Serre, T
    Poggio, T
    Giese, MA
    Casile, A
    PERCEPTION, 2005, 34 : 64 - 64
  • [24] Supervised Mid-Level Features for Word Image Representation
    Gordo, Albert
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2956 - 2964
  • [25] Indoor Scene Classification Based on Mid-Level Features
    Zhang, Qiang
    Yang, Jinfu
    Zhang, Shanshan
    INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 1, 2017, 454 : 235 - 242
  • [26] Transfer of perceptual learning in a hierarchy of mid-level visual tasks
    Peirce, J. W.
    McGovern, D. P.
    Webb, B. S.
    PERCEPTION, 2010, 39 : 64 - 64
  • [27] Mining Mid-level Visual Patterns with Deep CNN Activations
    Li, Yao
    Liu, Lingqiao
    Shen, Chunhua
    van den Hengel, Anton
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 121 (03) : 344 - 364
  • [28] The timescale of adaptation at early and mid-level stages of visual processing
    Mei, Gaoxing
    Dong, Xue
    Bao, Min
    JOURNAL OF VISION, 2017, 17 (01):
  • [29] Object Pose Estimation using Mid-level Visual Representations
    Nejatishahidin, Negar
    Fayyazsanavi, Pooya
    Kosecka, Jana
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 13105 - 13111
  • [30] A mid-level visual concept generation framework for sports analysis
    Tong, XF
    Duan, LY
    Lu, HQ
    Xu, CS
    Tian, Q
    Jin, JS
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 646 - 649