Sparse Spatial Coding: A Novel Approach to Visual Recognition

被引:10
|
作者
Oliveira, Gabriel Leivas [1 ]
Nascimento, Erickson R. [2 ]
Vieira, Antonio Wilson [3 ]
Montenegro Campos, Mario Fernando [2 ]
机构
[1] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
[2] Univ Fed Minas Gerais, Dept Comp Sci, BR-31270901 Belo Horizonte, MG, Brazil
[3] Univ Estadual Montes Claros, Dept Math & Comp Sci, BR-39440 Montes Claros, Brazil
关键词
Object recognition; image coding; learning (artificial intelligence); computer vision; vision and scene undertanding; sparse coding; IMAGE; REPRESENTATIONS; EFFICIENT;
D O I
10.1109/TIP.2014.2317988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful image-based object recognition techniques have been constructed founded on powerful techniques such as sparse representation, in lieu of the popular vector quantization approach. However, one serious drawback of sparse space-based methods is that local features that are quite similar can be quantized into quite distinct visual words. We address this problem with a novel approach for object recognition, called sparse spatial coding, which efficiently combines a sparse coding dictionary learning and spatial constraint coding stage. We performed experimental evaluation using the Caltech 101, Caltech 256, Corel 5000, and Corel 10000 data sets, which were specifically designed for object recognition evaluation. Our results show that our approach achieves high accuracy comparable with the best single feature method previously published on those databases. Our method outperformed, for the same bases, several multiple feature methods, and provided equivalent, and in few cases, slightly less accurate results than other techniques specifically designed to that end. Finally, we report state-of-the-art results for scene recognition on COsy Localization Dataset (COLD) and high performance results on the MIT-67 indoor scene recognition, thus demonstrating the generalization of our approach for such tasks.
引用
收藏
页码:2719 / 2731
页数:13
相关论文
共 50 条
  • [31] Sparse coding in striate and extrastriate visual cortex
    Willmore, Ben D. B.
    Mazer, James A.
    Gallant, Jack L.
    JOURNAL OF NEUROPHYSIOLOGY, 2011, 105 (06) : 2907 - 2919
  • [32] VISUAL TRACKING VIA ORTHOGONAL SPARSE CODING
    Wang, Jing
    Wang, Yiyang
    Liu, Risheng
    Su, Zhixun
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3817 - 3821
  • [33] Trimmed sparse coding for robust face recognition
    Dong, Boxiang
    Mi, Jian-xun
    ELECTRONICS LETTERS, 2017, 53 (22) : 1473 - 1474
  • [34] Face Recognition via Local Sparse Coding
    Theodorakopoulos, Ilias
    Rigas, Ioannis
    Economou, George
    Fotopoulos, Spiros
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1647 - 1652
  • [35] Facial Expression Recognition Using Sparse Coding
    Abdolali, Maryam
    Rahmati, Mohammad
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 150 - 153
  • [36] Collaborative Sparse Coding for Multiview Action Recognition
    Wang, Wei
    Yan, Yan
    Zhang, Luming
    Hong, Richang
    Sebe, Nicu
    IEEE MULTIMEDIA, 2016, 23 (04) : 80 - 87
  • [37] Facial expression recognition using sparse coding
    Abdolali, Maryam
    Rahmati, Mohammad
    Iranian Conference on Machine Vision and Image Processing, MVIP, 2013, : 150 - 153
  • [38] Decoupling Sparse Coding with Fusion of Fisher Vectors and Scalable SVMs for Large-scale Visual Recognition
    Ji, Zhengping
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 450 - 457
  • [39] A Collaborative Neurodynamic Approach to Sparse Coding
    Che, Hangjun
    Wang, Jun
    Zhang, Wei
    ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I, 2019, 11554 : 454 - 462
  • [40] FLEXIBLE CODING IN VISUAL WORD RECOGNITION
    PUGH, K
    REXER, K
    KATZ, L
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1992, 30 (06) : 460 - 460