Sparse Spatial Coding: A Novel Approach to Visual Recognition

被引:10
|
作者
Oliveira, Gabriel Leivas [1 ]
Nascimento, Erickson R. [2 ]
Vieira, Antonio Wilson [3 ]
Montenegro Campos, Mario Fernando [2 ]
机构
[1] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
[2] Univ Fed Minas Gerais, Dept Comp Sci, BR-31270901 Belo Horizonte, MG, Brazil
[3] Univ Estadual Montes Claros, Dept Math & Comp Sci, BR-39440 Montes Claros, Brazil
关键词
Object recognition; image coding; learning (artificial intelligence); computer vision; vision and scene undertanding; sparse coding; IMAGE; REPRESENTATIONS; EFFICIENT;
D O I
10.1109/TIP.2014.2317988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful image-based object recognition techniques have been constructed founded on powerful techniques such as sparse representation, in lieu of the popular vector quantization approach. However, one serious drawback of sparse space-based methods is that local features that are quite similar can be quantized into quite distinct visual words. We address this problem with a novel approach for object recognition, called sparse spatial coding, which efficiently combines a sparse coding dictionary learning and spatial constraint coding stage. We performed experimental evaluation using the Caltech 101, Caltech 256, Corel 5000, and Corel 10000 data sets, which were specifically designed for object recognition evaluation. Our results show that our approach achieves high accuracy comparable with the best single feature method previously published on those databases. Our method outperformed, for the same bases, several multiple feature methods, and provided equivalent, and in few cases, slightly less accurate results than other techniques specifically designed to that end. Finally, we report state-of-the-art results for scene recognition on COsy Localization Dataset (COLD) and high performance results on the MIT-67 indoor scene recognition, thus demonstrating the generalization of our approach for such tasks.
引用
收藏
页码:2719 / 2731
页数:13
相关论文
共 50 条
  • [21] ACTION RECOGNITION WITH APPROXIMATE SPARSE CODING
    Wang, Yu
    Kato, Lien
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 770 - 774
  • [22] Language Recognition via Sparse Coding
    Gwon, Youngjune L.
    Campbell, William M.
    Sturim, Douglas
    Kung, H. T.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2920 - 2924
  • [23] Statistics on Temporal Changes of Sparse Coding Coefficients in Spatial Pyramids for Human Action Recognition
    Li, Yang
    Ye, Junyong
    Wang, Tongqing
    Huang, Shijian
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (09) : 1711 - 1714
  • [24] Palmprint Recognition via Sparse Coding Spatial Pyramid Matching Representation of SIFT Feature
    Liu, Ligang
    Zhang, Jianxin
    Yang, Aoqi
    BIOMETRIC RECOGNITION, 2016, 9967 : 235 - 243
  • [25] A novel model of primary visual cortex based on biologically plausible sparse coding
    Rego, Jocelyn
    Watkins, Yijing
    Kenyon, Garrett
    Kim, Edward
    Teti, Michael
    APPLICATIONS OF MACHINE LEARNING 2023, 2023, 12675
  • [26] Forming a sparse representation for visual place recognition using a neurorobotic approach
    Colomer, Sylvain
    Cuperlier, Nicolas
    Bresson, Guillaume
    Romain, Olivier
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 3002 - 3009
  • [27] A Novel Speech Emotion Recognition Method via Transfer PCA and Sparse Coding
    Song, Peng
    Zheng, Wenming
    Liu, Jingjing
    Li, Jing
    Zhang, Xinran
    BIOMETRIC RECOGNITION, CCBR 2015, 2015, 9428 : 393 - 400
  • [28] A Multiattribute Sparse Coding Approach for Action Recognition From a Single Unknown Viewpoint
    Su, Te-Feng
    Chiang, Chen-Kuo
    Lai, Shang-Hong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (08) : 1476 - 1489
  • [29] CODING LEVELS IN VISUAL RECOGNITION
    GEISSLER, HG
    ZEITSCHRIFT FUR PSYCHOLOGIE, 1979, 187 (02): : 206 - 214
  • [30] Sparse Coding and Counting for Robust Visual Tracking
    Liu, Risheng
    Wang, Jing
    Shang, Xiaoke
    Wang, Yiyang
    Su, Zhixun
    Cai, Yu
    PLOS ONE, 2016, 11 (12):