Revisiting Sparse Convolutional Model for Visual Recognition

被引:0
|
作者
Dai, Xili [1 ]
Li, Mingyang [2 ]
Zhai, Pengyuan [3 ]
Tong, Shengbang [4 ]
Gao, Xingjian [4 ]
Huang, Shao-Lun [2 ]
Zhu, Zhihui [5 ]
You, Chong [4 ]
Ma, Yi [2 ,4 ]
机构
[1] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[2] Tsinghua Univ, TBSI, Shenzhen, Peoples R China
[3] Harvard Univ, Cambridge, MA 02138 USA
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
[5] Ohio State Univ, Columbus, OH 43210 USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and they are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits the sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in replacements of standard convolutional layers in conventional deep neural networks. We show that such models have equally strong empirical performance on CIFAR-10, CIFAR-100 and ImageNet datasets when compared to conventional neural networks. By leveraging stable recovery property of sparse modeling, we further show that such models can be much more robust to input corruptions as well as adversarial perturbations in testing through a simple proper trade-off between sparse regularization and data reconstruction terms. Source code can be found at https://github.com/Delay- Xili/SDNet.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Fusion that matters: convolutional fusion networks for visual recognition
    Liu, Yu
    Guo, Yanming
    Georgiou, Theodoros
    Lew, Michael S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29407 - 29434
  • [32] Fusion that matters: convolutional fusion networks for visual recognition
    Yu Liu
    Yanming Guo
    Theodoros Georgiou
    Michael S. Lew
    Multimedia Tools and Applications, 2018, 77 : 29407 - 29434
  • [33] Sparse Embedding Visual Attention Model
    Zhao, Cairong
    Liu, ChuanCai
    Lai, Zhihui
    Sui, Yue
    Li, Zuoyong
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 823 - +
  • [34] Visual tracking using global sparse coding and local convolutional features
    Zeng, Xianyou
    Xu, Long
    Ma, Lin
    Zhao, Ruizhen
    Cen, Yigang
    DIGITAL SIGNAL PROCESSING, 2018, 72 : 115 - 125
  • [35] k3-Sparse Graph Convolutional Networks for Face Recognition
    Wu, Renjie
    Kamata, Sei-ichiro
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 174 - 179
  • [36] Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks
    Kymn, Christopher J.
    Mazelet, Sonia
    Ng, Annabel
    Kleyko, Denis
    Olshausen, Bruno A.
    2024 NEURO INSPIRED COMPUTATIONAL ELEMENTS CONFERENCE, NICE, 2024,
  • [37] Adaptive sparse triple convolutional attention for enhanced visual question answering
    Wang, Ronggui
    Chen, Hong
    Yang, Juan
    Xue, Lixia
    VISUAL COMPUTER, 2025,
  • [38] Convolutional Sparse Feature Descriptor for Object Recognition in CIFAR-10
    Carvalho, Edigleison Francelino
    Engel, Paulo Martins
    2013 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2013, : 131 - 135
  • [39] Convolutional Neural Networks Based on Sparse Coding for Human Postures Recognition
    Yang, Ning
    Li, Yawei
    Yang, Yuliang
    Zhu, Mengyu
    AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462
  • [40] Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition
    Sohn, Kihyuk
    Jung, Dae Yon
    Lee, Honglak
    Hero, Alfred O., III
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2643 - 2650