Multilevel Context Representation for Improving Object Recognition

被引:3
|
作者
Koelsch, Andreas [1 ,2 ]
Afzal, Muhammad Zeshan [1 ,2 ]
Liwicki, Marcus [1 ,2 ,3 ]
机构
[1] Univ Kaiserslautern, MindGarage, Kaiserslautern, Germany
[2] Insiders Technol GmbH, Kaiserslautern, Germany
[3] Univ Fribourg, Fribourg, Switzerland
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 5 | 2017年
关键词
D O I
10.1109/ICDAR.2017.322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose the combined usage of low-and high-level blocks of convolutional neural networks (CNNs) for improving object recognition. While recent research focused on either propagating the context from all layers, e.g. ResNet, (including the very low-level layers) or having multiple loss layers (e.g. GoogLeNet), the importance of the features close to the higher layers is ignored. This paper postulates that the use of context closer to the high-level layers provides the scale and translation invariance and works better than using the top layer only. In particular, we extend AlexNet and GoogLeNet by additional connections in the top n layers. In order to demonstrate the effectiveness of the proposed approach, we evaluated it on the standard ImageNet task. The relative reduction of the classification error is around 1 - 2% without affecting the computational cost. Furthermore, we show that this approach is orthogonal to typical test data augmentation techniques, as recently introduced by Szegedy et al. (leading to a runtime reduction of 144 during test time).
引用
收藏
页码:10 / 15
页数:6
相关论文
共 50 条
  • [21] Sparse representation for coarse and fine object recognition
    Pham, TV
    Smeulders, AWM
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (04) : 555 - 567
  • [22] Ketamine changes the neural representation of object recognition
    van Loon, A. M.
    Scholte, H.
    Fahrenfort, J. J.
    van der Velde, B.
    Lirk, P. B.
    Vulink, N. C.
    Hohmann, M. W.
    Lamme, V.
    PERCEPTION, 2013, 42 : 162 - 162
  • [23] The role of color diagnosticity in object recognition and representation
    Therriault, David J.
    Yaxley, Richard H.
    Zwaan, Rolf A.
    COGNITIVE PROCESSING, 2009, 10 (04) : 335 - 342
  • [24] The role of color diagnosticity in object recognition and representation
    David J. Therriault
    Richard H. Yaxley
    Rolf A. Zwaan
    Cognitive Processing, 2009, 10 : 335 - 342
  • [25] Tree representation for image matching and object recognition
    Mattes, J
    Richard, M
    Demongeot, J
    DISCRETE GEOMETRY FOR COMPUTER IMAGERY, 1999, 1568 : 298 - 309
  • [26] Neural representation for object recognition in inferotemporal cortex
    Lehky, Sidney R.
    Tanaka, Keiji
    CURRENT OPINION IN NEUROBIOLOGY, 2016, 37 : 23 - 35
  • [27] COMBINATORIAL OPTIMIZATION FOR IMAGE REPRESENTATION AND OBJECT RECOGNITION
    CHRISTOFIDES, N
    CONSTANTINIDES, AG
    TWENTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2: CONFERENCE RECORD, 1989, : 624 - 627
  • [28] Improving Activity Recognition with Context Information
    Zhang, Licheng
    Wu, Xihong
    Luo, Dingsheng
    2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2015, : 1241 - 1246
  • [29] Object Recognition Based on Improved Context Model
    WEN Chenglin
    ZHOU Guangfu
    GAO Jingli
    LI Hongwei
    XU Xiaobin
    ChineseJournalofElectronics, 2018, 27 (03) : 573 - 581
  • [30] Context-Dependent Object Proposal and Recognition
    Chang, Ray-, I
    Ting, Chao-Lung
    Wu, Syuan-Yi
    Yin, Peng-Yeng
    SYMMETRY-BASEL, 2020, 12 (10): : 1 - 20