Multilevel Context Representation for Improving Object Recognition

被引:3
|
作者
Koelsch, Andreas [1 ,2 ]
Afzal, Muhammad Zeshan [1 ,2 ]
Liwicki, Marcus [1 ,2 ,3 ]
机构
[1] Univ Kaiserslautern, MindGarage, Kaiserslautern, Germany
[2] Insiders Technol GmbH, Kaiserslautern, Germany
[3] Univ Fribourg, Fribourg, Switzerland
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 5 | 2017年
关键词
D O I
10.1109/ICDAR.2017.322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose the combined usage of low-and high-level blocks of convolutional neural networks (CNNs) for improving object recognition. While recent research focused on either propagating the context from all layers, e.g. ResNet, (including the very low-level layers) or having multiple loss layers (e.g. GoogLeNet), the importance of the features close to the higher layers is ignored. This paper postulates that the use of context closer to the high-level layers provides the scale and translation invariance and works better than using the top layer only. In particular, we extend AlexNet and GoogLeNet by additional connections in the top n layers. In order to demonstrate the effectiveness of the proposed approach, we evaluated it on the standard ImageNet task. The relative reduction of the classification error is around 1 - 2% without affecting the computational cost. Furthermore, we show that this approach is orthogonal to typical test data augmentation techniques, as recently introduced by Szegedy et al. (leading to a runtime reduction of 144 during test time).
引用
收藏
页码:10 / 15
页数:6
相关论文
共 50 条
  • [41] The representation of shape in the context of visual object categorization tasks
    Op de Beeck, H
    Béatse, E
    Wagemans, J
    Sunaert, S
    Van Hecke, P
    NEUROIMAGE, 2000, 12 (01) : 28 - 40
  • [42] Bone tumor recognition strategy based on object region and context representation in medical decision-making system
    Liu, Yueguang
    Liu, Jun
    Dai, Tingyi
    Gou, Fangfang
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [43] The role of the dorsal dentate gyrus in object and object-context recognition
    Dees, Richard L.
    Kesner, Raymond P.
    NEUROBIOLOGY OF LEARNING AND MEMORY, 2013, 106 : 112 - 117
  • [44] 3D object recognition: Representation and matching
    Anil K. Jain
    Chitra Dorai
    Statistics and Computing, 2000, 10 : 167 - 182
  • [45] Brain Functional Representation of Highly Occluded Object Recognition
    Li, Bao
    Zhang, Chi
    Cao, Long
    Chen, Panpan
    Liu, Tianyuan
    Gao, Hui
    Wang, Linyuan
    Yan, Bin
    Tong, Li
    BRAIN SCIENCES, 2023, 13 (10)
  • [46] 3D object recognition: Representation and matching
    Jain, AK
    Dorai, C
    STATISTICS AND COMPUTING, 2000, 10 (02) : 167 - 182
  • [47] Efficient Object Recognition Method Based on Hierarchical Representation
    Gu, Chao
    Huang, Weiguo
    Tao, Jin
    Shang, Li
    Zhu, Z. K.
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 358 - 363
  • [48] Space-variant representation for active object recognition
    Exel, S
    Pessoa, L
    SIBGRAPI '98 - INTERNATIONAL SYMPOSIUM ON COMPUTER GRAPHICS, IMAGE PROCESSING, AND VISION, PROCEEDINGS, 1998, : 233 - 240
  • [49] The dominant role of functional action representation in object recognition
    Ni, Long
    Liu, Ye
    Yu, Wenyuan
    EXPERIMENTAL BRAIN RESEARCH, 2019, 237 (02) : 363 - 375
  • [50] Object Recognition Using Sparse Representation of Overcomplete Dictionary
    Loo, Chu-Kiong
    Memariani, Ali
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT IV, 2012, 7666 : 75 - 82