Multilevel Context Representation for Improving Object Recognition

Cited by: 3

Authors:

Koelsch, Andreas [1,2]

Afzal, Muhammad Zeshan [1,2]

Liwicki, Marcus [1,2,3]

Affiliations:

[1] Univ Kaiserslautern, MindGarage, Kaiserslautern, Germany

[2] Insiders Technol GmbH, Kaiserslautern, Germany

[3] Univ Fribourg, Fribourg, Switzerland

Source:

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 5 | 2017

Keywords:

DOI:

10.1109/ICDAR.2017.322

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this work, we propose the combined usage of low- and high-level blocks of convolutional neural networks (CNNs) for improving object recognition. While recent research has focused on either propagating the context from all layers, including the very low-level ones (e.g. ResNet), or having multiple loss layers (e.g. GoogLeNet), the importance of the features close to the higher layers has been ignored. This paper postulates that the use of context closer to the high-level layers provides scale and translation invariance and works better than using the top layer only. In particular, we extend AlexNet and GoogLeNet by additional connections in the top n layers. To demonstrate the effectiveness of the proposed approach, we evaluated it on the standard ImageNet task. The relative reduction of the classification error is around 1-2% without affecting the computational cost. Furthermore, we show that this approach is orthogonal to typical test data augmentation techniques, as recently introduced by Szegedy et al. (leading to a runtime reduction of 144 during test time).
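
The idea of feeding the classifier from the top n layers rather than from the top layer alone can be illustrated with a minimal sketch. The abstract does not say how the additional connections are combined, so concatenation is an assumption here, and the layer sizes, random weights, and the helper names `dense`, `init_layer`, and `multilevel_features` are hypothetical. This is a toy fully connected network in plain Python, not the authors' AlexNet or GoogLeNet extension.

```python
import random


def dense(x, weights, bias):
    """Fully connected layer followed by ReLU."""
    return [max(0.0, sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weights, bias)]


def init_layer(n_in, n_out, rng):
    """Random weights and zero bias for one layer (illustrative only)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def multilevel_features(x, layers, top_n):
    """Run x through all layers and concatenate the outputs of the
    last top_n layers (assumed combination rule: concatenation)."""
    if not 1 <= top_n <= len(layers):
        raise ValueError("top_n must be between 1 and the number of layers")
    outputs = []
    for weights, bias in layers:
        x = dense(x, weights, bias)
        outputs.append(x)
    combined = []
    for out in outputs[-top_n:]:
        combined.extend(out)
    return combined


rng = random.Random(0)
sizes = [16, 12, 8, 8]  # hypothetical: input size, then three layer widths
layers = [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]
x = [rng.random() for _ in range(sizes[0])]

top_only = multilevel_features(x, layers, top_n=1)    # top layer only
multilevel = multilevel_features(x, layers, top_n=2)  # top two layers
print(len(top_only), len(multilevel))
```

With `top_n=1` the classifier input is the usual top-layer feature vector; with `top_n=2` it also contains the output of the layer below, so a classifier placed on top would see both levels of context.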


Pages: 10-15

Number of pages: 6
