ROBUST POSITION, SCALE, AND ROTATION INVARIANT OBJECT RECOGNITION USING HIGHER-ORDER NEURAL NETWORKS

被引：32

作者：

SPIRKOVSKA, L

REID, MB

机构：

[1] NASA Ames Research Center, Moffett Field, CA 94035-1000

来源：

PATTERN RECOGNITION | 1992年 / 25卷 / 09期

关键词：

NEURAL NETWORKS; HIGHER-ORDER; WHITE NOISE; GAUSSIAN NOISE; OCCLUSION; OBJECT RECOGNITION; INVARIANT CLASSIFICATION; COARSE-CODING;

D O I：

10.1016/0031-3203(92)90062-N

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For object recognition invariant to changes in the object's position, size, and in-plane rotation, higher-order neural networks (HONNs) have numerous advantages over other neural network approaches. Because distortion invariance can be built into the architecture of the network, HONNs need to be trained on just one view of each object, not numerous distorted views, reducing the training time significantly. Further, 100% accuracy can be guaranteed for noise-free test images characterized by the built-in distortions. Specifically, a third-order neural network trained on just one view of an SR-71 aircraft and a U-2 aircraft in a 127 x 127 pixel input field successfully recognized all views of both aircraft larger than 70% of the original size, regardless of orientation or position of the test image. Training required just six passes. In contrast, other neural network approaches require thousands of passes through a training set consisting of a much larger number of training images and typically achieve only 80-90% accuracy on novel views of the objects. The above results assume a noise-free environment. The performance of HONNs is explored with non-ideal test images characterized by white Gaussian noise or partial occlusion. With white noise added to images with an ideal separation of background vs. foreground gray levels, it is shown that HONNs achieve 100% recognition accuracy for the test set for a standard deviation up to approximately 10% of the maximum gray value and continue to show good performance (defined as better than 75% accuracy) up to a standard deviation of approximately 14%. HONNs are also robust with respect to partial occlusion. For the test set of training images with very similar profiles, HONNs achieve 100% recognition accuracy for one occlusion of approximately 13% of the input field size and four occlusions of approximately 70% of the input field size. They show good performance for one occlusion of approximately 23% of the input field size or four occlusions of approximately 15% of the input field size each. For training images with very different profiles, HONNs achieve 100% recognition accuracy for the test set for up to four occlusions of approximately 2% of the input field size and continue to show good performance for up to four occlusions of approximately 23% of the input field size each.

引用

页码：975 / 985

页数：11

共 50 条

[41] DIGITAL SYSTEM INVARIANT TO POSITION AND ROTATION TO OBJECT RECOGNITION IN IMAGES BY INTENSITY PROFILES
Solorza, S.
Alvarez-Borrego, J.
REVISTA CUBANA DE FISICA, 2014, 31 (01): : 18 - 19
[42] Multiple Foreground Recognition and Cosegmentation: An Object-Oriented CRF Model with Robust Higher-Order Potentials
Zhu, Hongyuan
Lu, Jiangbo
Cai, Jianfei
Zheng, Jianming
Thalmann, Nadia M.
2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 485 - 492
[43] Rotation invariant IR object recognition using adaptive kernel subspace projections with a neural network
Smart, MHW
BIOLOGICAL AND ARTIFICIAL COMPUTATION: FROM NEUROSCIENCE TO TECHNOLOGY, 1997, 1240 : 1028 - 1037
[44] Translation, rotation and scale invariant pattern recognition using spectral analysis and hybrid genetic-neural-fuzzy networks
Lee, SK
Jang, DS
COMPUTERS & INDUSTRIAL ENGINEERING, 1996, 30 (03) : 511 - 522
[45] LEARNING AND ENCODING HIGHER-ORDER RULES IN NEURAL NETWORKS
LEVINE, DS
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1995, 27 (02): : 178 - 182
[46] Graph Neural Network for Higher-Order Dependency Networks
Jin, Di
Gong, Yingli
Wang, Zhiqiang
Yu, Zhizhi
He, Dongxiao
Huang, Yuxiao
Wang, Wenjun
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1622 - 1630
[47] Higher-order Clustering and Pooling for Graph Neural Networks
Duval, Alexandre
Malliaros, Fragkiskos
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 426 - 435
[48] Neural Predicting Higher-order Patterns in Temporal Networks
Liu, Yunyu
Ma, Jianzhu
Li, Pan
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1340 - 1351
[49] Theory and development of higher-order CMAC neural networks
Lane, Stephen H.
Handelman, David A.
Gelfand, Jack J.
IEEE Control Systems Magazine, 1992, 12 (02): : 23 - 30
[50] Position-invariant, rotation-invariant, and scale-invariant process for binary image recognition
Levkovitz, J
Oron, E
Tur, M
APPLIED OPTICS, 1997, 36 (14): : 3035 - 3042

← 1 2 3 4 5 →