ROBUST POSITION, SCALE, AND ROTATION INVARIANT OBJECT RECOGNITION USING HIGHER-ORDER NEURAL NETWORKS

被引:32
|
作者
SPIRKOVSKA, L
REID, MB
机构
[1] NASA Ames Research Center, Moffett Field, CA 94035-1000
关键词
NEURAL NETWORKS; HIGHER-ORDER; WHITE NOISE; GAUSSIAN NOISE; OCCLUSION; OBJECT RECOGNITION; INVARIANT CLASSIFICATION; COARSE-CODING;
D O I
10.1016/0031-3203(92)90062-N
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For object recognition invariant to changes in the object's position, size, and in-plane rotation, higher-order neural networks (HONNs) have numerous advantages over other neural network approaches. Because distortion invariance can be built into the architecture of the network, HONNs need to be trained on just one view of each object, not numerous distorted views, reducing the training time significantly. Further, 100% accuracy can be guaranteed for noise-free test images characterized by the built-in distortions. Specifically, a third-order neural network trained on just one view of an SR-71 aircraft and a U-2 aircraft in a 127 x 127 pixel input field successfully recognized all views of both aircraft larger than 70% of the original size, regardless of orientation or position of the test image. Training required just six passes. In contrast, other neural network approaches require thousands of passes through a training set consisting of a much larger number of training images and typically achieve only 80-90% accuracy on novel views of the objects. The above results assume a noise-free environment. The performance of HONNs is explored with non-ideal test images characterized by white Gaussian noise or partial occlusion. With white noise added to images with an ideal separation of background vs. foreground gray levels, it is shown that HONNs achieve 100% recognition accuracy for the test set for a standard deviation up to approximately 10% of the maximum gray value and continue to show good performance (defined as better than 75% accuracy) up to a standard deviation of approximately 14%. HONNs are also robust with respect to partial occlusion. For the test set of training images with very similar profiles, HONNs achieve 100% recognition accuracy for one occlusion of approximately 13% of the input field size and four occlusions of approximately 70% of the input field size. They show good performance for one occlusion of approximately 23% of the input field size or four occlusions of approximately 15% of the input field size each. For training images with very different profiles, HONNs achieve 100% recognition accuracy for the test set for up to four occlusions of approximately 2% of the input field size and continue to show good performance for up to four occlusions of approximately 23% of the input field size each.
引用
收藏
页码:975 / 985
页数:11
相关论文
共 50 条
  • [41] DIGITAL SYSTEM INVARIANT TO POSITION AND ROTATION TO OBJECT RECOGNITION IN IMAGES BY INTENSITY PROFILES
    Solorza, S.
    Alvarez-Borrego, J.
    REVISTA CUBANA DE FISICA, 2014, 31 (01): : 18 - 19
  • [42] Multiple Foreground Recognition and Cosegmentation: An Object-Oriented CRF Model with Robust Higher-Order Potentials
    Zhu, Hongyuan
    Lu, Jiangbo
    Cai, Jianfei
    Zheng, Jianming
    Thalmann, Nadia M.
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 485 - 492
  • [43] Rotation invariant IR object recognition using adaptive kernel subspace projections with a neural network
    Smart, MHW
    BIOLOGICAL AND ARTIFICIAL COMPUTATION: FROM NEUROSCIENCE TO TECHNOLOGY, 1997, 1240 : 1028 - 1037
  • [44] Translation, rotation and scale invariant pattern recognition using spectral analysis and hybrid genetic-neural-fuzzy networks
    Lee, SK
    Jang, DS
    COMPUTERS & INDUSTRIAL ENGINEERING, 1996, 30 (03) : 511 - 522
  • [45] LEARNING AND ENCODING HIGHER-ORDER RULES IN NEURAL NETWORKS
    LEVINE, DS
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1995, 27 (02): : 178 - 182
  • [46] Graph Neural Network for Higher-Order Dependency Networks
    Jin, Di
    Gong, Yingli
    Wang, Zhiqiang
    Yu, Zhizhi
    He, Dongxiao
    Huang, Yuxiao
    Wang, Wenjun
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1622 - 1630
  • [47] Higher-order Clustering and Pooling for Graph Neural Networks
    Duval, Alexandre
    Malliaros, Fragkiskos
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 426 - 435
  • [48] Neural Predicting Higher-order Patterns in Temporal Networks
    Liu, Yunyu
    Ma, Jianzhu
    Li, Pan
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1340 - 1351
  • [49] Theory and development of higher-order CMAC neural networks
    Lane, Stephen H.
    Handelman, David A.
    Gelfand, Jack J.
    IEEE Control Systems Magazine, 1992, 12 (02): : 23 - 30
  • [50] Position-invariant, rotation-invariant, and scale-invariant process for binary image recognition
    Levkovitz, J
    Oron, E
    Tur, M
    APPLIED OPTICS, 1997, 36 (14): : 3035 - 3042