Using generative models for handwritten digit recognition

被引:97
|
作者
Revow, M [1 ]
Williams, CKI [1 ]
Hinton, GE [1 ]
机构
[1] ASTON UNIV, DEPT COMP SCI & APPL MATH, BIRMINGHAM B4 7ET, W MIDLANDS, ENGLAND
关键词
deformable model; elastic net; optical character recognition; generative model; probabilistic model; mixture model;
D O I
10.1109/34.506410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian ''ink generators'' spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. 1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. 2) During the process of explaining the image, generative models can perform recognition driven segmentation. 3) The method involves a relatively small number or parameters and hence training is relatively easy and fast. 4) Unlike many other recognition schemes, if does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques.
引用
收藏
页码:592 / 606
页数:15
相关论文
共 50 条
  • [21] Persian handwritten digit recognition using ensemble classifiers
    Karimi, Hossein
    Esfahanimehr, Azadeh
    Mosleh, Mohammad
    Ghadam, Faraz Mohammadian Jadval
    Salehpour, Simintaj
    Medhati, Omid
    INTERNATIONAL CONFERENCE ON ADVANCED WIRELESS INFORMATION AND COMMUNICATION TECHNOLOGIES (AWICT 2015), 2015, 73 : 416 - 425
  • [22] A modular classification scheme with elastic net models for handwritten digit recognition
    Zhang, BL
    Fu, MY
    Yan, H
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1859 - 1861
  • [23] HANDWRITTEN BANGLA DIGIT RECOGNITION USING CHEMICAL REACTION OPTIMIZATION
    Boni, Pritam Khan
    Abir, Bappy Shahriar
    Hasan, H. M. Mehedi
    Islam, Md. Rafiqul
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [24] Unconstrained handwritten digit recognition using perceptual shape primitives
    Kalyan S. Dash
    Niladri B. Puhan
    Ganapati Panda
    Pattern Analysis and Applications, 2018, 21 : 413 - 436
  • [25] Handwritten Digit String Recognition using Convolutional Neural Network
    Zhan, Hongjian
    Lyu, Shujing
    Lu, Yue
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3729 - 3734
  • [26] Handwritten English Character and Digit Recognition
    Al-Mahmud
    Tanvin, Asnuva
    Rahman, Sazia
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND INFORMATION TECHNOLOGY 2021 (ICECIT 2021), 2021,
  • [27] Rosenblatt Perceptrons for handwritten digit recognition
    Ernst, K
    Tatyana, B
    Lora, K
    Vladimir, L
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1516 - 1520
  • [28] A Novel Technique for Handwritten Digit Recognition Using Deep Learning
    Ahmed, Syed Sohail
    Mehmood, Zahid
    Awan, Imran Ahmad
    Yousaf, Rehan Mehmood
    JOURNAL OF SENSORS, 2023, 2023
  • [29] Isolated Handwritten Digit Recognition Using oBIFs and Background Features
    Gattal, Abdeljalil
    Djeddi, Chawki
    Chibani, Youcef
    Siddiqi, Imran
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 305 - 310
  • [30] Unconstrained handwritten digit recognition using perceptual shape primitives
    Dash, Kalyan S.
    Puhan, Niladri B.
    Panda, Ganapati
    PATTERN ANALYSIS AND APPLICATIONS, 2018, 21 (02) : 413 - 436