Document image analysis: A primer

被引:0
|
作者
Rangachar Kasturi
Lawrence O’Gorman
Venu Govindaraju
机构
[1] The Pennsylvania State University,Department of Computer Science & Engineering
[2] Avaya Labs,CEDAR
[3] State University of New York at Buffalo,undefined
来源
Sadhana | 2002年 / 27卷
关键词
OCR; feature analysis; document processing; graphics recognition; character recognition; layout analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document’s contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.
引用
收藏
页码:3 / 22
页数:19
相关论文
共 50 条
  • [21] Human interactive proofs and document image analysis
    Baird, Henry S.
    Popat, Kris
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2002, 2423 : 507 - 518
  • [22] Adaptive binarization method for document image analysis
    Feng, ML
    Tan, YP
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 339 - 342
  • [23] New segmentation techniques for document image analysis
    Univ of Leeds, Leeds, United Kingdom
    Image Vision Comput, 7 (573-583):
  • [24] XML data representation in document image analysis
    Belaid, Abdel
    Falk, Ingrid
    Rangoni, Yves
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 78 - +
  • [25] A model guided document image analysis scheme
    Harit, G
    Chaudhury, S
    Gupta, P
    Vohra, N
    Joshi, SD
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1137 - 1141
  • [26] Old document image analysis: a texture approach
    Journet, Nicholas
    Ramel, Jean-Yves
    Eglin, Veronique
    Mullot, Remy
    TRAITEMENT DU SIGNAL, 2007, 24 (06) : 461 - 479
  • [27] Digital geometric methods in document image analysis
    Gross, A
    Latecki, LJ
    PATTERN RECOGNITION, 1999, 32 (03) : 407 - 424
  • [28] A fast multifunctional approach for document image analysis
    Gattani, A
    Mukerji, M
    Gur, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1178 - 1182
  • [29] Adaptive thresholding technique for document image analysis
    Bin Rais, N
    Hanif, MS
    Taj, IA
    INMIC 2004: 8th International Multitopic Conference, Proceedings, 2004, : 61 - 66
  • [30] Parallel preprocessing algorithms for document image analysis
    Rao, PS
    Srinivas, C
    Chand, BH
    Agarwal, A
    JOURNAL OF THE INSTITUTION OF ELECTRONICS AND TELECOMMUNICATION ENGINEERS, 1996, 42 (03): : 165 - 173