Document image analysis: A primer

被引:63
|
作者
Kasturi, R [1 ]
O'Gorman, L [1 ]
Govindaraju, V [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
关键词
OCR; feature analysis; document processing; graphics recognition; character recognition; layout analysis;
D O I
10.1007/BF02703309
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document's contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.
引用
收藏
页码:3 / 22
页数:20
相关论文
共 50 条
  • [1] Document image analysis: A primer
    Rangachar Kasturi
    Lawrence O’Gorman
    Venu Govindaraju
    Sadhana, 2002, 27 : 3 - 22
  • [2] Document design primer
    Henry, P
    PUBLISHING RESEARCH QUARTERLY, 2004, 19 (04) : 69 - 71
  • [3] Document image analysis - Introduction
    Kamel, M
    Wesolkowski, S
    CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE, 1999, 24 (02): : 50 - 50
  • [4] Document design: A brief primer
    Flanders, MG
    SOCIETY FOR TECHNICAL COMMUNICATION 44TH ANNUAL CONFERENCE, 1997 PROCEEDINGS, 1997, : 235 - 238
  • [5] DOCUMENT IMAGE-ANALYSIS SYSTEMS
    OGORMAN, L
    KASTURI, R
    COMPUTER, 1992, 25 (07) : 5 - 8
  • [6] DOCUMENT IMAGE SEGMENTATION AND LAYOUT ANALYSIS
    SAITOH, T
    YAMAAI, T
    TACHIKAWA, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (07) : 778 - 784
  • [7] Resolution-sensitive document image analysis for document repurposing
    Berkner, K
    Schwartz, EL
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 102 - 106
  • [8] Document image analysis and recognition: a survey
    Arlazarov, V. V.
    Andreeva, E., I
    Bulatov, K. B.
    Nikolaev, D. P.
    Petrova, O. O.
    Savelev, B., I
    Slavin, O. A.
    COMPUTER OPTICS, 2022, 46 (04) : 567 - 589
  • [9] Script identification of document image analysis
    Cheng, Juan
    Ping, Xijian
    Zhou, Guanwei
    Yang, Yang
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 178 - +
  • [10] Treatment of diagrams in document image analysis
    Blostein, D
    Lank, E
    Zanibbi, R
    THEORY AND APPLICATION OF DIAGRAMS, PROCEEDINGS, 2000, 1889 : 330 - 344