Document image analysis: A primer

被引:63
|
作者
Kasturi, R [1 ]
O'Gorman, L [1 ]
Govindaraju, V [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2002年 / 27卷 / 1期
关键词
OCR; feature analysis; document processing; graphics recognition; character recognition; layout analysis;
D O I
10.1007/BF02703309
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document's contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.
引用
收藏
页码:3 / 22
页数:20
相关论文
共 50 条
  • [31] Parallel preprocessing algorithms for document image analysis
    Rao, P.S.
    Srinivas, C.
    Chand, B.Hem
    Agarwal, Arun
    1996, IETE, New Delhi, India (42)
  • [32] A statistical learning approach to document image analysis
    Laven, K
    Leishman, S
    Roweis, S
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 357 - 361
  • [33] Human interactive proofs and document image analysis
    Baird, HS
    Popat, K
    DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 507 - +
  • [34] Twenty years of document image analysis in PAMI
    Nagy, G
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (01) : 38 - 62
  • [35] Special issue on document image analysis and recognition
    Amin, A
    PATTERN ANALYSIS AND APPLICATIONS, 2000, 3 (02) : 77 - 77
  • [36] Topologically invariant methods in document image analysis
    Gross, A
    Latecki, L
    VISION GEOMETRY VI, 1997, 3168 : 61 - 68
  • [37] A document image analysis system on parallel processors
    Sural, S
    Das, PK
    FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 527 - 532
  • [38] Formal Performance Evaluation for Document Image Analysis
    Lamiroy, Bart
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 2 - 2
  • [39] Threats to Image: A Primer
    Waldrep, G. C.
    STAND, 2022, 20 (03): : 14 - 14
  • [40] Human-Document Interaction systems - a new frontier for document image analysis
    Karatzas, Dimosthenis
    Poulain d'Andecy, Vincent
    Rusinol, Marcal
    Chica, Antonio
    Vazquez, Pere-Pau
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 369 - 374