Document image analysis: A primer

被引:63
|
作者
Kasturi, R [1 ]
O'Gorman, L [1 ]
Govindaraju, V [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2002年 / 27卷 / 1期
关键词
OCR; feature analysis; document processing; graphics recognition; character recognition; layout analysis;
D O I
10.1007/BF02703309
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document's contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.
引用
收藏
页码:3 / 22
页数:20
相关论文
共 50 条
  • [11] Desktop document management by image analysis
    Nakagawa, T
    Kawashima, T
    Aoki, Y
    ADVANCED RESEARCH IN COMPUTERS AND COMMUNICATIONS IN EDUCATION, VOL 1: NEW HUMAN ABILITIES FOR THE NETWORKED SOCIETY, 1999, 55 : 179 - 182
  • [12] The ISL Document Image Analysis Toolbox
    Rogers, R
    Liang, JS
    Haralick, RM
    Phillips, IT
    WORKSHOP ON DOCUMENT IMAGE ANALYSIS (DIA'97), PROCEEDINGS: IN COOPERATION WITH CVPR '97, 1997, : 18 - 25
  • [13] Fuzzy segmentation for document image analysis
    Chan, KCC
    Huang, XD
    Bao, P
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 977 - 982
  • [14] HANDWRITTEN DOCUMENT IMAGE SEGMENTATION AND ANALYSIS
    SHAPIRO, V
    GLUHCHEV, G
    SGUREV, V
    PATTERN RECOGNITION LETTERS, 1993, 14 (01) : 71 - 78
  • [15] Digital libraries and document image analysis
    Baird, HS
    IS&T'S 2004 ARCHIVING CONFERENCE, PROCEEDINGS, 2004, : 286 - 288
  • [16] Textured reductions for document image analysis
    Bloomberg, DS
    DOCUMENT RECOGNITION III, 1996, 2660 : 160 - 174
  • [17] Digital libraries and document image analysis
    Baird, HS
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 2 - 14
  • [18] Watershed Based Document Image Analysis
    Shadkami, Pasha
    Bonnier, Nicolas
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT I, 2010, 6474 : 114 - 124
  • [19] Features for printed document image analysis
    Duong, J
    Emptoz, H
    Côté, M
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 245 - 248
  • [20] Comparative Semantic Document Layout Analysis for Enhanced Document Image Retrieval
    Jaha, Emad Sami
    IEEE ACCESS, 2024, 12 : 150451 - 150467