Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents

被引:17
|
作者
Cohen, Rafi [1 ]
Dinstein, Itshak [2 ]
El-Sana, Jihad [1 ]
Kedem, Klara [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Elect & Comp Engn, Beer Sheva, Israel
关键词
Historical document processing; Text lines extraction;
D O I
10.1007/978-3-319-11758-4_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line extraction is vital pre-requisite for various document processing tasks. This paper presents a novel approach for text line extraction which is based on Gaussian scale space and dedicated binarization that utilize the inherent structure of smoothed text document images. It enhances the text lines in the image using multi-scale anisotropic second derivative of Gaussian filter bank at the average height of the text line. It then applies a binarization, which is based on component-tree and is tailored towards line extraction. The final stage of the algorithm is based on an energy minimization framework for removing spurious text line and assigning connected components to lines. We have tested our approach on various datasets written in different languages at range of image quality and received high detection rates, which outperform state-of-the-art algorithms. Our MATLAB code is publicly available. (http://www.cs.bgu.ac.il/similar to rafico/LineExtraction.zip)
引用
收藏
页码:349 / 358
页数:10
相关论文
共 50 条
  • [21] Text line extraction in graphical documents using background and foreground information
    Partha Pratim Roy
    Umapada Pal
    Josep Lladós
    International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 227 - 241
  • [22] Text line extraction in graphical documents using background and foreground information
    Pratim Roy, Partha
    Pal, Umapada
    Llados, Josep
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (03) : 227 - 241
  • [23] Repeatedly smoothing, discrete scale-space evolution and dominant point detection
    Li, BC
    PATTERN RECOGNITION, 1996, 29 (06) : 1049 - 1059
  • [24] Scale-space filter for smoothing electronic speckle pattern interferometry fringes
    Davila, A
    Kaufmann, GH
    Kerr, D
    OPTICAL ENGINEERING, 1996, 35 (12) : 3549 - 3554
  • [25] Text Line Extraction using DMLP Classifiers for Historical Manuscripts
    Baechler, Micheal
    Liwicki, Marcus
    Ingold, Rolf
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1029 - 1033
  • [26] A new feature extraction for iris identification using scale-space filtering technique
    Hong, J
    Yang, WS
    Kim, D
    Kim, YJ
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2004, E87A (12): : 3404 - 3408
  • [27] Scale-space analysis and corner detection on digital curves using a discrete scale-space kernel
    Ray, BK
    Ray, KS
    PATTERN RECOGNITION, 1997, 30 (09) : 1463 - 1474
  • [28] DARTs: Efficient scale-space extraction of DAISY keypoints
    Marimon, David
    Bonnin, Arturo
    Adamek, Tomasz
    Gimeno, Roger
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2416 - 2423
  • [29] Scale-Space Anisotropic Total Variation for Limited Angle Tomography
    Huang, Yixing
    Taubmann, Oliver
    Huang, Xiaolin
    Haase, Viktor
    Lauritsch, Guenter
    Maier, Andreas
    IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES, 2018, 2 (04) : 307 - 314
  • [30] A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology
    Roy, Bishakha
    Chatterjee, Rohit Kamal
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 198 - 203