Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents

被引:17
|
作者
Cohen, Rafi [1 ]
Dinstein, Itshak [2 ]
El-Sana, Jihad [1 ]
Kedem, Klara [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Elect & Comp Engn, Beer Sheva, Israel
关键词
Historical document processing; Text lines extraction;
D O I
10.1007/978-3-319-11758-4_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line extraction is vital pre-requisite for various document processing tasks. This paper presents a novel approach for text line extraction which is based on Gaussian scale space and dedicated binarization that utilize the inherent structure of smoothed text document images. It enhances the text lines in the image using multi-scale anisotropic second derivative of Gaussian filter bank at the average height of the text line. It then applies a binarization, which is based on component-tree and is tailored towards line extraction. The final stage of the algorithm is based on an energy minimization framework for removing spurious text line and assigning connected components to lines. We have tested our approach on various datasets written in different languages at range of image quality and received high detection rates, which outperform state-of-the-art algorithms. Our MATLAB code is publicly available. (http://www.cs.bgu.ac.il/similar to rafico/LineExtraction.zip)
引用
收藏
页码:349 / 358
页数:10
相关论文
共 50 条
  • [1] Text Line Extraction in Handwritten Historical Documents
    Capobianco, Samuele
    Marinai, Simone
    DIGITAL LIBRARIES AND ARCHIVES, IRCDL 2017, 2017, 733 : 68 - 79
  • [2] Text Line Extraction in Historical Documents Using Mask R-CNN
    Droby, Ahmad
    Barakat, Berat Kurar
    Alaasam, Reem
    Madi, Boraq
    Rabaev, Irina
    El-Sana, Jihad
    SIGNALS, 2022, 3 (03): : 535 - 549
  • [3] Skew Correction and Text Line Extraction of Arabic Historical Documents
    Zoizon, Abdelhay
    Zarghili, Ars Alane
    Chaker, Ilham
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 181 - 193
  • [4] Feature point extraction using scale-space representation
    Abdeljaoued, Y
    Ebrahimi, T
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 3053 - 3056
  • [5] SCALE-SPACE AND EDGE-DETECTION USING ANISOTROPIC DIFFUSION
    PERONA, P
    MALIK, J
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (07) : 629 - 639
  • [6] A Combined System for Text Line Extraction and Handwriting Recognition in Historical Documents
    Fischer, Andreas
    Baechler, Micheal
    Garz, Angelika
    Liwicki, Marcus
    Ingold, Rolf
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 71 - 75
  • [7] Stochastic analysis of image acquisition and scale-space smoothing
    Astrom, K
    Heyden, A
    GAUSSIAN SCALE-SPACE THEORY, 1997, 8 : 129 - 136
  • [8] Generic scale-space process for handwriting documents analysis
    Joutel, Guillaume
    Eglin, Veronique
    Emptoz, Hubert
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2698 - 2701
  • [9] Stochastic analysis of image acquisition, interpolation and scale-space smoothing
    Åström, K
    Heyden, A
    ADVANCES IN APPLIED PROBABILITY, 1999, 31 (04) : 855 - 894
  • [10] Scale-space feature extraction on digital surfaces
    Levallois, Jeremy
    Coeurjolly, David
    Lachaud, Jacques-Olivier
    COMPUTERS & GRAPHICS-UK, 2015, 51 : 177 - 189