Degraded document image binarization using structural symmetry of strokes

被引:64
|
作者
Jia, Fuxi [1 ]
Shi, Cunzhao [1 ]
He, Kun [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
机构
[1] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Document image binarization; Structural symmetry of strokes; Local threshold; Stroke width estimation;
D O I
10.1016/j.patcog.2017.09.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an effective approach for the local threshold binarization of degraded document images. We utilize the structural symmetric pixels (SSPs) to calculate the local threshold in neighborhood and the voting result of multiple thresholds will determine whether one pixel belongs to the foreground or not. The SSPs are defined as the pixels around strokes whose gradient magnitudes are large enough and orientations are symmetric opposite. The compensated gradient map is used to extract the SSP so as to weaken the influence of document degradations. To extract SSP candidates with large magnitudes and distinguish the faint characters and bleed-through background, we propose an adaptive global threshold selection algorithm. To further extract pixels with opposite orientations, an iterative stroke width estimation algorithm is applied to ensure the proper size of neighborhood used in orientation judgement. At last, we present a multiple threshold vote based framework to deal with some inaccurate detections of SSP. The experimental results on seven public document image binarization datasets show that our method is accurate and robust compared with many traditional and state-of-the-art document binarization approaches based on multiple evaluation measures. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:225 / 240
页数:16
相关论文
共 50 条
  • [41] Automatic Document Image Binarization using Bayesian Optimization
    Vats, Ekta
    Hast, Anders
    Singh, Prashant
    PROCEEDINGS OF THE 4TH INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2017, 2017, : 89 - 94
  • [42] Historical document image binarization
    Mello, Carlos A. B.
    Oliveira, Adriano L. I.
    Sanchez, Angel
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 108 - 113
  • [43] Adaptive document image binarization
    Sauvola, J
    Pietikäinen, M
    PATTERN RECOGNITION, 2000, 33 (02) : 225 - 236
  • [44] Binarization of Document Image Using Optimum Threshold Modification
    Mustafa, Wan Azani
    Kader, Mohamed Mydin M. Abdul
    1ST INTERNATIONAL CONFERENCE ON GREEN AND SUSTAINABLE COMPUTING (ICOGES) 2017, 2018, 1019
  • [45] Document Image Binarization Using Recurrent Neural Networks
    Westphal, Florian
    Lavesson, Niklas
    Grahn, Hakan
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 263 - 268
  • [46] Document image binarization using difference of concatenated convolutions
    Jyothi, R. L.
    Rahiman, M. Abdul
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (02) : 2939 - 2952
  • [47] Binarization of document images using image dependent model
    Dawoud, A
    Kamel, M
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 49 - 53
  • [48] Binarization of degraded document images based on contrast enhancement
    Di Lu
    Xin Huang
    LiXue Sui
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 123 - 135
  • [49] Binarization of degraded document images based on contrast enhancement
    Lu, Di
    Huang, Xin
    Sui, LiXue
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (1-2) : 123 - 135
  • [50] Efficient binarization technique for severely degraded document images
    Brij Mohan Singh
    CSI Transactions on ICT, 2014, 2 (3) : 153 - 161