Robust Binarization of Degraded Document Images Using Heuristics

被引:0
|
作者
Parker, Jon [1 ]
Frieder, Ophir [1 ]
Frieder, Gideon [1 ]
机构
[1] Georgetown Univ, Dept Comp Sci, Washington, DC 20057 USA
来源
关键词
readability enhancement; historic document processing; document degradation;
D O I
10.1117/12.2042581
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Historically significant documents are often discovered with defects that make them difficult to read and analyze. This fact is particularly troublesome if the defects prevent software from performing an automated analysis. Image enhancement methods are used to remove or minimize document defects, improve software performance, and generally make images more legible. We describe an automated, image enhancement method that is input page independent and requires no training data. The approach applies to color or greyscale images with hand written script, typewritten text, images, and mixtures thereof. We evaluated the image enhancement method against the test images provided by the 2011 Document Image Binarization Contest (DIBCO). Our method outperforms all 2011 DIBCO entrants in terms of average F1 measure - doing so with a significantly lower variance than top contest entrants. The capability of the proposed method is also illustrated using select images from a collection of historic documents stored at Yad Vashem Holocaust Memorial in Israel.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Binarization of Degraded Document Images Using Convolutional Neural Networks and Wavelet-Based Multichannel Images
    Akbari, Younes
    Al-Maadeed, Somaya
    Adam, Kalthoum
    IEEE ACCESS, 2020, 8 (08): : 153517 - 153534
  • [32] A new efficient binarization method: application to degraded historical document images
    Zineb Hadjadj
    Mohamed Cheriet
    Abdelkrim Meziane
    Yazid Cherfa
    Signal, Image and Video Processing, 2017, 11 : 1155 - 1162
  • [33] Binarization of degraded document images based on hierarchical deep supervised network
    Vo, Quang Nhat
    Kim, Soo Hyung
    Yang, Hyung Jeong
    Lee, Gueesang
    PATTERN RECOGNITION, 2018, 74 : 568 - 586
  • [34] Nonlinear diffusion system for simultaneous restoration and binarization of degraded document images
    Du, Zhongjie
    He, Chuanjiang
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2024, 153 : 237 - 248
  • [35] A new efficient binarization method: application to degraded historical document images
    Hadjadj, Zineb
    Cheriet, Mohamed
    Meziane, Abdelkrim
    Cherfa, Yazid
    SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (06) : 1155 - 1162
  • [36] Nonlinear diffusion equation with selective source for binarization of degraded document images
    Du, Zhongjie
    He, Chuanjiang
    APPLIED MATHEMATICAL MODELLING, 2021, 99 : 243 - 259
  • [37] A multi-scale framework for adaptive binarization of degraded document images
    Moghaddam, Reza Farrahi
    Cheriet, Mohamed
    PATTERN RECOGNITION, 2010, 43 (06) : 2186 - 2198
  • [38] GiB: A Game Theory Inspired Binarization Technique for Degraded Document Images
    Bhowmik, Showmik
    Sarkar, Ram
    Das, Bishwadeep
    Doermann, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1443 - 1455
  • [39] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [40] Degraded document image binarization using structural symmetry of strokes
    Jia, Fuxi
    Shi, Cunzhao
    He, Kun
    Wang, Chunheng
    Xiao, Baihua
    PATTERN RECOGNITION, 2018, 74 : 225 - 240