Degraded Document Image Binarization using Novel Background Estimation Technique

Authors
Jindal, Harshit [1]
Kumar, Manoj [1]
Tomar, Akhil [1]
Malik, Ayush [1]
Affiliation
[1] Delhi Technol Univ, Dept Comp Sci Engn, New Delhi, India
Keywords
Document Image Processing; Degraded Document Image Binarization; Thresholding; Background Estimation; Noise Removal; Otsu Thresholding; Bilateral Filtering
DOI
10.1109/I2CT51068.2021.9418084
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Over the past few decades, the use of scanned historical document images has increased dramatically, especially with the emergence of online libraries and standard benchmark datasets such as DIBCO. Historical documents are usually in very poor condition and contain degradations such as large ink stains, bleed-through, liquid spills, uneven background, spots, faded ink, and weak or thin text, all of which make binarization very difficult. In this paper, we propose an effective binarization algorithm for degraded document images that performs accurate text segmentation. Our method first estimates the background using information from neighboring pixels and filter smoothing. Next, background subtraction compensates for background distortions. The document is then segmented using Otsu thresholding, and the result is post-processed with labelled connected components to remove the remaining noise and maximize text content. When evaluated on the H-DIBCO 2016 and H-DIBCO 2018 datasets, our method outperforms several existing and widely used binarization algorithms on F-measure, PSNR, DRD, and pseudo F-measure, and it detects faint characters in a document image very effectively.
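The four stages named in the abstract (background estimation, background subtraction, Otsu thresholding, connected-component clean-up) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not specify the background estimator, so a local maximum followed by a local mean is used here as a stand-in, and the function names, the `radius` and `min_size` parameters, and the assumption of dark text on a lighter background are all hypothetical choices made for illustration.

```python
from collections import deque

def window_filter(img, radius, fn):
    """Apply fn to the (border-clipped) square neighbourhood of every pixel."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            block = [img[j][i]
                     for j in range(max(0, y - radius), min(h, y + radius + 1))
                     for i in range(max(0, x - radius), min(w, x + radius + 1))]
            out[y][x] = fn(block)
    return out

def estimate_background(img, radius=2):
    # Assumed stand-in estimator: a local maximum erases dark strokes that are
    # thinner than the window, then a local mean smooths the result.
    brightest = window_filter(img, radius, max)
    return window_filter(brightest, radius, lambda b: sum(b) // len(b))

def otsu_threshold(values):
    """Return t maximising between-class variance (class 0 is values <= t)."""
    hist = [0] * 256
    for v in values:
        hist[v] += 1
    total = len(values)
    sum_all = sum(i * c for i, c in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_t, best_var = t, var
    return best_t

def remove_small_components(mask, min_size):
    """Keep only 4-connected foreground components with >= min_size pixels."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if not mask[y][x] or seen[y][x]:
                continue
            comp, queue = [], deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                comp.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(comp) >= min_size:
                for cy, cx in comp:
                    out[cy][cx] = 1
    return out

def binarize(img, radius=2, min_size=3):
    """Grayscale image (rows of ints 0..255) -> mask with 1 marking text."""
    bg = estimate_background(img, radius)
    # Background subtraction: text becomes bright, flat background near zero.
    diff = [[max(0, b - p) for b, p in zip(brow, prow)]
            for brow, prow in zip(bg, img)]
    t = otsu_threshold([v for row in diff for v in row])
    mask = [[1 if v > t else 0 for v in row] for row in diff]
    return remove_small_components(mask, min_size)
```

Calling `binarize(img)` on a list of rows of 8-bit gray values returns a same-sized mask of 0s and 1s. Subtracting the estimated background before thresholding is what lets a single global Otsu threshold cope with uneven illumination, and the final size filter discards isolated specks that survive thresholding.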
Pages: 8