Ground-truth estimation in multispectral representation space: application to degraded document image binarization

被引:8
|
作者
Hedjam, Rachid [1 ]
Cheriet, Mohamed [1 ]
机构
[1] Synchromedia Lab Multemedia Commun Telepresence, Montreal, PQ H3C 1K3, Canada
来源
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2013年
关键词
Ground-truth estimation; Document image analysis; Document image binarization; Historical document images; Multispectral document imaging;
D O I
10.1109/ICDAR.2013.45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human ground-truthing is the manual labelling of samples (pixels for example) to generate reference data without any automatic algorithm help. Although a manual ground-truth is more accurate than a machine ground-truth, it still suffers from mislabeling and/or judgement errors. In this paper we propose a new method of ground-truth estimation using multispectral (MS) imaging feature space for the sake of document image binarization. Starting from the initial manual ground-truth, the proposed classification method aims to select automatically some samples with correct labels (well-labeled pixels) from each class for the training phase, then reassign new labels to the document image pixels. The classification scheme is based on the cooperation of multiple classifiers under some constraints. A real data set of MS historical document images and their ground-truth is created(1) (see url bellow) to demonstrate the effectiveness of the proposed method of ground-truth estimation.
引用
收藏
页码:190 / 194
页数:5
相关论文
共 50 条
  • [1] Automatic ground-truth generation for document image analysis and understanding
    Heroux, Pierre
    Barbu, Eugen
    Adam, Sebastien
    Trupin, Eric
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 476 - 480
  • [2] Automatic Ground-Truth Validation With Genetic Algorithms for Multispectral Image Classification
    Ghoggali, Noureddine
    Melgani, Farid
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (07): : 2172 - 2181
  • [3] MSIO: MultiSpectral Document Image BinarizatIOn
    Diem, Markus
    Hollaus, Fabian
    Sablatnig, Robert
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 84 - 89
  • [4] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [5] Binarization of degraded document image based on feature space partitioning and classification
    Morteza Valizadeh
    Ehsanollah Kabir
    International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 57 - 69
  • [6] Binarization of degraded document image based on feature space partitioning and classification
    Valizadeh, Morteza
    Kabir, Ehsanollah
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (01) : 57 - 69
  • [7] Degraded Document Image Binarization using Novel Background Estimation Technique
    Jindal, Harshit
    Kumar, Manoj
    Tomar, Akhil
    Malik, Ayush
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [8] Gabor Filters for Degraded Document Image Binarization
    Sehad, Abdenour
    Chibani, Youcef
    Cheriet, Mohamed
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 702 - 707
  • [9] Complex and Degraded Color Document Image Binarization
    Mysore, Sheshera
    Gupta, Manish Kumar
    Belhe, Swapnil
    2016 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2016, : 157 - 162
  • [10] Improved Degraded Document Image Binarization Using Median Filter for Background Estimation
    Khitas, Mehdi
    Ziet, Lahcene
    Bouguezel, Saad
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2018, 24 (03) : 82 - 87