Graph Clustering-based Ensemble Method for Handwritten Text Line Segmentation

被引:11
|
作者
Manohar, Vasant [1 ]
Vitaladevuni, Shiv N. [1 ]
Cao, Huaigu [1 ]
Prasad, Rohit [1 ]
Natarajan, Prem [1 ]
机构
[1] Raytheon BBN Technol, Speech Language & Multimedia Business Unit, Cambridge, MA 02138 USA
关键词
text line segmentation; handwriting; ensemble method; graph clustering;
D O I
10.1109/ICDAR.2011.121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve the segmentation performance by combining the outputs from different line finding methods. In this paper, we propose a novel graph clustering-based approach to combine the output of an ensemble of text line segmentation algorithms. A weighted undirected graph is constructed with nodes corresponding to connected components and edge connecting pairs of connected components. Text line segmentation is then posed as the problem of minimum cost partitioning of the nodes in the graph such that each cluster corresponds to a unique line in the document image. Experimental results on a challenging Arabic field dataset using the ensemble method shows a relative gain of 18% in the F-1 score over the best individual method within the ensemble.
引用
收藏
页码:574 / 578
页数:5
相关论文
共 50 条
  • [1] Graph-based ensemble method for text line segmentation in offline Chinese handwritten documents
    Huang, L. (huangliang1576@gmail.com), 1600, Huazhong University of Science and Technology (42):
  • [2] Clustering-based word segmentation from off-line handwritten Uyghur text-line images
    Hamdulla A.
    Abliz A.
    Dawut A.
    Moydin K.
    Tuerxun P.
    International Journal of Information and Communication Technology, 2020, 16 (03) : 214 - 229
  • [3] Handwritten Text Line Segmentation by Spectral Clustering
    Han, Xuecheng
    Yao, Hui
    Zhong, Guoqiang
    EIGHTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2016), 2017, 10225
  • [4] Handwritten Chinese text line segmentation by clustering with distance metric learning
    Yin, Fei
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2009, 42 (12) : 3146 - 3157
  • [5] Angle Minimization and Graph Analysis for text line segmentation in handwritten documents
    Setitra, Insaf
    Meziane, Abdelkrim
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 453 - 458
  • [6] GAN-based text line segmentation method for challenging handwritten documents
    Ozseker, Ibrahim
    Demir, Ali Alper
    Ozkaya, Ufuk
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024,
  • [7] Graph based line segmentation on cluttered handwritten manuscripts
    Wahlberg, Fredrik
    Brun, Anders
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1570 - 1573
  • [8] GAN-based text line segmentation method for challenging handwritten documentsGAN-based text line segmentation method for challenging handwritten documentsİ Özşeker et al.
    İbrahim Özşeker
    Ali Alper Demir
    Ufuk Özkaya
    International Journal on Document Analysis and Recognition (IJDAR), 2025, 28 (1): : 59 - 69
  • [9] Handwritten Documents Text Line Segmentation based on Information Energy
    Boiangiu, C. A.
    Tanase, M. C.
    Ioanitescu, R.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2014, 9 (01) : 8 - 15
  • [10] Influence of Text Line Segmentation in Handwritten Text Recognition
    Romero, Veronica
    Andreu Sanchez, Joan
    Bosch, Vicente
    Depuydt, Katrien
    de Does, Jesse
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 536 - 540