Graph Clustering-based Ensemble Method for Handwritten Text Line Segmentation

被引:11
|
作者
Manohar, Vasant [1 ]
Vitaladevuni, Shiv N. [1 ]
Cao, Huaigu [1 ]
Prasad, Rohit [1 ]
Natarajan, Prem [1 ]
机构
[1] Raytheon BBN Technol, Speech Language & Multimedia Business Unit, Cambridge, MA 02138 USA
关键词
text line segmentation; handwriting; ensemble method; graph clustering;
D O I
10.1109/ICDAR.2011.121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve the segmentation performance by combining the outputs from different line finding methods. In this paper, we propose a novel graph clustering-based approach to combine the output of an ensemble of text line segmentation algorithms. A weighted undirected graph is constructed with nodes corresponding to connected components and edge connecting pairs of connected components. Text line segmentation is then posed as the problem of minimum cost partitioning of the nodes in the graph such that each cluster corresponds to a unique line in the document image. Experimental results on a challenging Arabic field dataset using the ensemble method shows a relative gain of 18% in the F-1 score over the best individual method within the ensemble.
引用
收藏
页码:574 / 578
页数:5
相关论文
共 50 条
  • [41] Unsupervised multi-language handwritten text line segmentation
    Angel Garcia-Calderon, Miguel
    Arnulfo Garcia-Hernandez, Rene
    Ledeneva, Yulia
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 2901 - 2911
  • [42] A Multilevel Text line Segmentation Framework for Handwritten Historical Documents
    Ben Messaoud, Ines
    Amiri, Hamid
    El Abed, Haikal
    Maergner, Volker
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 515 - 520
  • [43] A new scheme for unconstrained handwritten text-line segmentation
    Alaei, Alireza
    Pal, Umapada
    Nagabhushan, P.
    PATTERN RECOGNITION, 2011, 44 (04) : 917 - 928
  • [44] DENSE PREDICTION FOR TEXT LINE SEGMENTATION IN HANDWRITTEN DOCUMENT IMAGES
    Quang Nhat Vo
    Lee, GueeSang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3264 - 3268
  • [45] Handwritten text line segmentation using Fully Convolutional Network
    Renton, Guillaume
    Chatelain, Clement
    Adam, Sebastien
    Kermorvant, Christopher
    Paquet, Thierry
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 5, 2017, : 5 - 9
  • [46] Survey on Clustering-Based Image Segmentation Techniques
    Zou, Yanni
    Liu, Bo
    2016 IEEE 20th International Conference on Computer Supported Cooperative Work in Design (CSCWD), 2016, : 106 - 110
  • [47] Text line segmentation in handwritten document using a production system
    Nicolas, S
    Paquet, T
    Heutte, L
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 245 - 250
  • [48] LINE SEGMENTATION OF HANDWRITTEN TEXT USING HISTOGRAMS AND TENSOR VOTING
    Babczynski, Tomasz
    Ptak, Roman
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (03) : 585 - 596
  • [49] Robust fuzzy clustering-based image segmentation
    Yang, Zhang
    Chung, Fu-Lai
    Wang Shitong
    APPLIED SOFT COMPUTING, 2009, 9 (01) : 80 - 84
  • [50] Handwritten Text Segmentation Method Based on Greedy Snake Algorithm and Radical Recognition
    Fu, Pengbin
    Dong, Aojing
    Yang, Huirong
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (01): : 80 - 90