Graph Clustering-based Ensemble Method for Handwritten Text Line Segmentation

Cited by: 11
Authors
Manohar, Vasant [1]
Vitaladevuni, Shiv N. [1]
Cao, Huaigu [1]
Prasad, Rohit [1]
Natarajan, Prem [1]
Affiliation
[1] Raytheon BBN Technologies, Speech, Language & Multimedia Business Unit, Cambridge, MA 02138, USA
Keywords
text line segmentation; handwriting; ensemble method; graph clustering
DOI
10.1109/ICDAR.2011.121
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve segmentation performance by combining the outputs of different line-finding methods. In this paper, we propose a novel graph clustering-based approach to combine the output of an ensemble of text line segmentation algorithms. A weighted undirected graph is constructed with nodes corresponding to connected components and edges connecting pairs of connected components. Text line segmentation is then posed as the problem of minimum-cost partitioning of the nodes in the graph such that each cluster corresponds to a unique line in the document image. Experimental results on a challenging Arabic field dataset show that the ensemble method achieves a relative gain of 18% in F-1 score over the best individual method within the ensemble.
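The abstract does not specify how edge weights are computed or which cost function is minimized, so the following is only a rough sketch of the general idea under stated assumptions. It assumes each method in the ensemble assigns every connected component a line label, weights each edge by the fraction of methods that place the two components on the same line (a co-association vote), and then groups components by thresholding those weights. The thresholding step is a simple stand-in for the paper's minimum-cost partitioning, and the names `co_association_weights`, `cluster_components`, and `threshold` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def co_association_weights(labelings):
    """Weight each pair of components by the fraction of ensemble
    methods that assign both components to the same text line.
    (Assumed weighting; the abstract does not define the weights.)"""
    n = len(labelings[0])
    weights = {}
    for i, j in combinations(range(n), 2):
        votes = sum(1 for labels in labelings if labels[i] == labels[j])
        if votes:
            weights[(i, j)] = votes / len(labelings)
    return weights


def cluster_components(n, weights, threshold=0.5):
    """Group components linked by edges whose weight exceeds the
    threshold, using union-find. This is a stand-in for minimum-cost
    partitioning, not the method described in the paper."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for (i, j), w in weights.items():
        if w > threshold:
            parent[find(i)] = find(j)

    groups = defaultdict(list)
    for node in range(n):
        groups[find(node)].append(node)
    return sorted(groups.values())


# Toy example: six connected components, three hypothetical methods.
# Label values are arbitrary per method; only agreement matters.
labelings = [
    [0, 0, 0, 1, 1, 1],  # method A
    [0, 0, 1, 1, 1, 1],  # method B assigns component 2 to the other line
    [5, 5, 5, 7, 7, 7],  # method C
]
weights = co_association_weights(labelings)
lines = cluster_components(6, weights)
print(lines)
```

In this toy input, two of the three methods agree that component 2 belongs with components 0 and 1, so the majority vote overrides the single dissenting method. A real system would need a principled partitioning objective and would likely restrict edges to spatially neighboring components.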
Pages: 574-578
Page count: 5