Graph Clustering-based Ensemble Method for Handwritten Text Line Segmentation

被引:11
|
作者
Manohar, Vasant [1 ]
Vitaladevuni, Shiv N. [1 ]
Cao, Huaigu [1 ]
Prasad, Rohit [1 ]
Natarajan, Prem [1 ]
机构
[1] Raytheon BBN Technol, Speech Language & Multimedia Business Unit, Cambridge, MA 02138 USA
关键词
text line segmentation; handwriting; ensemble method; graph clustering;
D O I
10.1109/ICDAR.2011.121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the recent advances in ensemble-based combination for pattern recognition problems, it is possible to improve the segmentation performance by combining the outputs from different line finding methods. In this paper, we propose a novel graph clustering-based approach to combine the output of an ensemble of text line segmentation algorithms. A weighted undirected graph is constructed with nodes corresponding to connected components and edge connecting pairs of connected components. Text line segmentation is then posed as the problem of minimum cost partitioning of the nodes in the graph such that each cluster corresponds to a unique line in the document image. Experimental results on a challenging Arabic field dataset using the ensemble method shows a relative gain of 18% in the F-1 score over the best individual method within the ensemble.
引用
收藏
页码:574 / 578
页数:5
相关论文
共 50 条
  • [21] Word segmentation in handwritten Korean text lines based on gap clustering techniques
    Kim, SH
    Jeong, S
    Lee, GS
    Suen, CY
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 189 - 193
  • [22] A new handwritten character segmentation method based on nonlinear clustering
    Tan, Jun
    Lai, Jian-Huang
    Wang, Chang-Dong
    Wang, Wen-Xian
    Zuo, Xiao-Xiong
    NEUROCOMPUTING, 2012, 89 : 213 - 219
  • [23] Dual Clustering-Based Method for Geospatial Knowledge Graph Partitioning
    Chen, Yuxuan
    Ou, Feifei
    Liu, Qiliang
    Wu, Gusheng
    Chen, Kaiqi
    Deng, Min
    Chen, Meihua
    Xu, Rui
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [24] A generalized line segmentation method for multi-script handwritten text documents
    Rakshit, Payel
    Halder, Chayan
    Md Obaidullah, Sk
    Roy, Kaushik
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [25] A Multi-scale Text Line Segmentation Method in Freestyle Handwritten Documents
    Gao, Yangdong
    Ding, Xiaoqing
    Liu, Changsong
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 643 - 647
  • [26] LINECOUNTER: LEARNING HANDWRITTEN TEXT LINE SEGMENTATION BY COUNTING
    Li, Deng
    Wu, Yue
    Zhou, Yicong
    Proceedings - International Conference on Image Processing, ICIP, 2021, 2021-September : 929 - 933
  • [27] A Morphological Approach to Persian Handwritten Text Line Segmentation
    Amirkhani-Shahraki, Abdollah
    Ghahnavieh, Amir Ebrahimi
    Mirmandavi, Seyyed Abdollah
    2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 298 - 301
  • [28] A Tracking Approach for Text Line Segmentation in Handwritten Documents
    Setitra, Insaf
    Hadjadj, Zineb
    Meziane, Abdelkrim
    ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 193 - 198
  • [29] Text Line Segmentation in Images of Handwritten Historical Documents
    Sanchez, A.
    Suarez, P. D.
    Melloz, C. A. B.
    Oliveira, A. L. I.
    Alves, V. M. O.
    2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 232 - +
  • [30] LINECOUNTER: LEARNING HANDWRITTEN TEXT LINE SEGMENTATION BY COUNTING
    Li, Deng
    Wu, Yue
    Zhou, Yicong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 929 - 933