Page Segmentation for Historical Handwritten Document Images Using Color and Texture Features

Cited by: 27
Authors
Chen, Kai [1]
Wei, Hao [1]
Hennebert, Jean [1, 2]
Ingold, Rolf [1]
Liwicki, Marcus [1, 3]
Affiliations
[1] Univ Fribourg, Dept Informat, DIVA Res Grp, CH-1700 Fribourg, Switzerland
[2] Univ Appl Sci, HES SO FR, CH-1705 Fribourg, Switzerland
[3] DFKI German Res Ctr Artificial Intelligence, Saarbrucken, Germany
Source
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2014
Funding
Swiss National Science Foundation;
Keywords
page segmentation; historical document; layout analysis; feature selection;
DOI
10.1109/ICFHR.2014.88
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a physical structure detection method for historical handwritten document images. We treat layout analysis as a pixel labeling problem. By classifying each pixel as periphery, background, text block, or decoration, we achieve high-quality segmentation without any assumption about specific topologies or shapes. Various color and texture features, such as color variance, smoothness, Laplacian, Local Binary Patterns, and Gabor Dominant Orientation Histogram, are used for classification. Some of these features have so far received little attention in document image layout analysis. An Improved Fast Correlation-Based Filter feature selection algorithm removes the redundant and irrelevant features. Finally, the segmentation results are refined by a smoothing post-processing procedure. The proposed method is demonstrated by experiments conducted on three different historical handwritten document image datasets. The experiments show the benefit of combining various color and texture features for classification. The results also show the advantage of using a feature selection method to choose an optimal feature subset. With the proposed method we achieve higher accuracy than earlier work on several datasets; for example, we achieved 93% accuracy, compared with 91% for the previous method, on the Parzival dataset, which contains about 100 million pixels.
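Two steps named in the abstract can be illustrated with a minimal Python sketch: a Local Binary Pattern code as a per-pixel texture feature, and a smoothing pass over the predicted label map. This is not the authors' implementation. The function names `lbp_code` and `smooth_labels`, the 8-neighbour layout, and the majority-vote smoothing rule are all assumptions made for illustration, since the abstract does not specify these details.

```python
from collections import Counter


def lbp_code(img, r, c):
    """8-neighbour Local Binary Pattern code of an interior pixel (r, c).

    `img` is a 2-D list of grayscale values. The neighbour ordering is an
    assumption; the abstract does not state which LBP variant is used.
    """
    center = img[r][c]
    # Neighbours visited clockwise, starting at the top-left corner.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dr, dc) in enumerate(offsets):
        # Set the bit when the neighbour is at least as bright as the center.
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def smooth_labels(labels, radius=1):
    """Replace each label by the most frequent label in its local window.

    Majority voting is a hypothetical stand-in for the paper's smoothing
    post-processing, which the abstract does not describe.
    """
    h, w = len(labels), len(labels[0])
    out = [row[:] for row in labels]
    for r in range(h):
        for c in range(w):
            # Window is clipped at the image border.
            window = [labels[rr][cc]
                      for rr in range(max(0, r - radius), min(h, r + radius + 1))
                      for cc in range(max(0, c - radius), min(w, c + radius + 1))]
            out[r][c] = Counter(window).most_common(1)[0][0]
    return out
```

In a full pipeline, codes like these would be combined with the color features into a per-pixel feature vector, classified into the four classes, and the resulting label map would then be smoothed; an isolated "decoration" pixel inside a text block would be relabeled as text block.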
Pages: 488-493
Number of pages: 6