Page Segmentation for Historical Handwritten Document Images Using Color and Texture Features

被引:27
|
作者
Chen, Kai [1 ]
Wei, Hao [1 ]
Hennebert, Jean [1 ,2 ]
Ingold, Rolf [1 ]
Liwicki, Marcus [1 ,3 ]
机构
[1] Univ Fribourg, Dept Informat, DIVA Res Grp, CH-1700 Fribourg, Switzerland
[2] Univ Appl Sci, HES SO FR, CH-1705 Fribourg, Switzerland
[3] DFKI German Res Ctr Artificial Itelligence, Saarbrucken, Germany
来源
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2014年
基金
瑞士国家科学基金会;
关键词
page segmentation; historical document; layout analysis; feature selection; LAYOUT ANALYSIS;
D O I
10.1109/ICFHR.2014.88
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a physical structure detection method for historical handwritten document images. We considered layout analysis as a pixel labeling problem. By classifying each pixel as either periphery, background, text block, or decoration, we achieve high quality segmentation without any assumption of specific topologies and shapes. Various color and texture features such as color variance, smoothness, Laplacian, Local Binary Patterns, and Gabor Dominant Orientation Histogram are used for classification. Some of these features have so far not got many attentions for document image layout analysis. By applying an Improved Fast Correlation-Based Filter feature selection algorithm, the redundant and irrelevant features are removed. Finally, the segmentation results are refined by a smoothing post-processing procedure. The proposed method is demonstrated by experhnents conducted on three different historical handwritten document image datasets. Experiments show the benefit of combining various color and texture features for classification. The results also show the advantage of using a feature selection method to choose optimal feature subset. By applying the proposed method we achieve superior accuracy compared with earlier work on several datasets, e.g., we achieved 93% accuracy compared with 91% of the previous method on the Parzival dataset which contains about 100 million pixels.
引用
收藏
页码:488 / 493
页数:6
相关论文
共 50 条
  • [41] Segmentation of images by color features: A survey
    Garcia-Lamont, Farid
    Cervantes, Jair
    Lopez, Asdrubal
    Rodriguez, Lisbeth
    NEUROCOMPUTING, 2018, 292 : 1 - 27
  • [42] Segmentation of ultrasonic ovarian images by texture features
    Jiang, CF
    Chen, ML
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 20, PTS 1-6: BIOMEDICAL ENGINEERING TOWARDS THE YEAR 2000 AND BEYOND, 1998, 20 : 850 - 853
  • [43] Seafloor Segmentation Using Combined Texture Features of Sidescan Sonar Images
    Huo, Guanying
    Li, Qingwu
    Zhou, Yan
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 3794 - 3799
  • [44] Text Line Segmentation for Unconstrained Handwritten Document Images Using Neighborhood Connected Component Analysis
    Khandelwal, Abhishek
    Choudhury, Pritha
    Sarkar, Ram
    Basu, Subhadip
    Nasipuri, Mita
    Das, Nibaran
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 369 - +
  • [45] Characters Segmentation from Arabic Handwritten Document Images: Hybrid Approach
    Boraik, Omar Ali
    Ravikumar, M.
    Saif, Mufeed Ahmed Naji
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 395 - 403
  • [46] Foreground text segmentation in complex color document images using Gabor filters
    S. Nirmala
    P. Nagabhushan
    Signal, Image and Video Processing, 2012, 6 : 669 - 678
  • [47] Aerial Images Visual Localization on a Vector Map Using Color Texture Segmentation
    Kunina, I. A.
    Teplyakov, L. M.
    Gladkov, A. P.
    Khanipov, T. M.
    Nikolaev, D. P.
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [48] Foreground text segmentation in complex color document images using Gabor filters
    Nirmala, S.
    Nagabhushan, P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2012, 6 (04) : 669 - 678
  • [49] The fuzzy integral for color seal segmentation on document images
    Soria-Frisch, A
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 157 - 160
  • [50] Adaptive fuzzy text segmentation in images with complex backgrounds using color and texture
    Gllavata, J
    Freisleben, B
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2005, 3691 : 756 - 765