Complex documents images segmentation based on steerable pyramid features

被引:0
|
作者
Mohamed Benjelil
Slim Kanoun
Rémy Mullot
Adel M. Alimi
机构
[1] REGIM–ENIS,
[2] L3I,undefined
[3] University of La Rochelle,undefined
关键词
Steerable pyramid; Complex document segmentation; Multi-resolution analysis; Invariant features;
D O I
暂无
中图分类号
学科分类号
摘要
Page segmentation and classification is very important in document layout analysis system before it is presented to an OCR system or for any other subsequent processing steps. In this paper, we propose an accurate and suitably designed system for complex documents segmentation. This system is based on steerable pyramid transform. The features extracted from pyramid sub-bands serve to locate and classify regions into text (either machine-printed or handwritten) and non-text (images, graphics, drawings or paintings) in some noise-infected, deformed, multilingual, multi-script document images. These documents contain tabular structures, logos, stamps, handwritten script blocks, photographs, etc. The encouraging and promising results obtained on 1,000 official complex document images data set are presented in this research paper. We compared our results with those from existing state-of-the-art methods. This comparison shows that the proposed method performs consistently well on large sets of complex document images.
引用
收藏
页码:209 / 228
页数:19
相关论文
共 50 条
  • [31] 3D Steerable Pyramid based on conic filters
    Delle Luche, CA
    Denis, F
    Baskurt, A
    WAVELET APPLICATIONS IN INDUSTRIAL PROCESSING, 2003, 5266 : 260 - 268
  • [32] Volumetric Semantic Segmentation using Pyramid Context Features
    Barron, Jonathan T.
    Arbelaez, Pablo
    Keraenen, Soile V. E.
    Biggin, Mark D.
    Knowles, David W.
    Malik, Jitendra
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3448 - 3455
  • [33] Tracking of Deformable Object in Complex Video Using Steerable Pyramid Wavelet Transform
    Prakash, Om
    Khare, Manish
    Srivastava, Rajneesh Kumar
    Khare, Ashish
    COMPUTATIONAL VISION AND ROBOTICS, 2015, 332 : 1 - 6
  • [34] Discriminative features pyramid network for medical image segmentation
    Xie, Xiwang
    Xie, Lijie
    Li, Guanyu
    Guo, Hao
    Zhang, Weidong
    Shao, Feng
    Zhao, Wenyi
    Tong, Ling
    Pan, Xipeng
    An, Jubai
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2024, 44 (02) : 327 - 340
  • [35] Forest Segmentation with Spatial Pyramid Pooling Modules: A Surveillance System Based on Satellite Images
    Ru, Fung Xin
    Zulkifley, Mohd Asyraf
    Abdani, Siti Raihanah
    Spraggon, Martin
    FORESTS, 2023, 14 (02):
  • [36] Semantic Segmentation Method of Autonomous Driving Images Based on Atrous Spatial Pyramid Pooling
    Wang D.
    Liu L.
    Cao J.
    Zhao G.
    Zhao W.
    Tang W.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (12): : 1818 - 1824
  • [37] A segmentation pyramid for the interactive segmentation of 3-D images and video sequences
    Zanoguera, F
    Marcotegui, B
    Meyer, F
    MATHEMATICAL MORPHOLOGY AND ITS APPLICATIONS TO IMAGE AND SIGNAL PROCESSING, 2000, 18 : 223 - 232
  • [38] Clustering based segmentation of text in complex color images
    Mao, Wen-Ge
    Wang, Hong-Bin
    Zhang, Tian-Wen
    Journal of Harbin Institute of Technology (New Series), 2004, 11 (04) : 387 - 394
  • [39] Detection and segmentation of morphologically complex eukaryotic cells in fluorescence microscopy images via feature pyramid fusion
    Korfhage, Nikolaus
    Muehling, Markus
    Ringshandl, Stephan
    Becker, Anke
    Schmeck, Bernd
    Freisleben, Bernd
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (09)
  • [40] Image co-segmentation based on pyramid features cross-correlation network
    Jia CHEN
    Yasong CHEN
    Weihao LI
    Zhi LIU
    Sannyuya LIU
    Zongkai YANG
    ScienceChina(InformationSciences), 2023, 66 (01) : 316 - 317