Using Multi-level Segmentation Features for Document Image Classification

Cited by: 0
Authors
Kaddas, Panagiotis [1 ,2 ]
Gatos, Basilis [1 ]
Affiliations
[1] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens 15310, Greece
[2] Univ Athens, Dept Informat & Telecommun, Athens 15784, Greece
Source
Keywords
Document image classification; Document image segmentation; Convolutional Neural Network; Deep Learning;
DOI
10.1007/978-3-031-06555-2_47
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document image classification is a crucial step in the processing pipeline for many purposes (e.g., indexing, OCR, keyword spotting) and is applied at early stages. At this point, textual information about the document (OCR) is usually not available, and additional features are required to achieve higher recognition accuracy. On the other hand, one may have reliable segmentation information (e.g., text block, paragraph, line, word, and symbol segmentation results), also extracted at pre-processing stages. In this paper, visual features are fused with segmentation analysis results in a novel integrated workflow that can easily be trained end-to-end. Significant improvements on popular datasets (Tobacco-3482 and RVL-CDIP) are presented in comparison with state-of-the-art methodologies that consider visual features.
Pages: 702-712
Page count: 11
Related papers
50 in total
  • [21] Image segmentation and classification using color features
    Stachowicz, MS
    Lemke, D
    PROCEEDINGS VIPROMCOM-2002, 2002, : 57 - 64
  • [22] Detection of abnormalities in ultrasound lung image using multi-level RVM classification
    Veeramani, Senthil Kumar
    Muthusamy, Ezhilarasi
    JOURNAL OF MATERNAL-FETAL & NEONATAL MEDICINE, 2016, 29 (11): : 1844 - 1852
  • [23] SEGMENTATION OF ABDOMINAL ORGANS FROM MR IMAGES USING MULTI-LEVEL HIERARCHICAL CLASSIFICATION
    Selvi, Esref
    Selver, M. Alper
    Kavur, Ali Emre
    Guzelis, Cuneyt
    Dicle, Oguz
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2015, 30 (03): : 533 - 546
  • [24] Multi-level spatial attention network for image data segmentation
    Guo, Jun
    Jiang, Zhixiong
    Jiang, Dingchao
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2021, 14 (03) : 289 - 299
  • [25] Multi-level dilated residual network for biomedical image segmentation
    Naga Raju Gudhe
    Hamid Behravan
    Mazen Sudah
    Hidemi Okuma
    Ritva Vanninen
    Veli-Matti Kosma
    Arto Mannermaa
    Scientific Reports, 11
  • [26] Object based image retrieval based on multi-level segmentation
    Xu, Y
    Duygulu, P
    Saber, E
    Tekalp, AM
    Yarman-Vural, FT
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2019 - 2022
  • [27] Multi-level Feature Attention Network for medical image segmentation
    Zhang, Yaning
    Yin, Jianjian
    Gu, Yanhui
    Chen, Yi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [28] Extended multi-level logistic model and SAR image segmentation
    Cao, YF
    Han, CZ
    Sun, H
    Yang, W
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 3700 - 3702
  • [29] Efficient Optimal Multi-level Thresholding for Biofilm Image Segmentation
    Rojas, Dario
    Rueda, Luis
    Urrutia, Homero
    Ngom, Alioune
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2009, 5780 : 307 - +
  • [30] Multi-level dilated residual network for biomedical image segmentation
    Gudhe, Naga Raju
    Behravan, Hamid
    Sudah, Mazen
    Okuma, Hidemi
    Vanninen, Ritva
    Kosma, Veli-Matti
    Mannermaa, Arto
    SCIENTIFIC REPORTS, 2021, 11 (01)