A system for the automatic layout segmentation and classification of digital documents

被引:0
|
作者
Cinque, L [1 ]
Levialdi, S [1 ]
Malizia, A [1 ]
机构
[1] Univ Roma La Sapienza, Dept Informat Sci, I-00198 Rome, Italy
关键词
pattern recognition; image processing and segmentation; document analysis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper documents recognition is fundamental for office automation becoming every day a more powerful tool in those fields where information is still on paper. Document recognition follows from data acquisition, from both journals, and entire books in order to transform them in digital objects. We present a new system for Document recognition that follows the Open Source methodologies, XML description for documents segmentation and classification, which turns to be beneficial in terms of classification precision, and general-purpose availability.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 50 条
  • [21] Automatic classification of journalistic documents on the Internet
    Oliveira, Elias
    Branquinho Filho, Delermando
    TRANSINFORMACAO, 2017, 29 (03): : 245 - 255
  • [22] Automatic segmentation and desereening of scanned color documents
    Rao, AR
    Thompson, G
    IS&T'S NIP16: INTERNATIONAL CONFERENCE ON DIGITAL PRINTING TECHNOLOGIES, 2000, : 818 - 821
  • [23] Automatic segmentation and annotation of audio archive documents
    Bohac, Marek
    Blavka, Karel
    2011 10TH INTERNATIONAL WORKSHOP ON ELECTRONICS, CONTROL, MEASUREMENT AND SIGNALS (ECMS), 2011, : 61 - 66
  • [24] AUTOMATIC SEGMENTATION FOR ARABIC CHARACTERS IN HANDWRITING DOCUMENTS
    Lawgali, A.
    Bouridane, A.
    Angelova, M.
    Ghassemlooy, Z.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [25] Automatic Indexing of Scanned Documents - a Layout-based Approach
    Esser, Daniel
    Schuster, Daniel
    Muthmann, Klemens
    Berger, Michael
    Schill, Alexander
    DOCUMENT RECOGNITION AND RETRIEVAL XIX, 2012, 8297
  • [26] THE LAYOUT SYNTHESIZER - AN AUTOMATIC NETLIST-TO-LAYOUT SYSTEM
    CHEN, CC
    CHOW, SL
    26TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, 1989, : 232 - 238
  • [27] A hierarchical classification strategy for digital documents
    Schettini, R
    Brambilla, C
    Ciocca, G
    Valsasna, A
    De Ponti, M
    PATTERN RECOGNITION, 2002, 35 (08) : 1759 - 1769
  • [28] Text mining in the classification of digital documents
    Contreras Barrera, Marcial
    BIBLIOS-REVISTA DE BIBLIOTECOLOGIA Y CIENCIAS DE LA INFORMACION, 2016, (64): : 33 - 43
  • [29] Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
    Himel, Galib Muhammad Shahriar
    Islam, Md. Masudul
    Al-Aff, Kh. Abdullah
    Karim, Shams Ibne
    Sikder, Md. Kabir Uddin
    INTERNATIONAL JOURNAL OF BIOMEDICAL IMAGING, 2024, 2024
  • [30] Study for Automatic Classification of Arabic Spoken Documents
    Labidi, Mohamed
    Maraoui, Mohsen
    Zrigui, Mounir
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2017, PT II, 2017, 10449 : 459 - 468