A Chinese Document Layout Analysis Based on Non-text Images

被引:1
|
作者
Fu Xiaoling [1 ]
Li Xiaofeng [1 ]
机构
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
关键词
layout analysis; projection; connective region; threshold;
D O I
10.1109/IFCSTA.2009.85
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the paper as the medium of electronic information, traditional books, magazines, newspapers, etc are scanned into the images,and changed into electronic documents through OCR(optical character recognition) technology,layout analysis as an important part of OCR has played a greater role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image of the issue of text extraction, and there is great value in practice.
引用
收藏
页码:326 / 328
页数:3
相关论文
共 50 条
  • [31] Distinguishing Text/Non-Text Natural Images with Multi-Dimensional Recurrent Neural Networks
    Lyu, Pengyuan
    Shi, Baoguang
    Zhang, Chengquan
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3981 - 3986
  • [32] A recurrent neural network based deep learning model for text and non-text stroke classification in online handwritten Devanagari document
    Rajib Ghosh
    Multimedia Tools and Applications, 2022, 81 : 24245 - 24263
  • [33] Text Classification and Document Layout Analysis of Paper Fragments
    Diem, Markus
    Kleber, Florian
    Sablatnig, Robert
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 854 - 858
  • [34] Text detection method in document images based on multiresolution analysis
    Lee, Geum-Boon
    Shin, Dong-Guk
    Cho, Beom-Joon
    WMSCI 2007 : 11TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, POST CONFERENCE ISSUE, PROCEEDINGS, 2007, : 200 - +
  • [35] A novel OCR approach based on document layout analysis and text block classification
    Zhu, Weiheng
    Liu, Yuanfeng
    Hao, Liang
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 91 - 94
  • [36] INDEXING AND RETRIEVAL OF NON-TEXT INFORMATION
    Vermeij, Hermine
    CATALOGING & CLASSIFICATION QUARTERLY, 2013, 51 (08) : 945 - 946
  • [37] Distance Transform-Based Stroke Feature Descriptor for Text Non-text Classification
    Khan, Tauseef
    Mollah, Ayatullah Faruk
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 189 - 200
  • [38] Text/Image Region Separation for Document Layout Detection of Old Document Images using Non-linear Diffusion and Level Set
    Kumar, Sachin S.
    Rajendran, Parvathy
    Prabaharan, P.
    Soman, K. P.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 469 - 477
  • [39] Segmentation of Text and Non-text in On-Line Handwritten Patient Record Based on Spatio-Temporal Analysis
    Waranusast, Rattapoom
    Haddawy, Peter
    Dailey, Matthew
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2009, 5651 : 345 - 354
  • [40] Open Evaluation Tool for Layout Analysis of Document Images
    Alberti, Michele
    Bouillon, Manuel
    Ingold, Rolf
    Liwicki, Marcus
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 4, 2017, : 43 - 47