A Chinese Document Layout Analysis Based on Non-text Images

被引:1
|
作者
Fu Xiaoling [1 ]
Li Xiaofeng [1 ]
机构
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
关键词
layout analysis; projection; connective region; threshold;
D O I
10.1109/IFCSTA.2009.85
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the paper as the medium of electronic information, traditional books, magazines, newspapers, etc are scanned into the images,and changed into electronic documents through OCR(optical character recognition) technology,layout analysis as an important part of OCR has played a greater role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image of the issue of text extraction, and there is great value in practice.
引用
收藏
页码:326 / 328
页数:3
相关论文
共 50 条
  • [21] Readability of Non-Text Images on the World Wide Web (WWW)
    Elahi, Ehsan
    Iglesias, Ana
    Morato, Jorge
    IEEE ACCESS, 2022, 10 : 116627 - 116634
  • [22] Adaptive layout analysis of document images
    Malerba, D
    Esposito, F
    Altamura, O
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 526 - 534
  • [23] Text and Non-text Segmentation based on Connected Component Features
    Viet Phuong Le
    Nayef, Nibal
    Visani, Muriel
    Ogier, Jean-Marc
    Cao De Tran
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1096 - 1100
  • [24] Layout analysis of urdu document images
    Shafait, Faisal
    Adnan-ul-Hasan
    Keysers, Daniel
    Breuel, Thomas M.
    10TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE 2006, PROCEEDINGS, 2006, : 293 - +
  • [25] Multi-script text versus non-text classification of regions in scene images
    Sriman, Bowornrat
    Schomaker, Lambert
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 23 - 42
  • [26] A Hybrid Approach for Document Layout Analysis in Document Images
    Shehzadi, Tahira
    Stricker, Didier
    Afzal, Muhammad Zeshan
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 21 - 39
  • [27] Text non-text classification based on area occupancy of equidistant pixels
    Khan, Tauseef
    Mollah, Ayatullah Faruk
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1889 - 1900
  • [28] Segmentation-Less Extraction of Text and Non-Text Regions From JPEG 2000 Compressed Document Images Through Partial and Intelligent Decompression
    Bisen, Tejasvee
    Javed, Mohammed
    Nagabhushan, P.
    Watanabe, Osamu
    IEEE ACCESS, 2023, 11 : 20673 - 20687
  • [29] A recurrent neural network based deep learning model for text and non-text stroke classification in online handwritten Devanagari document
    Ghosh, Rajib
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24245 - 24263
  • [30] User interface for text and non-text classification
    Thanh Thi Xuan Lam
    Anh Duc Le
    Nakagawa, Masaki
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDAR 2019 WORKSHOP) AND 2ND INTERNATIONAL WORKSHOP ON HUMAN-DOCUMENT INTERACTION, VOL 3, 2019, : 1 - 5