Unified layout analysis and text localization framework

被引:4
|
作者
Vasilopoulos, Nikos [1 ]
Kavallieratou, Ergina [1 ]
机构
[1] Univ Aegean, Dept Informat & Commun Syst Engn, Samos, Greece
关键词
document images; page layout analysis; text localization; PAGE SEGMENTATION; IMAGES; COMPETITION; EXTRACTION; IDENTIFICATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1117/1.JEI.26.1.013009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A technique appropriate for extracting textual information from documents with complex layouts, such as newspapers and journals, is presented. It is a combination of a foreground analysis and a text localization method. The first one is used to segment the page in text and nontext blocks, whereas the second one is used to detect text that may be embedded inside images, charts, diagrams, tables, etc. Detailed experiments on two public databases showed that mixing layout analysis and text localization techniques can lead to improved page segmentation and text extraction results. (C) 2017 SPIE and IS&T
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Unified framework for recognition, localization and mapping using wearable cameras
    Vazquez-Martin, Ricardo
    Bandera, Antonio
    COGNITIVE PROCESSING, 2012, 13 : S351 - S354
  • [32] Unified framework for recognition, localization and mapping using wearable cameras
    Ricardo Vázquez-Martín
    Antonio Bandera
    Cognitive Processing, 2012, 13 : 351 - 354
  • [33] UniLoc: A Unified Mobile Localization Framework Exploiting Scheme Diversity
    Du, Wan
    Tong, Panrong
    Li, Mo
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 818 - 829
  • [34] UniLoc: A Unified Mobile Localization Framework Exploiting Scheme Diversity
    Du, Wan
    Tong, Panrong
    Li, Mo
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (07) : 2505 - 2517
  • [35] A UNIFIED FRAMEWORK FOR MULTIPLE ARRAYS ON A ROBOT AND APPLICATION TO SOUND LOCALIZATION
    Madmoni, L.
    Barfuss, H.
    Rafaely, B.
    Kellermann, W.
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 66 - 70
  • [36] Unified framework for recognition, localization and mapping using wearable cameras
    Vazquez-Martin, Ricardo
    Bandera, Antonio J.
    COGNITIVE PROCESSING, 2012, 13 : S5 - S5
  • [37] A Unified Analytical Framework for RSS-Based Localization Systems
    He, Jiajun
    Chun, Young Jin
    So, Hing Cheung
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (09) : 6506 - 6519
  • [38] Texterra: A framework for text analysis
    D. Yu. Turdakov
    N. A. Astrakhantsev
    Ya. R. Nedumov
    A. A. Sysoev
    I. A. Andrianov
    V. D. Mayorov
    D. G. Fedorenko
    A. V. Korshunov
    S. D. Kuznetsov
    Programming and Computer Software, 2014, 40 : 288 - 295
  • [39] Texterra: A framework for text analysis
    Turdakov, D. Yu.
    Astrakhantsev, N. A.
    Nedumov, Ya. R.
    Sysoev, A. A.
    Andrianov, I. A.
    Mayorov, V. D.
    Fedorenko, D. G.
    Korshunov, A. V.
    Kuznetsov, S. D.
    PROGRAMMING AND COMPUTER SOFTWARE, 2014, 40 (05) : 288 - 295
  • [40] A unified framework of medical information annotation and extraction for Chinese clinical text
    Zhu, Enwei
    Sheng, Qilin
    Yang, Huanwan
    Liu, Yiyang
    Cai, Ting
    Li, Jinpeng
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 142