Visual Similarity Based Document Layout Analysis

Cited by: 0
Authors
Di Wen
Xiao-Qing Ding
Affiliation
[1] Tsinghua University, Department of Electronic Engineering & State Key Laboratory of Intelligent Technology and Systems
Keywords
document layout analysis; texture analysis; dynamic clustering
DOI
Not available
Abstract
This paper proposes a visual similarity based document layout analysis (DLA) scheme that uses a clustering strategy to adapt to documents in different languages, with different layout structures and skew angles. Aiming at a robust and adaptive DLA approach, the authors first use a visual similarity testing process to find a set of representative filters and statistics that characterize typical texture patterns in document images. Texture features are then extracted with these filters and passed to a dynamic clustering procedure, called visual similarity clustering. Finally, text content is located from the clustering results. Owing to this scheme, the algorithm demonstrates strong robustness and adaptability across a wide variety of documents, which traditional DLA approaches do not possess.
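The pipeline described in the abstract (filter responses, texture features, clustering, text location) can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not name the filters, statistics, or clustering algorithm, so the two difference filters, the block size, the plain k-means step (standing in for the paper's dynamic clustering), and all function names below are assumptions chosen only for illustration.

```python
import numpy as np

def make_synthetic_page(size=64):
    """Hypothetical test page: white background with one striped 'text' region."""
    page = np.full((size, size), 255, dtype=np.uint8)
    page[16:48:2, 8:56] = 0  # every other row is black, imitating text lines
    return page

def block_texture_features(image, block=8):
    """Mean absolute response of two difference filters per block (assumed features)."""
    img = image.astype(float)
    dx = np.abs(np.diff(img, axis=1, prepend=img[:, :1]))  # horizontal changes
    dy = np.abs(np.diff(img, axis=0, prepend=img[:1, :]))  # vertical changes
    rows, cols = img.shape[0] // block, img.shape[1] // block
    feats = np.empty((rows, cols, 2))
    for k, resp in enumerate((dx, dy)):
        cropped = resp[:rows * block, :cols * block]
        # Average each filter response over non-overlapping block x block tiles.
        feats[:, :, k] = cropped.reshape(rows, block, cols, block).mean(axis=(1, 3))
    return feats

def kmeans(points, k=2, iters=20):
    """Plain k-means with a deterministic start (a stand-in for dynamic clustering)."""
    order = np.argsort(points.sum(axis=1))
    picks = np.linspace(0, len(points) - 1, k).astype(int)
    centers = points[order[picks]]  # fancy indexing returns a copy
    labels = np.zeros(len(points), dtype=int)
    for _ in range(iters):
        dist = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    return labels, centers

def locate_text_blocks(image, block=8):
    """Return a boolean block mask; the cluster with more texture energy is called text."""
    feats = block_texture_features(image, block)
    rows, cols, _ = feats.shape
    labels, centers = kmeans(feats.reshape(-1, 2), k=2)
    text_cluster = centers.sum(axis=1).argmax()
    return (labels == text_cluster).reshape(rows, cols)

page = make_synthetic_page()
mask = locate_text_blocks(page)
print(mask.astype(int))
```

On the synthetic page, the blocks covering the striped region fall into the high-energy cluster and the blank margins into the other. Real document images would need richer filters and a clustering step that can choose the number of clusters, which is presumably what the paper's dynamic clustering addresses.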
Pages: 459 - 465
Page count: 6