Visual Similarity Based Document Layout Analysis

被引:0
|
作者
Di Wen
Xiao-Qing Ding
机构
[1] Tsinghua University,Department of Electronic Engineering & State Key Laboratory of Intelligent Technology and Systems
关键词
document layout analysis; texture analysis; dynamic clustering;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a visual similarity based document layout analysis (DLA) scheme is proposed, which by using clustering strategy can adaptively deal with documents in different languages, with different layout structures and skew angles. Aiming at a robust and adaptive DLA approach, the authors first manage to find a set of representative filters and statistics to characterize typical texture patterns in document images, which is through a visual similarity testing process. Texture features are then extracted from these filters and passed into a dynamic clustering procedure, which is called visual similarity clustering. Finally, text contents are located from the clustered results. Benefit from this scheme, the algorithm demonstrates strong robustness and adaptability in a wide variety of documents, which previous traditional DLA approaches do not possess.
引用
收藏
页码:459 / 465
页数:6
相关论文
共 50 条
  • [21] Local Descriptors for Document Layout Analysis
    Garz, Angelika
    Diem, Markus
    Sablatnig, Robert
    ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 29 - 38
  • [22] THE DOCUMENT SPECTRUM FOR PAGE LAYOUT ANALYSIS
    OGORMAN, L
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (11) : 1162 - 1173
  • [23] Adaptive layout analysis of document images
    Malerba, D
    Esposito, F
    Altamura, O
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 526 - 534
  • [24] DOCUMENT IMAGE SEGMENTATION AND LAYOUT ANALYSIS
    SAITOH, T
    YAMAAI, T
    TACHIKAWA, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (07) : 778 - 784
  • [25] Layout analysis of urdu document images
    Shafait, Faisal
    Adnan-ul-Hasan
    Keysers, Daniel
    Breuel, Thomas M.
    10TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE 2006, PROCEEDINGS, 2006, : 293 - +
  • [26] Document Layout Analysis: A Comprehensive Survey
    Binmakhashen, Galal M.
    Mahmoud, Sabri A.
    ACM COMPUTING SURVEYS, 2020, 52 (06)
  • [27] Historical Document Layout Analysis Competition
    Antonacopoulos, A.
    Clausner, C.
    Papadopoulos, C.
    Pletschacher, S.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1516 - 1520
  • [28] Document Reconstruction by Layout Analysis of Snippets
    Kleber, Florian
    Diem, Markus
    Sablatnig, Robert
    COMPUTER VISION AND IMAGE ANALYSIS OF ART, 2010, 7531
  • [29] A Methodological Study of Document Layout Analysis
    Zhang, Chunhu
    Ibrayim, Mayire
    Hamdulla, Askar
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 12 - 17
  • [30] A Document Layout Analysis Method Based on Morphological Operators and Connected Components
    Alarcon Arenas, Sebastian W.
    Meza-Lovon, Graciela L.
    Yari, Yessenia
    2018 XLIV LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2018), 2018, : 622 - 631