A machine-learning approach for analyzing document layout structures with two reading orders

被引:9
|
作者
Wu, Chung-Chih [1 ]
Chou, Chien-Hsing [1 ]
Chang, Fu [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei 115, Taiwan
关键词
binary decision; document layout analysis; reading order; support vector machine; taboo box; textline; text region;
D O I
10.1016/j.patcog.2008.03.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of document layout analysis is to locate textlines and text regions in document images mostly via a series of split-or-merge operations. Before applying such an operation, however, it is necessary to examine the context to decide whether the place chosen for the operation is appropriate. We thus view document layout analysis as a matter of solving a series of binary decision problems, such as whether to apply, or not to apply, a split-or-merge operation to a chosen place. To solve these problems, we use support vector machines to learn whether OF not to apply the previously mentioned operations from training documents in which all textlines and text regions have been located and their identifies labeled. The proposed approach is very effective for analyzing documents that allow both horizontal and vertical reading orders. When applied to a test data set composed of eight types of layout structure, the approach's accuracy rates for identifying textlines and text regions are 98.83% and 96.72%, respectively. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3200 / 3213
页数:14
相关论文
共 50 条
  • [1] Correcting the document layout: A machine learning approach
    Malerba, D
    Esposito, F
    Altamura, O
    Ceci, M
    Berardi, M
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 97 - 102
  • [2] Machine-learning approach predicts RNA structures
    Arnaud, Celia
    CHEMICAL & ENGINEERING NEWS, 2021, 99 (32) : 8 - 8
  • [3] A Machine-Learning Based Approach for Extracting Logical Structure of a Styled Document
    Kim, Tae-young
    Kim, Suntae
    Choi, Sangchul
    Kim, Jeong-Ah
    Choi, Jae-Young
    Ko, Jong-Won
    Lee, Jee-Huong
    Cho, Youngwha
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (02): : 1043 - 1056
  • [4] Prediction of project activity delays caused by variation orders: a machine-learning approach
    Nishat, Mirza Muntasir
    Neraas, Sander Magnussen
    Marsov, Andrei
    Olsson, Nils O. E.
    12TH NORDIC CONFERENCE ON CONSTRUCTION ECONOMICS AND ORGANISATION, 2024, 2024, 1389
  • [5] Machine-learning mathematical structures
    He, Yang-Hui
    arXiv, 2021,
  • [6] A Machine-Learning Approach to Distinguish Passengers and Drivers Reading While Driving
    Torres, Renato
    Ohashi, Orlando
    Pessin, Gustavo
    SENSORS, 2019, 19 (14)
  • [7] Analyzing Momentum Shifts in Tennis: A Machine-Learning Approach to Predicting Match Outcomes
    Xia, Yuean
    Li, Changfeng
    Zhang, Tanran
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [8] Learning Reading Order via Document Layout with Layout2Pos
    Nguyen, Laura
    Piwowarski, Benjamin
    Laborde, Julio
    Moyse, Gilles
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, PT I, TPDL 2024, 2024, 15177 : 3 - 19
  • [9] Machine-learning approach for local classification of crystalline structures in multiphase systems
    Dietz, C.
    Kretz, T.
    Thoma, M. H.
    PHYSICAL REVIEW E, 2017, 96 (01)
  • [10] Analyzing document logic structure by machine learning
    Liu, GS
    Wang, YC
    Hu, PH
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 179 - 183