Layout Analysis for Arabic Historical Document Images Using Machine Learning

Cited by: 43
Authors
Bukhari, Syed Saqib [1]
Breuel, Thomas M. [1]
Asi, Abedelkadir [2]
El-Sana, Jihad [2]
Affiliations
[1] Tech Univ Kaiserslautern, Kaiserslautern, Germany
[2] Ben Gurion Univ Negev, Negev, Israel
Funding
Israel Science Foundation
Keywords
SEGMENTATION
DOI
10.1109/ICFHR.2012.227
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Page layout analysis is a fundamental step of any document image understanding system. We introduce an approach that segments text appearing in page margins (a.k.a. side-note text) from manuscripts with complex layout formats. Simple and discriminative features are extracted at the connected-component level, and robust feature vectors are subsequently generated. A multi-layer perceptron classifier is used to assign each connected component to the relevant class of text. A voting scheme is then applied to refine the resulting segmentation and produce the final classification. In contrast to state-of-the-art segmentation approaches, this method is independent of block segmentation as well as pixel-level analysis. The proposed method has been trained and tested on a dataset that contains a variety of complex side-note layout formats, achieving a segmentation accuracy of about 95%.
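The abstract does not say how the voting scheme works, so the following is only an illustrative sketch of one plausible refinement step, not the authors' method. It assumes each connected component is summarized by a centroid and already carries a label predicted by the classifier, and it relabels each component by majority vote among itself and its k nearest neighbours. The function name `refine_labels_by_voting`, the parameter `k`, the label names, and the toy coordinates are all hypothetical.

```python
from collections import Counter
from math import hypot


def refine_labels_by_voting(centroids, labels, k=3):
    """Relabel each component by majority vote over itself and its k nearest neighbours.

    Assumption: neighbourhood is defined by Euclidean distance between
    component centroids. The abstract does not specify this.
    """
    refined = []
    for i, (x, y) in enumerate(centroids):
        # Rank all other components by distance to component i.
        others = sorted(
            (j for j in range(len(centroids)) if j != i),
            key=lambda j: hypot(centroids[j][0] - x, centroids[j][1] - y),
        )
        # The component's own predicted label counts as one vote.
        votes = Counter([labels[i]] + [labels[j] for j in others[:k]])
        top = max(votes.values())
        winners = [label for label, count in votes.items() if count == top]
        # On a tie that includes the original label, keep the original label.
        refined.append(labels[i] if labels[i] in winners else winners[0])
    return refined


# Toy page: four components in the main text area, four in the margin.
# One component in each group was misclassified by the (hypothetical) classifier.
centroids = [(10, 10), (12, 11), (11, 13), (13, 12),
             (100, 10), (102, 12), (101, 14), (103, 11)]
labels = ["main", "main", "side", "main",
          "side", "side", "main", "side"]

refined = refine_labels_by_voting(centroids, labels, k=3)
print(refined)
```

In this toy input the two isolated misclassifications are outvoted by their spatial neighbours, which is the general effect a refinement vote is meant to have. A real system would need a neighbourhood definition suited to the manuscript layout.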
Pages: 639-644
Number of pages: 6