Layout Analysis for Arabic Historical Document Images Using Machine Learning

被引:43
|
作者
Bukhari, Syed Saqib [1 ]
Breuel, Thomas M. [1 ]
Asi, Abedelkadir [2 ]
El-Sana, Jihad [2 ]
机构
[1] Tech Univ Kaiserslautern, Kaiserslautern, Germany
[2] Ben Gurion Univ Negev, Negev, Israel
基金
以色列科学基金会;
关键词
SEGMENTATION;
D O I
10.1109/ICFHR.2012.227
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Page layout analysis is a fundamental step of any document image understanding system. We introduce an approach that segments text appearing in page margins (a.k.a side-notes text) from manuscripts with complex layout format. Simple and discriminative features are extracted in a connected-component level and subsequently robust feature vectors are generated. Multi-layer perception classifier is exploited to classify connected components to the relevant class of text. A voting scheme is then applied to refine the resulting segmentation and produce the final classification. In contrast to state-of-the-art segmentation approaches, this method is independent of block segmentation, as well as pixel level analysis. The proposed method has been trained and tested on a dataset that contains a variety of complex side-notes layout formats, achieving a segmentation accuracy of about 95%.
引用
收藏
页码:639 / 644
页数:6
相关论文
共 50 条
  • [31] Arabic Calligraphy Images Analysis with Transfer Learning
    Gurer, Dilara Zeynep
    Gokbay, Inci Zaim
    ELECTRICA, 2023, 24 (01): : 201 - 209
  • [32] Localization of Digit Strings in Farsi/Arabic Document Images Using Structural Features and Syntactical Analysis
    Abedi, Ali
    Faez, Karim
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 728 - 733
  • [33] LABA: Logical Layout Analysis of Book Page Images in Arabic Using Multiple Support Vector Machines
    Qin, Wenda
    Elanwar, Randa
    Betke, Margrit
    2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 35 - 40
  • [34] Translator attribution for Arabic using machine learning
    Mohamed, Emad
    Sarwar, Raheem
    Mostafa, Sayed
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (02) : 658 - 666
  • [35] A Chinese Document Layout Analysis Based on Non-text Images
    Fu Xiaoling
    Li Xiaofeng
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 326 - 328
  • [36] A two-step framework for text line segmentation in historical Arabic and Latin document images
    Olfa Mechi
    Maroua Mehri
    Rolf Ingold
    Najoua Essoukri Ben Amara
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 197 - 218
  • [37] A two-step framework for text line segmentation in historical Arabic and Latin document images
    Mechi, Olfa
    Mehri, Maroua
    Ingold, Rolf
    Essoukri Ben Amara, Najoua
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (03) : 197 - 218
  • [38] A Deep Learning-Based System for Document Layout Analysis
    Hong-Tai Tran
    Nam-Quan Nguyen
    Tuan-Anh Tran
    Xuan-Toan Mai
    Quoc-Thang Nguyen
    PROCEEDINGS OF 2022 THE 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, ICMLSC 20222, 2022, : 20 - 25
  • [39] Framework of Semantic Annotation of Arabic Document using Deep Learning
    Albukhitan, Saeed
    Alnazer, Ahmed
    Helmy, Tarek
    11TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 3RD INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2020, 170 : 989 - 994
  • [40] Arabic Sentiment Analysis for Student Evaluation using Machine Learning and the AraBERT Transformer
    Alamoudi, Huda
    Aljojo, Nahla
    Munshi, Asmaa
    Alghoson, Abdullah
    Banjar, Ameen
    Tashkandi, Araek
    Al-Tirawi, Anas
    Alsaleh, Iqbal
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (05) : 11945 - 11952