Analyzing the potential of active learning for document image classification

被引:0
|
作者
Saifullah Saifullah
Stefan Agne
Andreas Dengel
Sheraz Ahmed
机构
[1] German Research Center for Artificial Intelligence,
[2] RPTU Kaiserslautern-Landau,undefined
[3] DeepReader GmbH,undefined
关键词
Document image classification; Document analysis; Active learning; Deep active learning;
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, a great deal of emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require huge volumes of annotated data in order to achieve competitive performances. And since data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning to reduce the costs of data annotation in the context of document image classification, which is one of the core components of modern document processing pipelines. The results of this study demonstrate that by utilizing active learning (AL), deep document classification models can achieve competitive performances to the models trained on fully annotated datasets and, in some cases, even surpass them by annotating only 15–40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying, and in many cases achieve comparable performance to the models trained on fully annotated datasets even in the presence of practical deployment issues such as data imbalance, and annotation noise, and thus, offer tremendous benefits in real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at https://github.com/saifullah3396/doc_al.
引用
收藏
页码:187 / 209
页数:22
相关论文
共 50 条
  • [1] Analyzing the potential of active learning for document image classification
    Saifullah, Saifullah
    Agne, Stefan
    Dengel, Andreas
    Ahmed, Sheraz
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (03) : 187 - 209
  • [2] Analyzing the Potential of Zero-Shot Recognition for Document Image Classification
    Siddiqui, Shoaib Ahmed
    Dengel, Andreas
    Ahmed, Sheraz
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 293 - 304
  • [3] Classification of Text regions in a Document Image by Analyzing the properties of Connected Components
    Bhowmik, Showmik
    Sarkar, Ram
    PROCEEDINGS OF 2020 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON 2020), 2020, : 36 - 40
  • [4] Adaptive Active Learning for Image Classification
    Li, Xin
    Guo, Yuhong
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 859 - 866
  • [5] DEEP ACTIVE LEARNING FOR IMAGE CLASSIFICATION
    Ranganathan, Hiranmayi
    Venkateswara, Hemanth
    Chakraborty, Shayok
    Panchanathan, Sethuraman
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3934 - 3938
  • [6] DOCUMENT IMAGE AND ZONE CLASSIFICATION THROUGH INCREMENTAL LEARNING
    Bouguelia, Mohamed-Rafik
    Belaid, Yolande
    Belaid, Abdel
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4230 - 4234
  • [7] ACTIVE MANIFOLD LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Zhang, Zhou
    Taskin, Gulsen
    Crawford, Melba M.
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2587 - 2590
  • [8] Explorations in Active Learning Applied to Image Classification
    Klimczak, Adriana
    Wenka, Marcel
    Ganzha, Maria
    Paprzycki, Marcin
    BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2022, 2023, 13830 : 17 - 30
  • [9] Focused active learning for histopathological image classification *
    Schmidt, Arne
    Morales-Alvarez, Pablo
    Cooper, Lee A. D.
    Newberg, Lee A.
    Enquobahrie, Andinet
    Molina, Rafael
    Katsaggelos, Aggelos K.
    MEDICAL IMAGE ANALYSIS, 2024, 95
  • [10] Scalable Active Learning for Multiclass Image Classification
    Joshi, Ajay J.
    Porikli, Fatih
    Papanikolopoulos, Nikolaos P.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2259 - 2273