Analyzing the potential of active learning for document image classification

Cited by: 3
Authors
Saifullah, Saifullah [1, 2]
Agne, Stefan [1, 3]
Dengel, Andreas [1, 2]
Ahmed, Sheraz [1, 3]
Affiliations
[1] German Res Ctr Artificial Intelligence, D-67663 Kaiserslautern, Germany
[2] RPTU Kaiserslautern Landau, D-67663 Kaiserslautern, Germany
[3] DeepReader GmbH, D-67663 Kaiserslautern, Germany
Keywords
Document image classification; Document analysis; Active learning; Deep active learning; NEURAL-NETWORKS;
DOI
10.1007/s10032-023-00429-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, a great deal of emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require huge volumes of annotated data to achieve competitive performance. Because data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning (AL) to reduce the cost of data annotation in the context of document image classification, which is one of the core components of modern document processing pipelines. The results demonstrate that, with AL, deep document classification models can achieve performance competitive with models trained on fully annotated datasets and, in some cases, even surpass them while annotating only 15-40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying and in many cases achieve performance comparable to that of models trained on fully annotated datasets, even in the presence of practical deployment issues such as data imbalance and annotation noise, and thus offer tremendous benefits in real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at .
Pages: 187-209
Page count: 23