Analyzing the potential of active learning for document image classification

被引:3
|
作者
Saifullah, Saifullah [1 ,2 ]
Agne, Stefan [1 ,3 ]
Dengel, Andreas [1 ,2 ]
Ahmed, Sheraz [1 ,3 ]
机构
[1] German Res Ctr Artificial Intelligence, D-67663 Kaiserslautern, Germany
[2] RPTU Kaiserslautern Landau, D-67663 Kaiserslautern, Germany
[3] DeepReader GmbH, D-67663 Kaiserslautern, Germany
关键词
Document image classification; Document analysis; Active learning; Deep active learning; NEURAL-NETWORKS;
D O I
10.1007/s10032-023-00429-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, a great deal of emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require huge volumes of annotated data in order to achieve competitive performances. And since data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning to reduce the costs of data annotation in the context of document image classification, which is one of the core components of modern document processing pipelines. The results of this study demonstrate that by utilizing active learning (AL), deep document classification models can achieve competitive performances to the models trained on fully annotated datasets and, in some cases, even surpass them by annotating only 15-40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying, and in many cases achieve comparable performance to the models trained on fully annotated datasets even in the presence of practical deployment issues such as data imbalance, and annotation noise, and thus, offer tremendous benefits in real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at .
引用
收藏
页码:187 / 209
页数:23
相关论文
共 50 条
  • [21] Unlabeled data selection for active learning in image classification
    Li, Xiongquan
    Wang, Xukang
    Chen, Xuhesheng
    Lu, Yao
    Fu, Hongpeng
    Wu, Ying Cheng
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [22] Active Learning Methods for Remote Sensing Image Classification
    Tuia, Devis
    Ratle, Frederic
    Pacifici, Fabio
    Kanevski, Mikhail F.
    Emery, William J.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (07): : 2218 - 2232
  • [23] Two-dimensional active learning for image classification
    Qi, Guo-Jun
    Hua, Xian-Sheng
    Rui, Yong
    Tang, Jinhui
    Zhang, Hong-Jiang
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 325 - +
  • [24] K-COVERS FOR ACTIVE LEARNING IN IMAGE CLASSIFICATION
    Shen, Yeji
    Song, Yuhang
    Li, Hanhan
    Kamali, Shahab
    Wang, Bin
    Kuo, C. -C. Jay
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 288 - 293
  • [25] Substep active deep learning framework for image classification
    Guoqiang Li
    Ning Gong
    Pattern Analysis and Applications, 2021, 24 : 23 - 34
  • [26] AN NOVEL ACTIVE LEARNING STRATEGY FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Shi, Qian
    Zhang, Liangpei
    Du, Bo
    2012 4TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING (WHISPERS), 2012,
  • [27] Unlabeled data selection for active learning in image classification
    Xiongquan Li
    Xukang Wang
    Xuhesheng Chen
    Yao Lu
    Hongpeng Fu
    Ying Cheng Wu
    Scientific Reports, 14
  • [28] Substep active deep learning framework for image classification
    Li, Guoqiang
    Gong, Ning
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (01) : 23 - 34
  • [29] Mammographic Image Classification System via Active Learning
    Zhao, Yu
    Chen, Dong
    Xie, Hongzhi
    Zhang, Shuyang
    Gu, Lixu
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2019, 39 (04) : 569 - 582
  • [30] Active Convolution: Learning the Shape of Convolution for Image Classification
    Jeon, Yunho
    Kim, Junmo
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1846 - 1854