Continual Learning for Table Detection in Document Images

被引:6
|
作者
Minouei, Mohammad [1 ,2 ]
Hashmi, Khurram Azeem [1 ,2 ]
Soheili, Mohammad Reza [3 ]
Afzal, Muhammad Zeshan [1 ,2 ]
Stricker, Didier [1 ,2 ]
机构
[1] Tech Univ Kaiserslautern, Dept Comp Sci, D-67663 Kaiserslautern, Germany
[2] German Res Inst Artificial Intelligence DFKI, D-67663 Kaiserslautern, Germany
[3] Kharazmi Univ, Dept Elect & Comp Engn, Tehran 1571914911, Iran
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 18期
关键词
table detection; document layout analysis; continual learning; incremental learning; experience replay;
D O I
10.3390/app12188969
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The growing amount of data demands methods that can gradually learn from new samples. However, it is not trivial to continually train a network. Retraining a network with new data usually results in a phenomenon called "catastrophic forgetting". In a nutshell, the performance of the model on the previous data drops by learning from the new instances. This paper explores this issue in the table detection problem. While there are multiple datasets and sophisticated methods for table detection, the utilization of continual learning techniques in this domain has not been studied. We employed an effective technique called experience replay and performed extensive experiments on several datasets to investigate the effects of catastrophic forgetting. The results show that our proposed approach mitigates the performance drop by 15 percent. To the best of our knowledge, this is the first time that continual learning techniques have been adopted for table detection, and we hope this stands as a baseline for future research.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] On automated workflow for fine-tuning deepneural network models for table detection in document images
    Cherepanov, Igor
    Mikhailov, Andrey
    Shigarov, Alexey
    Paramonov, Viacheslav
    2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 1130 - 1133
  • [42] Automatic Table Detection and Retention from Scanned Document Images via Analysis of Structural Information
    Ranka, Varsha
    Patil, Shubham
    Patni, Shubham
    Raut, Tushar
    Mehrotra, Kapil
    Gupta, Manish Kumar
    2017 FOURTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2017, : 244 - 249
  • [43] Altered Handwritten Text Detection in Document Images Using Deep Learning
    Patil, Gayatri
    Palaiahnakote, Shivakumara
    Gornale, Shivanand S.
    Lopresti, Daniel P.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (03)
  • [44] DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images
    Schreiber, Sebastian
    Agne, Stefan
    Wolf, Ivo
    Dengel, Andreas
    Ahmed, Sheraz
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1162 - 1167
  • [45] Word extraction from table regions in document images
    Jeong, CB
    Park, SC
    Son, HJ
    Kim, SH
    DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND SHARING EXPERIENCES, PROCEEDINGS, 2005, 3815 : 214 - 223
  • [46] Learning to segment document images
    Kumar, KSS
    Namboodiri, A
    Jawahar, CV
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 471 - 476
  • [47] CasTabDetectoRS: Cascade Network for Table Detection in Document Images with Recursive Feature Pyramid and Switchable Atrous Convolution
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    JOURNAL OF IMAGING, 2021, 7 (10)
  • [48] DOCUMENT SHADOW REMOVAL WITH FOREGROUND DETECTION LEARNING FROM FULLY SYNTHETIC IMAGES
    Matsuo, Yuhi
    Akimoto, Naofumi
    Aoki, Yoshimitsu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1656 - 1660
  • [49] A deep learning based system for mathematical expression detection and recognition in document images
    Bui Hai Phong
    Loung Tan Da
    Nguyen Thi Yen
    Thang Math Hoang
    Thi-Lan Le
    2020 12TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (IEEE KSE 2020), 2020, : 85 - 90
  • [50] A Model Based Framework for Table Processing in Degraded Document Images
    Shi, Zhixin
    Setlur, Srirangaraj
    Govindaraju, Venu
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 963 - 967