End-to-end semi-supervised approach with modulated object queries for table detection in documents

被引:0
|
作者
Ehsan, Iqraa [1 ]
Shehzadi, Tahira [1 ,2 ,3 ]
Stricker, Didier [1 ,2 ,3 ]
Afzal, Muhammad Zeshan [1 ,2 ,3 ]
机构
[1] Tech Univ Kaiserslautern, Dept Comp Sci, D-67663 Kaiserslautern, Germany
[2] Tech Univ Kaiserslautern, Mindgarage, D-67663 Kaiserslautern, Germany
[3] German Res Inst Artificial Intelligence DFKI, Comp Vis, D-67663 Kaiserslautern, Germany
关键词
Table detection; Document analysis; Semi-supervised learning; Detection transformer;
D O I
10.1007/s10032-024-00471-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Table detection, a pivotal task in document analysis, aims to precisely recognize and locate tables within document images. Although deep learning has shown remarkable progress in this realm, it typically requires an extensive dataset of labeled data for proficient training. Current CNN-based semi-supervised table detection approaches use the anchor generation process and non-maximum suppression in their detection process, limiting training efficiency. Meanwhile, transformer-based semi-supervised techniques adopted a one-to-one match strategy that provides noisy pseudo-labels, limiting overall efficiency. This study presents an innovative transformer-based semi-supervised table detector. It improves the quality of pseudo-labels through a novel matching strategy combining one-to-one and one-to-many assignment techniques. This approach significantly enhances training efficiency during the early stages, ensuring superior pseudo-labels for further training. Our semi-supervised approach is comprehensively evaluated on benchmark datasets, including PubLayNet, ICADR-19, and TableBank. It achieves new state-of-the-art results, with a mAP of 95.7% and 97.9% on TableBank (word) and PubLaynet with 30% label data, marking a 7.4 and 7.6 point improvement over previous semi-supervised table detection approach, respectively. The results clearly show the superiority of our semi-supervised approach, surpassing all existing state-of-the-art methods by substantial margins. This research represents a significant advancement in semi-supervised table detection methods, offering a more efficient and accurate solution for practical document analysis tasks.
引用
收藏
页码:363 / 378
页数:16
相关论文
共 50 条
  • [1] End-to-End Semi-Supervised Object Detection with Soft Teacher
    Xu, Mengde
    Zhang, Zheng
    Hu, Han
    Wang, Jianfeng
    Wang, Lijuan
    Wei, Fangyun
    Bai, Xiang
    Liu, Zicheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3040 - 3049
  • [2] Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework
    Zhou, Qiang
    Yu, Chaohui
    Wang, Zhibin
    Qian, Qi
    Li, Hao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4079 - 4088
  • [3] Towards End-to-End Semi-supervised Table Detection with Semantic Aligned Matching Transformer
    Shehzadi, Tahira
    Sarode, Shalini
    Stricker, Didier
    Afzal, Muhammad Zeshan
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 295 - 318
  • [4] End-to-End Semi-Supervised Learning for Video Action Detection
    Kumar, Akash
    Rawat, Yogesh Singh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14680 - 14690
  • [5] Semi-Supervised End-to-End Speech Recognition
    Karita, Shigeki
    Watanabe, Shinji
    Iwata, Tomoharu
    Ogawa, Atsunori
    Delcroix, Marc
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2 - 6
  • [6] Towards Precise End-to-end Semi-Supervised Human Head Detection Network
    Li, Rongchun
    Zhang, Junjie
    Liu, Yuntao
    Dou, Yong
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] Semi-Supervised End-to-End Learning for Integrated Sensing and Communications
    Mateos-Ramos, Jose Miguel
    Chatelier, Baptiste
    Hager, Christian
    Keskin, Musa Furkan
    Le Magoarou, Luc
    Wymeersch, Henk
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 132 - 138
  • [8] GrowingNet: An end-to-end growing network for semi-supervised learning
    Zhang, Qifei
    Yu, Xiaomo
    COMPUTER COMMUNICATIONS, 2020, 151 : 208 - 215
  • [9] ACTIVEMATCH: END-TO-END SEMI-SUPERVISED ACTIVE REPRESENTATION LEARNING
    Yuan, Xinkai
    Li, Zilinghan
    Wang, Gaoang
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1136 - 1140
  • [10] End-to-End Semi-supervised Learning for Differentiable Particle Filters
    Wen, Hao
    Chen, Xiongjie
    Papagiannis, Georgios
    Hu, Conghui
    Li, Yunpeng
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5825 - 5831