Fine-grained Pseudo Labels for Scene Text Recognition

被引:0
|
作者
Li, Xiaoyu [1 ]
Chen, Xiaoxue [1 ]
Huang, Zuming [1 ]
Xie, Lele [1 ]
Chen, Jingdong [1 ]
Yang, Ming [1 ]
机构
[1] Ant Grp, Hangzhou, Peoples R China
关键词
pseudo labels; domain shift; scene text recognition;
D O I
10.1145/3581783.3611791
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pseudo-Labeling based semi-supervised learning has shown promising advantages in Scene Text Recognition (STR). Most of them usually use a pre-trained model to generate sequence-level pseudo labels for text images and then re-train the model. Recently, conducting Pseudo-Labeling in a teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has become increasingly popular, which trains in an end-to-end manner and yields outstanding performance in semi-supervised learning. However, applying this framework directly to Pseudo-Labeling STR exhibits unstable convergence, as generating pseudo labels at the coarse-grained sequence-level leads to inefficient utilization of unlabelled data. Furthermore, the inherent domain shift between labeled and unlabeled data results in low quality of derived pseudo labels. To mitigate the above issues, we propose a novel Cross-domain Pseudo-Labeling (CPL) approach for scene text recognition, which makes better utilization of unlabeled data at the character-level and provides more accurate pseudo labels. Specifically, our proposed Pseudo-Labeled Curriculum Learning dynamically adjusts the thresholds for different character classes according to the model's learning status. Moreover, an Adaptive Distribution Regularizer is employed to bridge the domain gap and improve the quality of pseudo labels. Extensive experiments show that CPL boosts those representative STR models to achieve state-of-the-art results on six challenging STR benchmarks. Besides, it can be effectively generalized to handwritten text.
引用
收藏
页码:5786 / 5795
页数:10
相关论文
共 50 条
  • [21] Multimodal fine-grained grocery product recognition using image and OCR text
    Pettersson, Tobias
    Riveiro, Maria
    Lofstrom, Tuwe
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [22] Propagating Fine-Grained Topic Labels in News Snippets
    Sarmento, Luis
    Nunes, Sergio
    Teixeira, Jorge
    Oliveira, Eugenio
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 515 - +
  • [23] Fine-grained Angular Contrastive Learning with Coarse Labels
    Bukchin, Guy
    Schwartz, Eli
    Saenko, Kate
    Shahar, Ori
    Feris, Rogerio
    Giryes, Raja
    Karlinsky, Leonid
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8726 - 8736
  • [24] Towards Fine-Grained Recognition: Joint Learning for Object Detection and Fine-Grained Classification
    Wang, Qiaosong
    Rasmussen, Christopher
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 332 - 344
  • [25] Robust fine-grained image classification with noisy labels
    Xinxing Tan
    Zemin Dong
    Hualing Zhao
    The Visual Computer, 2023, 39 : 5637 - 5650
  • [26] Robust fine-grained image classification with noisy labels
    Tan, Xinxing
    Dong, Zemin
    Zhao, Hualing
    VISUAL COMPUTER, 2022, 39 (11): : 5637 - 5650
  • [27] FINE-GRAINED AND LAYERED OBJECT RECOGNITION
    Wu, Yang
    Zheng, Nanning
    Liu, Yuanliu
    Yuan, Zejian
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (02)
  • [28] SELECTIVE PARTS FOR FINE-GRAINED RECOGNITION
    Li, Dong
    Li, Yali
    Wang, Shengjin
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 922 - 926
  • [29] Deep LSAC for Fine-Grained Recognition
    Lin, Di
    Wang, Yi
    Liang, Lingyu
    Li, Ping
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
  • [30] FgER: Fine-Grained Entity Recognition
    Abhishek
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8008 - 8009