MAGIC: Multi-granularity domain adaptation for text recognition

被引:0
|
作者
Zhang, Jia-Ying [1 ]
Liu, Xiao-Qian [1 ]
Xue, Zhi-Yuan [1 ]
Luo, Xin [1 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, 1500 Shunhua Rd, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Text recognition; Unsupervised domain adaptation; Entropy minimization; Multi-granularity prediction;
D O I
10.1016/j.patcog.2024.111229
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain gaps between synthetic text and real-world text restrict current text recognition methods. One solution is to align features through Unsupervised Domain Adaptation (UDA). Most existing UDA-based text recognition methods extract global and local features to alleviate domain differences, only focusing on character distribution gaps. However, notable distribution gaps in character combinations exert a pivotal influence diverse text recognition tasks. To this end, we propose a Multi-level And multi-Granularity domain adaptation with entropy loss guIded text reCognition model, named MAGIC. It integrates Global-level Domain Adaptation (GDA) to mitigate image-level domain drift and Local-level Multi-granularity Domain Adaptation (LMDA) local feature shifts. Particularly, we design a subword-level domain discriminator to align the subword features relating to each character combination. Moreover, multi-granularity entropy minimization is used to optimize the target domain data for better domain adaptation. Experimental results on several types of text datasets demonstrate the effectiveness of MAGIC.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Multi-granularity Prediction for Scene Text Recognition
    Wang, Peng
    Da, Cheng
    Yao, Cong
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 339 - 355
  • [2] Multi-Granularity Alignment Domain Adaptation for Object Detection
    Zhou, Wenzhang
    Du, Dawei
    Zhang, Libo
    Luo, Tiejian
    Wu, Yanjun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9571 - 9580
  • [3] Multi-granularity Deep Local Representations for Irregular Scene Text Recognition
    Gao, Hongchao
    Li, Yujia
    Dai, Jiao
    Wang, Xi
    Han, Jizhong
    Li, Ruixuan
    ACM/IMS Transactions on Data Science, 2021, 2 (02):
  • [4] Multi-granularity Legal Text Matching Method for Incorporating Domain Element Knowledge
    Luo S.
    Dong B.
    Pan L.
    Wu Z.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (03): : 298 - 305
  • [5] A Multi-Granularity Heterogeneous Graph for Extractive Text Summarization
    Zhao, Henghui
    Zhang, Wensheng
    Huang, Mengxing
    Feng, Siling
    Wu, Yuanyuan
    ELECTRONICS, 2023, 12 (10)
  • [6] On persuasion in spam email: A multi-granularity text analysis
    Janez-Martino, Francisco
    Barron-Cedeno, Alberto
    Alaiz-Rodriguez, Rocio
    Gonzalez-Castro, Victor
    Muti, Arianna
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
  • [7] A Multi-Granularity Semantic Extraction Method for Text Classification
    Li, Min
    Liu, Zeyu
    Li, Gang
    Han, Delong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 224 - 236
  • [8] Research on Text Classification by Fusing Multi-Granularity Information
    Xin, Miaomiao
    Ma, Li
    Hu, Bofa
    Computer Engineering and Applications, 2023, 59 (09) : 104 - 111
  • [9] Text Sentiment Analysis Based on Multi-Granularity Joint Solution
    Fang, Xianghui
    Wang, Guoyin
    Liu, Qun
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 315 - 321
  • [10] A Text Vector Representation Model Merging Multi-Granularity Information
    Nie W.
    Chen Y.
    Ma J.
    Data Analysis and Knowledge Discovery, 2019, 3 (09) : 45 - 52