ProtoUDA: Prototype-Based Unsupervised Adaptation for Cross-Domain Text Recognition

被引:0
|
作者
Liu, Xiao-Qian [1 ]
Ding, Xue-Ying [1 ]
Luo, Xin [1 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Text recognition; Prototypes; Feature extraction; Task analysis; Visualization; Decoding; Adaptation models; Unsupervised learning; prototype; text recognition; contrastive learning; domain adaptation; MODEL; DIFFUSION;
D O I
10.1109/TKDE.2023.3344761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text recognition reads from real scene text or handwritten text, facilitating many real-world applications such as driverless cars, visual Q&A, and image-based machine translation. Although impressive results have been achieved in single-domain text recognition, it still suffers from great challenges in cross-domain due to the domain gaps among the synthetic text, the real scene text, and the handwritten text. Existing standard unsupervised domain adaptation (UDA) methods struggle to solve the text recognition task since they view a domain or a text image (containing a character sequence) as a whole, ignoring the subunits that make up the sequence. In the paper, we present a Prototyped-based Unsupervised Domain Adaptation method for text recognition (ProtoUDA), where the class prototypes are computed from the source domain, target domain, and the mixed (source-target) domain, respectively. Technically, ProtoUDA initially extracts pseudo-labeled character features under word-level supervised information. Further, based on these character features, we propose two parallel and complementary modules to perform class-level and instance-level alignment, which explicitly transfer the knowledge learned in the source domain to the target domain. Among them, class-level alignment is to close the distance between the similar source prototypes and target prototypes. The instance-level alignment is based on contrastive learning, making the character instances of the mixed domain close to the corresponding class mixed prototype while staying away from other class mixed prototypes. To our knowledge, we are the first to adopt contrastive learning in UDA-based text recognition tasks. Extensive experiments on several benchmark datasets show the superiority of our method over state-of-the-art methods.
引用
收藏
页码:9096 / 9108
页数:13
相关论文
共 50 条
  • [1] Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text Classification
    Zou, Han
    Yang, Jianfei
    Wu, Xiaojian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1208 - 1218
  • [2] Prototype-Based Multisource Domain Adaptation
    Zhou, Lihua
    Ye, Mao
    Zhang, Dan
    Zhu, Ce
    Ji, Luping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5308 - 5320
  • [3] TextAdapter: Self-Supervised Domain Adaptation for Cross-Domain Text Recognition
    Liu, Xiao-Qian
    Zhang, Peng-Fei
    Luo, Xin
    Huang, Zi
    Xu, Xin-Shun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9854 - 9865
  • [4] Unsupervised Domain Adaptation for Medical Image Segmentation with Dynamic Prototype-based Contrastive Learning
    En, Qing
    Guo, Yuhong
    CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, 2024, 248 : 312 - 325
  • [5] Cross-domain feature enhancement for unsupervised domain adaptation
    Long Sifan
    Wang Shengsheng
    Zhao Xin
    Fu Zihao
    Wang Bilin
    Applied Intelligence, 2022, 52 : 17326 - 17340
  • [6] Cross-Domain Error Minimization for Unsupervised Domain Adaptation
    Du, Yuntao
    Chen, Yinghao
    Cui, Fengli
    Zhang, Xiaowen
    Wang, Chongjun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 429 - 448
  • [7] Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation
    Wang, Rui
    Wu, Zuxuan
    Weng, Zejia
    Chen, Jingjing
    Qi, Guo-Jun
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1665 - 1673
  • [8] Unsupervised Domain Adaptation with Imbalanced Cross-Domain Data
    Hsu, Tzu-Ming Harry
    Chen, Wei-Yu
    Hou, Cheng-An
    Tsai, Yao-Hung Hubert
    Yeh, Yi-Ren
    Wang, Yu-Chiang Frank
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4121 - 4129
  • [9] Cross-domain feature enhancement for unsupervised domain adaptation
    Sifan, Long
    Shengsheng, Wang
    Xin, Zhao
    Zihao, Fu
    Bilin, Wang
    APPLIED INTELLIGENCE, 2022, 52 (15) : 17326 - 17340
  • [10] UDA-FlyRecog: Unsupervised domain adaptation for drosophila cross-domain recognition model
    Deng, Hong
    Cai, Xin
    Yin, ChengLe
    Gao, XueShun
    Hu, Chang
    He, WenJie
    Peng, YingQiong
    JOURNAL OF STORED PRODUCTS RESEARCH, 2023, 104