SCS: Style and Content Supervision Network for Character Recognition with Unseen Font Style

被引:2
|
作者
Tang, Wei [1 ,2 ,3 ]
Jiang, Yiwen [1 ,2 ,3 ]
Gao, Neng [3 ]
Xiang, Ji [3 ]
Su, Yijun [1 ,2 ,3 ]
Li, Xiang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, State Key Lab Informat Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V | 2019年 / 1143卷
基金
中国国家自然科学基金;
关键词
Character recognition; Convolutional neural networks; Style overfitting; Style supervision;
D O I
10.1007/978-3-030-36802-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a significant style overfitting problem in traditional content supervision models of character recognition: insufficient generalization ability to recognize the characters with unseen font styles. To overcome this problem, in this paper we propose a novel framework named Style and Content Supervision (SCS) network, which integrates style and content supervision to resist style overfitting. Different from traditional models only supervised by content labels, SCS simultaneously leverages the style and content supervision to separate the task-specific features of style and content, and then mixes the style-specific and content-specific features using bilinear model to capture the hidden correlation between them. Experimental results prove that the proposed model is able to achieve the state-of-the-art performance on several widely used real world character sets, and it obtains relatively strong robustness when the size of training set is shrinking.
引用
收藏
页码:20 / 31
页数:12
相关论文
共 50 条
  • [21] Distilling Content from Style for Handwritten Word Recognition
    Kang, Lei
    Riba, Pau
    Rusinol, Marcal
    Fornes, Alicia
    Villegas, Mauricio
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 139 - 144
  • [22] Network of Steel: Neural Font Style Transfer from Heavy Metal to Corporate Logos
    Ter-Sarkisov, Aram
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 621 - 629
  • [23] Gaussian Process Style Transfer Mapping for Historical Chinese Character Recognition
    Feng, Jixiong
    Peng, Liangrui
    Lebourgeois, Franck
    DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402
  • [24] Historical Chinese Character Recognition Method Based on Style Transfer Mapping
    Li, Bohan
    Peng, Liangrui
    Ji, Jingning
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 96 - 100
  • [25] The WuShu Database for Cursive Script Character and Style Recognition<bold> </bold>
    Shan, Xinrui
    Zhang, Kejun
    Shen, Lyukesheng
    Wang, Bolin
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [26] CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENCED STYLE EMBEDDING
    Zhu, Anna
    Zhang, Qiyang
    Lu, Xiongbo
    Xiong, Shengwu
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 374 - 379
  • [27] Puff-Net: Efficient Style Transfer with Pure Content and Style Feature Fusion Network
    Zheng, Sizhe
    Gao, Pan
    Zhou, Peng
    Qin, Jie
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8059 - 8068
  • [28] Deep Content Guidance Network for Arbitrary Style Transfer
    Shi, Di-Bo
    Xie, Huan
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [29] Content-Independent Font Recognition on a Single Chinese Character using Sparse Representation
    Song, Weikang
    Lian, Zhouhui
    Tang, Yingmin
    Xiao, Jianguo
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 376 - 380
  • [30] Chinese Character Style Transfer Model Based on Convolutional Neural Network
    Chen, Weiran
    Liu, Chunping
    Ji, Yi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 558 - 569