Reducing Style Overfitting for Character Recognition via Parallel Neural Networks with Style to Content Connection

被引:1
|
作者
Tang, Wei [1 ,2 ,3 ]
Jiang, Yiwen [1 ,2 ,3 ]
Gao, Neng [3 ]
Xiang, Ji [3 ]
Shen, Jiahui [3 ]
Li, Xiang [1 ,2 ,3 ]
Su, Yijun [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, State Key Lab Informat Secur, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
来源
2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019) | 2019年
关键词
character recognition; style overfitting; neural network;
D O I
10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00117
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There is a significant style overfitting problem in neural-based character recognition: insufficient generalization ability to recognize characters with unseen styles. To address this problem, we propose a novel framework named Style-Melt Nets (SMN), which disentangles the style and content factors to extract pure content feature. In this framework, a pair of parallel style net and content net is designed to respectively infer the style labels and content labels of input character images, and the style feature produced by the style net is fed to the content net for eliminating the style influence on content feature. In addition, the marginal distribution of character pixels is considered as an important structure indicator for enhancing the content representations. Furthermore, to increase the style diversity of training data, an efficient data augmentation approach for changing the thickness of the strokes and generating outline characters is presented. Extensive experimental results demonstrate the benefit of our methods, and the proposed SMN is able to achieve the state-of-the-art performance on multiple real world character sets.
引用
收藏
页码:784 / 791
页数:8
相关论文
共 50 条
  • [31] Neural Artistic Style Transfer Using Deep Neural Networks
    Gupta, Bharath
    Govinda, K.
    Rajkumar, R.
    Masih, Jolly
    PROCEEDINGS OF ACADEMIA-INDUSTRY CONSORTIUM FOR DATA SCIENCE (AICDS 2020), 2022, 1411 : 1 - 12
  • [32] Masked Neural Style Transfer using Convolutional Neural Networks
    Handa, Arushi
    Garg, Prerna
    Khare, Vijay
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2099 - 2104
  • [33] Gaussian Process Style Transfer Mapping for Historical Chinese Character Recognition
    Feng, Jixiong
    Peng, Liangrui
    Lebourgeois, Franck
    DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402
  • [34] Historical Chinese Character Recognition Method Based on Style Transfer Mapping
    Li, Bohan
    Peng, Liangrui
    Ji, Jingning
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 96 - 100
  • [35] The WuShu Database for Cursive Script Character and Style Recognition<bold> </bold>
    Shan, Xinrui
    Zhang, Kejun
    Shen, Lyukesheng
    Wang, Bolin
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [36] CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENCED STYLE EMBEDDING
    Zhu, Anna
    Zhang, Qiyang
    Lu, Xiongbo
    Xiong, Shengwu
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 374 - 379
  • [37] Online Music Style Recognition via Mobile Computing
    Yuan, Lizhu
    Zhang, Yue
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2022, 13 (02)
  • [38] Chinese Character Style Transfer Model Based on Convolutional Neural Network
    Chen, Weiran
    Liu, Chunping
    Ji, Yi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 558 - 569
  • [39] Image Style Transfer Using Convolutional Neural Networks
    Gatys, Leon A.
    Ecker, Alexander S.
    Bethge, Matthias
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2414 - 2423
  • [40] Video Style Transfer based on Convolutional Neural Networks
    Dong, Sun
    Ding, Youdong
    Qian, Yun
    Li, Mengfan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022