Semi-supervised feature learning for improving writer identification

被引:30
|
作者
Chen, Shiming [1 ]
Wang, Yisong [1 ,2 ]
Lin, Chin-Teng [4 ]
Ding, Weiping [3 ]
Cao, Zehong [4 ,5 ]
机构
[1] Guizhou Univ, Sch Comp Sci & Technol, Guiyang, Guizhou, Peoples R China
[2] Key Laborary Intelligent Med Image Anal & Precise, Guiyang, Guizhou, Peoples R China
[3] Nantong Univ, Sch Comp Sci & Technol, Nantong, Jiangsu, Peoples R China
[4] Univ Technol Sydney, Fac Engn & IT, Ctr Artificial Intelligence, Sydney, NSW, Australia
[5] Univ Tasmania, Sch Technol Environm & Design, Discipline ICT, Hobart, Tas, Australia
关键词
Semi-supervised feature learning; Feature extraction; Regularization; CNN; Writer identification; DESCRIPTORS; RECOGNITION; RETRIEVAL; DOCUMENTS;
D O I
10.1016/j.ins.2019.01.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data augmentation is typically used by supervised feature learning approaches for of fine writer identification, but such approaches require a mass of additional training data and potentially lead to overfitting errors. In this study, a semi-supervised feature learning pipeline is proposed to improve the performance of writer identification by training with extra unlabeled data and the original labeled data simultaneously. Specifically, we propose a weighted label smoothing regularization (WLSR) method for data augmentation, which assigns a weighted uniform label distribution to the extra unlabeled data. The WLSR method regularizes the convolutional neural network (CNN) baseline to allow more discriminative features to be learned to represent the properties of different writing styles. The experimental results on well-known benchmark datasets (ICDAR2013 and CVL) showed that our proposed semi-supervised feature learning approach significantly improves the baseline measurement and perform competitively with existing writer identification approaches. Our findings provide new insights into offline writer identification. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:156 / 170
页数:15
相关论文
共 50 条
  • [41] Semi-Supervised Multiview Feature Selection With Adaptive Graph Learning
    Jiang, Bingbing
    Wu, Xingyu
    Zhou, Xiren
    Liu, Yi
    Cohn, Anthony G.
    Sheng, Weiguo
    Chen, Huanhuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3615 - 3629
  • [42] Semi-supervised Learning of Bottleneck Feature for Music Genre Classification
    Dai, Jia
    Liu, Wenju
    Zheng, Hao
    Xue, Wei
    Ni, Chongjia
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 552 - 562
  • [43] Robust Soft Semi-supervised Discriminant Projection for Feature Learning
    Wang, Xiaoyu
    Zhang, Zhao
    Zhang, Yan
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 445 - 453
  • [44] A human motion feature based on semi-supervised learning of GMM
    Tian Qi
    Yinfu Feng
    Jun Xiao
    Hanzhi Zhang
    Yueting Zhuang
    Xiaosong Yang
    Jianjun Zhang
    Multimedia Systems, 2017, 23 : 85 - 93
  • [45] Flexible data representation with feature convolution for semi-supervised learning
    Dornaika, F.
    APPLIED INTELLIGENCE, 2021, 51 (11) : 7690 - 7704
  • [46] A Feature Space Learning Model Based on Semi-Supervised Clustering
    Guan, Renchu
    Wang, Xu
    Marchese, Maurizio
    Liang, Yanchun
    Yang, Chen
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 403 - 409
  • [47] Feature-based approach to semi-supervised similarity learning
    Gosselin, Philippe H.
    Cord, Matthieu
    PATTERN RECOGNITION, 2006, 39 (10) : 1839 - 1851
  • [48] A semi-supervised Laplacian extreme learning machine and feature fusion with CNN for industrial superheat identification
    Lei, Yongxiang
    Chen, Xiaofang
    Min, Mengcan
    Xie, Yongfang
    NEUROCOMPUTING, 2020, 381 (381) : 186 - 195
  • [49] MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins
    Sosea, Tiberiu
    Caragea, Cornelia
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15773 - 15782
  • [50] Improving classification with semi-supervised and fine-grained learning
    Lai, Danyu
    Tian, Wei
    Chen, Long
    PATTERN RECOGNITION, 2019, 88 : 547 - 556