Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns

Cited by: 17
Authors
Nam Tuan Ly [1]
Cuong Tuan Nguyen [1]
Nakagawa, Masaki [1]
Affiliations
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
Keywords
Handwritten Japanese Text Recognition; End-to-End Model; CNN; BLSTM; Synthetic Image Generation;
DOI
10.1109/ICFHR-2018.2018.00022
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper presents an end-to-end Deep Convolutional Recurrent Network (DCRN) model for recognizing offline handwritten Japanese text lines. The end-to-end DCRN model has three parts: a convolutional feature extractor that uses a Deep Convolutional Neural Network (DCNN) to extract a feature sequence from a text line image; recurrent layers that employ a Deep Bidirectional LSTM to make per-frame predictions from the feature sequence; and a transcription layer that uses Connectionist Temporal Classification (CTC) to convert the per-frame predictions into the label sequence. Since our end-to-end model requires a large amount of training data, we synthesize handwritten text line images from sentences in corpora and handwritten character patterns in the Nakayosi and Kuchibue databases, applying elastic distortions. In the experiments, we evaluate the performance of the end-to-end model and the effectiveness of the synthetic data generation method on the test set of the TUAT Kondate database. The results show that our end-to-end model exceeds the state-of-the-art recognition accuracy on the TUAT Kondate test set, achieving character-level recognition accuracies of 96.35% and 98.05% without and with the generated synthetic data, respectively.
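The role of the CTC transcription layer described in the abstract can be illustrated with a minimal sketch of best-path (greedy) decoding: take the highest-scoring label in each frame, merge consecutive repeats, and drop the blank label. This is a common CTC decoding rule, not necessarily the decoder used in the paper; the function name `ctc_greedy_decode`, the blank index 0, and the toy scores are assumptions made for illustration.

```python
from itertools import groupby

BLANK = 0  # assumption: index 0 is the CTC blank label


def ctc_greedy_decode(frame_scores, blank=BLANK):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    # Pick the highest-scoring label index for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Collapse runs of identical labels, then remove the blank label.
    return [label for label, _ in groupby(best_path) if label != blank]


# Toy per-frame scores over 3 labels (0 = blank, 1, 2) for 6 frames.
frame_scores = [
    [0.10, 0.80, 0.10],  # label 1
    [0.20, 0.70, 0.10],  # label 1 (repeat, merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank (separates the two occurrences of label 1)
    [0.10, 0.80, 0.10],  # label 1
    [0.10, 0.20, 0.70],  # label 2
    [0.60, 0.20, 0.20],  # blank
]
decoded = ctc_greedy_decode(frame_scores)
print(decoded)  # → [1, 1, 2]
```

The blank frame between the two runs of label 1 is what lets the decoder emit the same character twice in a row; without it, the repeats would be merged into a single label.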
Pages: 74-79
Number of pages: 6
Related papers
50 in total
  • [11] END-TO-END CHINESE TEXT RECOGNITION
    Hu, Jie
    Guo, Tszhang
    Cao, Ji
    Zhang, Changshui
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1407 - 1411
  • [12] End-to-End Scene Text Recognition
    Wang, Kai
    Babenko, Boris
    Belongie, Serge
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1457 - 1464
  • [13] End-to-End Chinese Image Text Recognition with Attention Model
    Sheng, Fenfen
    Zhai, Chuanlei
    Chen, Zhineng
    Xu, Bo
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 180 - 189
  • [14] An End-to-End Scene Text Recognition for Bilingual Text
    Albalawi, Bayan M.
    Jamal, Amani T.
    Al Khuzayem, Lama A.
    Alsaedi, Olaa A.
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (09)
  • [15] End-to-End Handwritten Text Detection and Transcription in Full Pages
    Carbonell, Manuel
    Mas, Joan
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, 2019, : 29 - 34
  • [16] RTNet: An End-to-End Method for Handwritten Text Image Translation
    Su, Tonghua
    Liu, Shuchen
    Zhou, Shengjie
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 99 - 113
  • [17] An attention-based end-to-end model for multiple text lines recognition in Japanese historical documents
    Ly, Nam Tuan
    Nguyen, Cuong Tuan
    Nakagawa, Masaki
    Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, 2019, : 629 - 634
  • [18] An end-to-end model for multi-view scene text recognition
    Banerjee, Ayan
    Shivakumara, Palaiahnakote
    Bhattacharya, Saumik
    Pal, Umapada
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2024, 149
  • [19] End-to-End Optical Character Recognition for Bengali Handwritten Words
    Safir, Farisa Benta
    Ohi, Abu Quwsar
    Mridha, M. F.
    Monowar, Muhammad Mostafa
    Hamid, Md Abdul
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1067 - +
  • [20] PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition
    Dezhi Peng
    Lianwen Jin
    Yuliang Liu
    Canjie Luo
    Songxuan Lai
    International Journal of Computer Vision, 2022, 130 : 2623 - 2645