Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns

Cited by: 17

Authors:

Nam Tuan Ly [1]

Cuong Tuan Nguyen [1]

Masaki Nakagawa [1]

Affiliations:

[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan

Source:

PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018

Keywords:

Handwritten Japanese Text Recognition; End-to-End Model; CNN; BLSTM; Synthetic Image Generation;

DOI:

10.1109/ICFHR-2018.2018.00022

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents an end-to-end Deep Convolutional Recurrent Network (DCRN) model for recognizing offline handwritten Japanese text lines. The end-to-end DCRN model has three parts: a convolutional feature extractor that uses a Deep Convolutional Neural Network (DCNN) to extract a feature sequence from a text line image; recurrent layers that employ a Deep Bidirectional LSTM to make per-frame predictions from the feature sequence; and a transcription layer that uses Connectionist Temporal Classification (CTC) to convert the per-frame predictions into the label sequence. Since our end-to-end model requires a large amount of training data, we synthesize handwritten text line images from sentences in corpora and handwritten character patterns in the Nakayosi and Kuchibue databases, applying elastic distortions. In the experiments, we evaluate the performance of the end-to-end model and the effectiveness of the synthetic data generation method on the test set of the TUAT Kondate database. The results show that our end-to-end model surpasses the state-of-the-art recognition accuracy on the TUAT Kondate test set, with character-level recognition accuracies of 96.35% without and 98.05% with the generated synthetic data.
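To illustrate the transcription step mentioned in the abstract, the following is a minimal sketch of CTC best-path (greedy) decoding: take the most likely label in each frame, merge consecutive repeats, then drop the blank label. This is a generic illustration, not the authors' implementation; the abstract does not say which decoding strategy the paper uses. The function name `ctc_greedy_decode`, the blank index 0, and the toy scores are all assumptions made for the example.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (hypothetical choice)


def ctc_greedy_decode(frame_scores, blank=BLANK):
    """Best-path CTC decoding over per-frame score vectors.

    frame_scores: list of per-frame lists, one score per label.
    Returns the label sequence after merging repeats and removing blanks.
    """
    # 1. Pick the highest-scoring label in every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Merge runs of identical consecutive labels into one label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Remove blanks; a blank between two equal labels keeps both.
    return [label for label in collapsed if label != blank]


# Toy per-frame predictions over three labels: blank (0), 1, and 2.
frame_scores = [
    [0.10, 0.80, 0.10],  # label 1
    [0.20, 0.70, 0.10],  # label 1 again (merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank separates the two occurrences of label 1
    [0.10, 0.80, 0.10],  # label 1
    [0.10, 0.20, 0.70],  # label 2
]
decoded = ctc_greedy_decode(frame_scores)
print(decoded)
```

In a real recognizer the per-frame scores would come from the recurrent layers, and each decoded index would be mapped back to a character through the model's label table.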

Pages: 74-79

Page count: 6
