Nom document digitalization by deep convolution neural networks

Cited by: 9
Authors
Kha Cong Nguyen [1]
Cuong Tuan Nguyen [1]
Masaki Nakagawa [1]
Affiliations
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
DOI
10.1016/j.patrec.2020.02.015
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Nom is an ancient script that was used in Vietnam until the current Latin-based Vietnamese alphabet became common, and a large number of ancient Nom documents still exist. Because Nom documents are gradually degrading and fewer scholars can read them, a system to digitalize Nom documents is urgently needed. This paper presents a segmentation-based method for digitalizing Nom documents using deep convolution neural networks. Nom pages are preprocessed, segmented into isolated characters, and then recognized by a single-character OCR. A U-Net architecture is applied to create segmentation maps, from which character regions are extracted. Subsequently, we propose combined coarse and fine classifiers to recognize each character pattern. The results of the best classifier are revised by a decoder using a language model. The decoder is the same as the connectionist temporal classification (CTC) decoder used in end-to-end text recognition systems. Compared with the traditional segmentation method using projection profiles and the Voronoi diagram (IoU = 81.23%), the segmentation method using the deep convolution neural network produces a better result (IoU = 92.08%) for detecting character regions. The proposed CNN models for recognizing segmented character patterns achieve a recognition rate of 85.07%, outperforming the traditional models that use the modified quadratic discriminant function and learning vector quantization. The combination of coarse and fine classifiers, a training dataset augmented with salt-and-pepper noise, and the attention layer are the key factors in the improved recognition rate. (C) 2020 Elsevier B.V. All rights reserved.
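The segmentation results in the abstract are reported as IoU (intersection over union). As an illustration only, the sketch below computes IoU for two axis-aligned rectangles. The abstract does not say whether the paper scores bounding boxes or pixel maps, so the box representation, the `Box` type, and the `iou` helper are assumptions made here and are not taken from the paper.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle (hypothetical representation of a character region)."""
    left: float
    top: float
    right: float
    bottom: float


def area(box: Box) -> float:
    """Area of a box; an inverted or empty box counts as zero."""
    return max(0.0, box.right - box.left) * max(0.0, box.bottom - box.top)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes, a value in [0, 1]."""
    # The overlap rectangle is bounded by the innermost edges of the two boxes.
    overlap = Box(
        max(a.left, b.left),
        max(a.top, b.top),
        min(a.right, b.right),
        min(a.bottom, b.bottom),
    )
    intersection = area(overlap)
    union = area(a) + area(b) - intersection
    # Two empty boxes have no union; define their IoU as 0 to avoid dividing by zero.
    return intersection / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted = Box(10, 10, 50, 50)      # made-up detected region
    ground_truth = Box(20, 20, 60, 60)   # made-up annotated region
    print(f"IoU = {iou(predicted, ground_truth):.4f}")
```

A dataset-level figure such as those quoted above would typically come from aggregating a score like this over many regions; how the paper aggregates is not stated in the abstract.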
Pages: 8-16
Number of pages: 9