Nom document digitalization by deep convolution neural networks

被引:9
|
作者
Kha Cong Nguyen [1 ]
Cuong Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
D O I
10.1016/j.patrec.2020.02.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nom is an ancient script used in Vietnam until the current Latin-based Vietnamese alphabet became common, and a large number of ancient Nom documents are in existence. Due to the gradual degradation of Nom documents and a decrease in the number of scholars who can understand them, a system to digitalize Nom documents is urgently necessary. This paper presents a segmentation-based method for digitalizing Nom documents using deep convolution neural networks. Nom pages are preprocessed, segmented into isolated characters, and then recognized by a single-character OCR. The structure of the U-Net is applied to create segmentation maps and extract character regions from them. Subsequently, we propose coarse and fine combined classifiers to recognize each character pattern. The results by the best classifier are revised by a decoder using a langue model. The decoder is the same as the connectionist temporal classification decoder used in end-to-end text recognition systems. Compared with the traditional segmentation method using projection profiles and the Voronoi diagram (IoU = 81.23%), the segmentation method using the deep convolution neural network produces a better result (IoU = 92.08%) for detecting character regions. The proposed CNN models for recognizing segmented character patterns outperforms the traditional models using the modified quadratic discriminant function and the learning vector quantization with the recognition rate of 85.07%. The combination of coarse and fine classifiers, the training dataset with salt and pepper noises, and the attention layer are the key factors in the recognition rate improvement. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 16
页数:9
相关论文
共 50 条
  • [21] Exploring Deep Recurrent Convolution Neural Networks for Subjectivity Classification
    Zhang, Xuejun
    Huang, Shan
    Zhao, Jianqiang
    Du, Xiaogang
    He, Fucun
    IEEE ACCESS, 2019, 7 : 347 - 357
  • [22] Single Image Dehazing Using Deep Convolution Neural Networks
    Zhang, Shengdong
    He, Fazhi
    Yao, Jian
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 128 - 137
  • [23] Safety helmet recognition based on deep convolution neural networks
    Zhou, Ningning
    Huang, Guofang
    Shi, Shaodong
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 959 - 967
  • [24] OGCNet: Overlapped group convolution for deep convolutional neural networks
    Li, Guoqing
    Zhang, Meng
    Zhang, Jingwei
    Zhang, Qianru
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [25] Winograd Convolution for Deep Neural Networks: Efficient Point Selection
    Alam, Syed Asad
    Anderson, Andrew
    Barabasz, Barbara
    Gregg, David
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)
  • [26] OGCNet: Overlapped group convolution for deep convolutional neural networks
    Li, Guoqing
    Zhang, Meng
    Zhang, Jingwei
    Zhang, Qianru
    Knowledge-Based Systems, 2022, 253
  • [27] A Generic OCR Using Deep Siamese Convolution Neural Networks
    Sokar, Ghada
    Hemayed, Elsayed E.
    Rehan, Mohamed
    2018 IEEE 9TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2018, : 1238 - 1244
  • [28] Sentiment Analysis of Text using Deep Convolution Neural Networks
    Chachra, Anmol
    Mehndiratta, Pulkit
    Gupta, Mohit
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 247 - 252
  • [29] Malware visualization methods based on deep convolution neural networks
    Ren, Zhuojun
    Chen, Guang
    Lu, Wenke
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (15-16) : 10975 - 10993
  • [30] Splatter: An Efficient Sparse Image Convolution for Deep Neural Networks
    Lee, Tristan
    Lee, Byeong Kil
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 506 - 509