Nom document digitalization by deep convolution neural networks

被引:9
|
作者
Kha Cong Nguyen [1 ]
Cuong Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
D O I
10.1016/j.patrec.2020.02.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nom is an ancient script used in Vietnam until the current Latin-based Vietnamese alphabet became common, and a large number of ancient Nom documents are in existence. Due to the gradual degradation of Nom documents and a decrease in the number of scholars who can understand them, a system to digitalize Nom documents is urgently necessary. This paper presents a segmentation-based method for digitalizing Nom documents using deep convolution neural networks. Nom pages are preprocessed, segmented into isolated characters, and then recognized by a single-character OCR. The structure of the U-Net is applied to create segmentation maps and extract character regions from them. Subsequently, we propose coarse and fine combined classifiers to recognize each character pattern. The results by the best classifier are revised by a decoder using a langue model. The decoder is the same as the connectionist temporal classification decoder used in end-to-end text recognition systems. Compared with the traditional segmentation method using projection profiles and the Voronoi diagram (IoU = 81.23%), the segmentation method using the deep convolution neural network produces a better result (IoU = 92.08%) for detecting character regions. The proposed CNN models for recognizing segmented character patterns outperforms the traditional models using the modified quadratic discriminant function and the learning vector quantization with the recognition rate of 85.07%. The combination of coarse and fine classifiers, the training dataset with salt and pepper noises, and the attention layer are the key factors in the recognition rate improvement. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 16
页数:9
相关论文
共 50 条
  • [1] An Architecture to Accelerate Convolution in Deep Neural Networks
    Ardakani, Arash
    Condo, Carlo
    Ahmadi, Mehdi
    Gross, Warren J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (04) : 1349 - 1362
  • [2] Deep Convolution Neural Networks for Image Classification
    Kulkarni, Arun D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 18 - 23
  • [3] THE COMBINATION OF CONVOLUTION NEURAL NETWORKS AND DEEP NEURAL NETWORKS FOR FAKE NEWS DETECTION
    Jawad, Zainab A.
    Obaid, Ahmed J.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2023, 18 (01): : 814 - 826
  • [4] Finger Type Classification with Deep Convolution Neural Networks
    Al-Wajih, Yousif Ahmed
    Hamanah, Waleed M.
    Abido, Mohammad A.
    Al-Sunni, Fouad
    Alwajih, Fakhraddin
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (ICINCO), 2022, : 247 - 254
  • [5] Deep Convolution Neural Networks for Twitter Sentiment Analysis
    Zhao Jianqiang
    Gui Xiaolin
    Zhang Xuejun
    IEEE ACCESS, 2018, 6 : 23253 - 23260
  • [6] Bengali text document categorization based on very deep convolution neural network
    Hossain, Md. Rajib
    Hoque, Mohammed Moshiul
    Siddique, Nazmul
    Sarker, Iqbal H.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [7] Detection of document modification based on deep neural networks
    Kim, Noo-ri
    Choi, YunSeok
    Lee, HyunSoo
    Choi, Jae-Young
    Kim, Suntae
    Kim, Jeong-Ah
    Cho, Youngwha
    Lee, Jee-Hyong
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2018, 9 (04) : 1089 - 1096
  • [8] Evaluating Deep Neural Networks for Image Document Enhancement
    Kirsten, Lucas N.
    Piccoli, Ricardo
    Ribani, Ricardo
    PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
  • [9] Attentive deep neural networks for legal document retrieval
    Nguyen, Ha-Thanh
    Phi, Manh-Kien
    Ngo, Xuan-Bach
    Tran, Vu
    Nguyen, Le-Minh
    Tu, Minh-Phuong
    Artificial Intelligence and Law, 32 (01): : 57 - 86
  • [10] Detection of document modification based on deep neural networks
    Noo-ri Kim
    YunSeok Choi
    HyunSoo Lee
    Jae-Young Choi
    Suntae Kim
    Jeong-Ah Kim
    Youngwha Cho
    Jee-Hyong Lee
    Journal of Ambient Intelligence and Humanized Computing, 2018, 9 : 1089 - 1096