Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling

被引:0
|
作者
Li, Min [1 ]
Zheng, Ruirui [1 ]
Xu, Shuang [1 ]
Fu, Yu [1 ]
Huang, Di [2 ]
机构
[1] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian, Peoples R China
[2] Northern Univ Nationalities, Coll Math & Informat Sci, Yinchuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Manchu word recognition; convolutional neural network; spatial pyramid pooling; optical character recognition;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Manchu character recognition is important in protecting and researching Manchu culture and history. Previous methods of Manchu character recognition are mainly based on conventional machine learning using shallow artificial selection features, thus recognition results are unsatisfactory. The method with convolutional neural networks achieves high accuracy on optical character recognition as the convolution operators can automatically extract deep structure features. The convolutional neural network needs input images with the fixed size, but as a kind of phonemic language, the Manchu word has an arbitrary length. So it is needed to normalize the size of images if applying conventional convolutional neural network directly on Manchu word recognition. This normalization process will restrain the promotion of Manchu character recognition accuracy. This paper utilizes the spatial pyramid pooling layer instead of the last max-pooling layer in a convolutional neural network, and proposes a classifier for recognizing the arbitrary size Manchu word without segmenting the word. Without need of normalizing image sizes, the proposed model obtains the better recognition accuracy. The experiments indicate that the proposed Manchu word recognition models achieve the highest accuracy of 0.9768, higher than the conventional convolutional neural network. Furthermore there is no normalization on input images with arbitrary sizes in recognizing process. The proposed Manchu word recognition models outperform conventional counterparts in both accuracy and flexibility.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Traffic Sign Recognition in Harsh Environment Using Attention Based Convolutional Pooling Neural Network
    Jun Ho Chung
    Dong Won Kim
    Tae Koo Kang
    Myo Taeg Lim
    Neural Processing Letters, 2020, 51 : 2551 - 2573
  • [42] Lightweight Spatial Pyramid Convolutional Neural Network for Traffic Sign Classification
    Rachmadi, Reza Fuad
    Koutaki, Gou
    Ogata, Kohichi
    2018 INDONESIAN ASSOCIATION FOR PATTERN RECOGNITION INTERNATIONAL CONFERENCE (INAPR), 2018, : 23 - 28
  • [43] Traffic Sign Recognition in Harsh Environment Using Attention Based Convolutional Pooling Neural Network
    Chung, Jun Ho
    Kim, Dong Won
    Kang, Tae Koo
    Lim, Myo Taeg
    NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2551 - 2573
  • [44] Learning Pooling for Convolutional Neural Network
    Sun, Manli
    Song, Zhanjie
    Jiang, Xiaoheng
    Pan, Jing
    Pang, Yanwei
    NEUROCOMPUTING, 2017, 224 : 96 - 104
  • [45] Word Recognition For Color Classification Using Convolutional Neural Network
    Tuasikal, Dyah Ayu Anggreini
    Nugraha, M. B.
    Yudhatama, Emilio
    Muharom, Ahmad Syahril
    Pura, Megantara
    PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON NEW MEDIA STUDIES (CONMEDIA 2019), 2019, : 228 - 231
  • [46] TANDEM HMM WITH CONVOLUTIONAL NEURAL NETWORK FOR HANDWRITTEN WORD RECOGNITION
    Bluche, Theodore
    Ney, Hermann
    Kermorvant, Christopher
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2390 - 2394
  • [47] Spatial Pyramid-based Wavelet Embedding Deep Convolutional Neural Network for Semantic Segmentation
    Liu, Jin
    Liu, Yazhou
    Sun, Quansen
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 326 - 337
  • [48] Sentiment Classification Based on Piecewise Pooling Convolutional Neural Network
    Zhang, Yuhong
    Wang, Qinqin
    Li, Yuling
    Wu, Xindong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 56 (02): : 285 - 297
  • [49] Max-Pooling Convolutional Neural Network for Chinese Digital Gesture Recognition
    Zhao Qian
    Li Yawei
    Zhu Mengyu
    Yang Yuliang
    Xiao Ling
    Xu Chunyu
    Li Lin
    INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 79 - 89
  • [50] Video-Based Human Action Recognition Using Spatial Pyramid Pooling and 3D Densely Convolutional Networks
    Yang, Wanli
    Chen, Yimin
    Huang, Chen
    Gao, Mingke
    FUTURE INTERNET, 2018, 10 (12):