Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling

被引:0
|
作者
Li, Min [1 ]
Zheng, Ruirui [1 ]
Xu, Shuang [1 ]
Fu, Yu [1 ]
Huang, Di [2 ]
机构
[1] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian, Peoples R China
[2] Northern Univ Nationalities, Coll Math & Informat Sci, Yinchuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Manchu word recognition; convolutional neural network; spatial pyramid pooling; optical character recognition;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Manchu character recognition is important in protecting and researching Manchu culture and history. Previous methods of Manchu character recognition are mainly based on conventional machine learning using shallow artificial selection features, thus recognition results are unsatisfactory. The method with convolutional neural networks achieves high accuracy on optical character recognition as the convolution operators can automatically extract deep structure features. The convolutional neural network needs input images with the fixed size, but as a kind of phonemic language, the Manchu word has an arbitrary length. So it is needed to normalize the size of images if applying conventional convolutional neural network directly on Manchu word recognition. This normalization process will restrain the promotion of Manchu character recognition accuracy. This paper utilizes the spatial pyramid pooling layer instead of the last max-pooling layer in a convolutional neural network, and proposes a classifier for recognizing the arbitrary size Manchu word without segmenting the word. Without need of normalizing image sizes, the proposed model obtains the better recognition accuracy. The experiments indicate that the proposed Manchu word recognition models achieve the highest accuracy of 0.9768, higher than the conventional convolutional neural network. Furthermore there is no normalization on input images with arbitrary sizes in recognizing process. The proposed Manchu word recognition models outperform conventional counterparts in both accuracy and flexibility.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] A pyramid stripe pooling-based convolutional neural network for malware detection and classification
    Jiang J.
    Zhang Y.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (03) : 2785 - 2796
  • [22] Graph Convolutional Neural Network Gesture Recognition Based on Pooling Algorithm
    Chen, Hong
    Qi, Baoqiang
    Zhao, Hongdong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [23] Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4846 - 4852
  • [24] Detection of Algorithmically Generated Domain Names Using the Recurrent Convolutional Neural Network with Spatial Pyramid Pooling
    Liu, Zhanghui
    Zhang, Yudong
    Chen, Yuzhong
    Fan, Xinwen
    Dong, Chen
    ENTROPY, 2020, 22 (09)
  • [25] Lightweight spatial pyramid pooling convolutional neural network assisted hyperspectral imaging for Hangbaiju origin identification
    Dong, Ming-Yue
    Long, Wan-Jun
    Wu, Hai-Long
    Wang, Tong
    Fu, Hai-Yan
    Huang, Kun
    Ren, Hang
    Yu, Ru-Qin
    MICROCHEMICAL JOURNAL, 2025, 208
  • [26] Spatial Pyramid Pooling with Atrous Convolutional for MobileNet
    Mohamed, Nur Ayuni
    Zulkifley, Mohd Asyraf
    Abdani, Siti Raihanah
    2020 18TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2020, : 333 - 336
  • [27] Acoustic Scene Classification Using Spatial Pyramid Pooling With Convolutional Neural Networks
    Basbug, Ahmet Melih
    Sert, Mustafa
    2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 128 - 131
  • [28] Crop pest identification based on spatial pyramid pooling and deep convolution neural network
    Zhang B.
    Zhang M.
    Chen Y.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2019, 35 (19): : 209 - 215
  • [29] Traffic Sign Recognition with Convolutional Neural Network Based on Max Pooling Positions
    Qian, Rongqiang
    Yue, Yong
    Coenen, Frans
    Zhang, Bailing
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 578 - 582
  • [30] SPATIOTEMPORAL PYRAMID POOLING IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR ACTION RECOGNITION
    Cheng, Cheng
    Lv, Pin
    Su, Bing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3468 - 3472