An innovative network based on double receptive field and Recursive Bi-directional Long Short-Term Memory

被引:1
|
作者
Meng, Peng-fei [1 ]
Jia, Shuang-cheng [1 ]
Li, Qian [1 ]
机构
[1] Mogo Auto Intelligence & Telemat Informat Technol, Beijing, Peoples R China
关键词
D O I
10.1038/s41598-021-01520-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequence recognition of natural scene images has always been an important research topic in the field of computer vision. CRNN has been proven to be a popular end-to-end character sequence recognition network. However, the problem of wide characters is not considered under the setting of CRNN. The CRNN is less effective in recognizing long dense small characters. Aiming at the shortcomings of CRNN, we proposed an improved CRNN network, named CRNN-RES, based on BiLSTM and multiple receptive fields. Specifically, on the one hand, the CRNN-RES uses a dual pooling core to enhance the CNN network's ability to extract features. On the other hand, by improving the last RNN layer, the BiLSTM is changed to a shared parameter BiLSTM network using recursive residuals, which reduces the number of network parameters and improves the accuracy. In addition, we designed a structure that can flexibly configure the length of the input data sequence in the RNN layer, called the CRFC layer. Comparing the CRNN-RES network proposed in this paper with the original CRNN network, the extensive experiments show that when recognizing English characters and numbers, the parameters of CRNN-RES is 8197549, which decreased 133,752 parameters compare with CRNN. In the public dataset ICDAR 2003 (IC03), ICDAR 2013 (IC13), IIIT 5k-word (IIIT5k), and Street View Text (SVT), the CRNN-RES obtain the accuracy of 96.90%, 89.85%, 83.63%, and 82.96%, which higher than CRNN by 1.40%, 3.15%, 5.43%, and 2.16% respectively.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] DEEP BI-DIRECTIONAL LONG SHORT-TERM MEMORY BASED SPEECH ENHANCEMENT FOR WIND NOISE REDUCTION
    Lee, Jinkyu
    Kim, Keulbit
    Shabestary, Turaj
    Kang, Hong-Goo
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 41 - 45
  • [22] High Precision Dimensional Measurement with Convolutional Neural Network and Bi-Directional Long Short-Term Memory (LSTM)
    Wang, Yuhao
    Chen, Qibai
    Ding, Meng
    Li, Jiangyun
    SENSORS, 2019, 19 (23)
  • [23] Tiny-RainNet: a deep convolutional neural network with bi-directional long short-term memory model for short-term rainfall prediction
    Zhang, Chang-Jiang
    Wang, Hui-Yuan
    Zeng, Jing
    Ma, Lei-Ming
    Guan, Li
    METEOROLOGICAL APPLICATIONS, 2020, 27 (05)
  • [24] Bi-directional Long Short Term Memory Neural Network for Short-Term Traffic Speed Prediction Using Gravitational Search Algorithm
    Naheliya, Bharti
    Redhu, Poonam
    Kumar, Kranti
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2024, 22 (02) : 316 - 327
  • [25] CNN-based bi-directional and directional long-short term memory network for determination of face mask
    Koklu, Murat
    Cinar, Ilkay
    Taspinar, Yavuz Selim
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
  • [26] Bi-directional Long Short Term Memory Neural Network for Short-Term Traffic Speed Prediction Using Gravitational Search Algorithm
    Naheliya, Bharti
    Redhu, Poonam
    Kumar, Kranti
    International Journal of Intelligent Transportation Systems Research, 2024,
  • [27] Humor Prediction with Bi-directional Long-Short Term Memory
    Yan, Jiahuan
    Yang, Yule
    Zhu, Xi
    2021 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, INFORMATION AND COMMUNICATION ENGINEERING, 2021, 11933
  • [28] Stock Market Prediction Using Deep Attention Bi-directional Long Short-Term Memory
    Prakash, B.
    Saleena, B.
    COMPUTATIONAL ECONOMICS, 2024,
  • [29] Intelligent Tool-Wear Prediction Based on Informer Encoder and Bi-Directional Long Short-Term Memory
    Xie, Xingang
    Huang, Min
    Liu, Yue
    An, Qi
    MACHINES, 2023, 11 (01)
  • [30] Transient Stability Assessment of Power System Based on Bi-directional Long-short-term Memory Network
    Sun L.
    Bai J.
    Zhou Z.
    Zhao C.
    Bai, Jingtao (1830363811@qq.com), 1600, Automation of Electric Power Systems Press (44): : 64 - 72