Symbol Location-Aware Network for Improving Handwritten Mathematical Expression Recognition

被引:4
|
作者
Fu, Yingnan [1 ]
Cai, Wenyuan [2 ]
Gao, Ming [1 ]
Zhou, Aoying [1 ]
机构
[1] East China Normal Univ, Sch Data Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Hypers Data Technology Inc, Shanghai, Peoples R China
关键词
handwritten mathematical expression recognition; symbol counting; global context; dynamic programming; PERFORMANCE EVALUATION;
D O I
10.1145/3591106.3592259
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently most handwritten mathematical expression recognition methods adopt the attention-based encoder-decoder framework, which generates LaTeX sequences from given images. However, the accuracy of the attention mechanism limits the performance of HMER models. Lacking global context information in the decoding process is also a challenge for HMER. Some methods adopt symbol-level counting to localize symbols for improving the model performance, while these methods cannot work well. In this paper, we propose a method named SLAN, shorted for a Symbol Location-Aware Network, to solve the HMER problem. Specifically, we propose an advanced relation-level counting method to detect symbols in the image. We solve the lacking global context problem with a new global context-aware decoder. For improving the accuracy of attention, we design a novel attention alignment loss function by the dynamic programming algorithm, which can learn attention alignment directly without pixel-level labels. We conducted extensive experiments on the CROHME dataset to demonstrate the effectiveness of each part of SLAN and achieved state-of-the-art performance.
引用
收藏
页码:516 / 524
页数:9
相关论文
共 50 条
  • [1] Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing
    Li, Zhe
    Wang, Xinyu
    Liu, Yuliang
    Jin, Lianwen
    Huang, Yichao
    Ding, Kai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 90 - 102
  • [2] Syntax-Aware Network for Handwritten Mathematical Expression Recognition
    Yuan, Ye
    Liu, Xiao
    Dikubab, Wondimu
    Liu, Hui
    Ji, Zhilong
    Wu, Zhongqin
    Bai, Xiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4543 - 4552
  • [3] A Neural Network Model for Online Handwritten Mathematical Symbol Recognition
    Thammano, Arit
    Rugkunchon, Sukhumal
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 292 - 298
  • [4] Learning Symbol Relation Tree for Online Handwritten Mathematical Expression Recognition
    Thanh-Nghia Truong
    Hung Tuan Nguyen
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    PATTERN RECOGNITION, ACPR 2021, PT II, 2022, 13189 : 307 - 321
  • [5] Location-Aware Human Activity Recognition
    Nguyen, Tam T.
    Fernandez, Daniel
    Nguyen, Quy T. K.
    Bagheri, Ebrahim
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 821 - 835
  • [6] When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition
    Li, Bohan
    Yuan, Ye
    Liang, Dingkang
    Liu, Xiao
    Ji, Zhilong
    Bai, Jinfeng
    Liu, Wenyu
    Bai, Xiang
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 197 - 214
  • [7] Temporal Classification Constraint for Improving Handwritten Mathematical Expression Recognition
    Cuong Tuan Nguyen
    Hung Tuan Nguyen
    Morizumi, Kei
    Nakagawa, Masaki
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 113 - 125
  • [8] Location-aware Dynamic Network Provisioning
    Van-Quyet Nguyen
    Sinh-Ngoc Nguyen
    Choi, Deokjai
    Kim, Kyungbaek
    2017 19TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS 2017): MANAGING A WORLD OF THINGS, 2017, : 239 - 242
  • [9] Cell Counting by a Location-Aware Network
    Wang, Zuhui
    Yin, Zhaozheng
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 120 - 129
  • [10] Handwritten and Audio Information Fusion for Mathematical Symbol Recognition
    Medjkoune, Sofiane
    Mouchere, Harold
    Petitrenaud, Simon
    Viard-Gaudin, Christian
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 379 - 383