A tree-based model with branch parallel decoding for handwritten mathematical expression recognition

被引:4
|
作者
Li, Zhe [1 ]
Yang, Wentao [1 ]
Qi, Hengnian [2 ]
Jin, Lianwen [1 ,4 ]
Huang, Yichao [3 ]
Ding, Kai [3 ]
机构
[1] South China Univ Technol, 381 wushan Rd, Guangzhou, Peoples R China
[2] Huzhou Univ, 759,Erhuandong Rd, Huzhou 313000, Peoples R China
[3] IntSig Informat Co, 1268,Wanrong Rd, Shanghai 200040, Peoples R China
[4] South China Univ Technol, Sch Elect & Informat, Guangzhou, Peoples R China
关键词
Handwritten mathematical expression; recognition; Tree-based model; Parallel decoding; Attention mechanism;
D O I
10.1016/j.patcog.2023.110220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten mathematical expression recognition (HMER) is a challenging task in the field of computer vision due to the complex two-dimensional spatial structure and diverse handwriting styles of mathematical expressions (MEs). Recent mainstream approach treats MEs as objects with tree structures, modeled by sequence decoders or tree decoders. These decoders recognize the symbols and relationships between symbols in MEs in depth-first order, resulting in long decoding steps that can harm their performance, particularly for MEs with complex structures. In this paper, we propose a novel tree-based model with branch parallel decoding for HMER, which parses the structures of ME trees by explicitly predicting the relationships between symbols. In addition, a query constructing module is proposed to assist the decoder in decoding the branches of ME trees in parallel, thus reducing the number of decoding time steps and alleviating the problem of long sequence attention decoding. As a result, our model outperforms existing models on three widely-used benchmarks and demonstrates significant improvements in HMER performance.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Using Speech for Handwritten Mathematical Expression Recognition Disambiguation
    Medjkoune, Sofiane
    Mouchere, Harold
    Petitrenaud, Simon
    Viard-Gaudin, Christian
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 187 - 192
  • [32] Online Handwritten Mathematical Expression Recognition and Applications: A Survey
    Zhelezniakov, Dmytro
    Zaytsev, Viktor
    Radyvonenko, Olga
    IEEE ACCESS, 2021, 9 : 38352 - 38373
  • [33] Stroke Extraction for Offline Handwritten Mathematical Expression Recognition
    Chan, Chungkwong
    IEEE ACCESS, 2020, 8 : 61565 - 61575
  • [34] CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition
    Zhao, Wenqi
    Gao, Liangcai
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 392 - 408
  • [35] Offline handwritten mathematical expression recognition based on YOLOv5s
    Fei Li
    Hongbo Fang
    Dengzhun Wang
    Ruixin Liu
    Qing Hou
    Benliang Xie
    The Visual Computer, 2024, 40 : 1439 - 1452
  • [36] Offline handwritten mathematical expression recognition based on YOLOv5s
    Li, Fei
    Fang, Hongbo
    Wang, Dengzhun
    Liu, Ruixin
    Hou, Qing
    Xie, Benliang
    VISUAL COMPUTER, 2024, 40 (03): : 1439 - 1452
  • [37] Semantic Tree-Based 3D Scene Model Recognition
    Yuan, Juefei
    Wang, Tianyang
    Zhe, Shandian
    Lu, Yijuan
    Li, Bo
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 85 - 90
  • [38] High performance in tree-based parallel architectures
    Ancona, F
    Rovetta, S
    Zunino, R
    23RD EUROMICRO CONFERENCE - NEW FRONTIERS OF INFORMATION TECHNOLOGY, PROCEEDINGS, 1997, : 474 - 481
  • [39] A framework for parallel tree-based scientific simulations
    Liu, PF
    Wu, JJ
    PROCEEDINGS OF THE 1997 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 1997, : 137 - 144
  • [40] An N-ary Tree-based Model for Similarity Evaluation on Mathematical Formulae
    Dai, Yifan
    Chen, Liangyu
    Zhang, Zihan
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2578 - 2584