A tree-based model with branch parallel decoding for handwritten mathematical expression recognition

被引:4
|
作者
Li, Zhe [1 ]
Yang, Wentao [1 ]
Qi, Hengnian [2 ]
Jin, Lianwen [1 ,4 ]
Huang, Yichao [3 ]
Ding, Kai [3 ]
机构
[1] South China Univ Technol, 381 wushan Rd, Guangzhou, Peoples R China
[2] Huzhou Univ, 759,Erhuandong Rd, Huzhou 313000, Peoples R China
[3] IntSig Informat Co, 1268,Wanrong Rd, Shanghai 200040, Peoples R China
[4] South China Univ Technol, Sch Elect & Informat, Guangzhou, Peoples R China
关键词
Handwritten mathematical expression; recognition; Tree-based model; Parallel decoding; Attention mechanism;
D O I
10.1016/j.patcog.2023.110220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten mathematical expression recognition (HMER) is a challenging task in the field of computer vision due to the complex two-dimensional spatial structure and diverse handwriting styles of mathematical expressions (MEs). Recent mainstream approach treats MEs as objects with tree structures, modeled by sequence decoders or tree decoders. These decoders recognize the symbols and relationships between symbols in MEs in depth-first order, resulting in long decoding steps that can harm their performance, particularly for MEs with complex structures. In this paper, we propose a novel tree-based model with branch parallel decoding for HMER, which parses the structures of ME trees by explicitly predicting the relationships between symbols. In addition, a query constructing module is proposed to assist the decoder in decoding the branches of ME trees in parallel, thus reducing the number of decoding time steps and alleviating the problem of long sequence attention decoding. As a result, our model outperforms existing models on three widely-used benchmarks and demonstrates significant improvements in HMER performance.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A tree-based model with branch parallel decoding for handwritten mathematical expression recognition
    Li, Zhe
    Yang, Wentao
    Qi, Hengnian
    Jin, Lianwen
    Huang, Yichao
    Ding, Kai
    Pattern Recognition, 2024, 149
  • [2] Visual-Textual Attention for Tree-Based Handwritten Mathematical Expression Recognition
    Liao, Wei
    Liu, Jiayi
    Chen, Jianghan
    Wang, Qiu-Feng
    Huang, Kaizhu
    ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2023, 2024, 14374 : 375 - 384
  • [3] Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition
    Yang, Chen
    Du, Jun
    Zhang, Jianshu
    Wu, Changjie
    Chen, Mingjun
    Wu, JiaJia
    PATTERN RECOGNITION, 2022, 132
  • [4] Tree-based BLSTM for mathematical expression recognition
    Zhang, Ting
    Mouchere, Harold
    Viard-Gaudin, Christian
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 914 - 919
  • [5] SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition
    Zhang, Jianshu
    Du, Jun
    Yang, Yongxin
    Song, Yi-Zhe
    Dai, Lirong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2471 - 2480
  • [6] A Transformer-based Syntax Tree Decoder for Handwritten Mathematical Expression Recognition
    Zhou B.
    Cao J.
    Wang Y.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2023, 59 (06): : 909 - 914
  • [7] A comprehensive handwritten Indic script recognition system: a tree-based approach
    Singh P.K.
    Sarkar R.
    Bhateja V.
    Nasipuri M.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (01) : 943 - 960
  • [8] Learning Symbol Relation Tree for Online Handwritten Mathematical Expression Recognition
    Thanh-Nghia Truong
    Hung Tuan Nguyen
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    PATTERN RECOGNITION, ACPR 2021, PT II, 2022, 13189 : 307 - 321
  • [9] A Tree-Based Context Model for Object Recognition
    Choi, Myung Jin
    Torralba, Antonio
    Willsky, Alan S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 240 - 252
  • [10] Bidirectional trained tree-structured decoder for Handwritten Mathematical Expression Recognition
    Cheng, Hanbo
    Liu, Chenyu
    Hu, Pengfei
    Zhang, Zhenrong
    Ma, Jiefeng
    Du, Jun
    PATTERN RECOGNITION, 2025, 165