Visual-Textual Attention for Tree-Based Handwritten Mathematical Expression Recognition

被引:0
|
作者
Liao, Wei [1 ]
Liu, Jiayi [1 ]
Chen, Jianghan [1 ]
Wang, Qiu-Feng [1 ]
Huang, Kaizhu [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten mathematical expression recognition; Tree decoder; Visual-textual attention; Mutual learning; DECODER;
D O I
10.1007/978-981-97-1417-9_35
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Handwritten mathematical expression recognition (HMER) has attracted much attention and achieved remarkable progress under the encoder-decoder framework. However, it is still challenging due to complex structures and illegible handwriting. In this paper, we propose to refine the encoder-decoder framework for HMER. Firstly, we propose a multi-scale vision and textual attention fusion mechanism to enhance the contexts from both spatial and semantic information. Next, most of HMER works simply regard the HMER as a sequence-to-sequence problem (i.e., Latex string), ignoring the structure information in the mathematical expressions. To overcome this issue, we utilize a tree decoder to capture such structure contexts. Furthermore, we propose a parent-children mutual learning method to enhance the learning of our encoder-decoder model. Extensive experiments on the HMER benchmark datasets of CROHME 2014, 2016 and 2019 demonstrate the effectiveness of the proposed method.
引用
收藏
页码:375 / 384
页数:10
相关论文
共 50 条
  • [1] A tree-based model with branch parallel decoding for handwritten mathematical expression recognition
    Li, Zhe
    Yang, Wentao
    Qi, Hengnian
    Jin, Lianwen
    Huang, Yichao
    Ding, Kai
    PATTERN RECOGNITION, 2024, 149
  • [2] A tree-based model with branch parallel decoding for handwritten mathematical expression recognition
    Li, Zhe
    Yang, Wentao
    Qi, Hengnian
    Jin, Lianwen
    Huang, Yichao
    Ding, Kai
    Pattern Recognition, 2024, 149
  • [3] Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition
    Yang, Chen
    Du, Jun
    Zhang, Jianshu
    Wu, Changjie
    Chen, Mingjun
    Wu, JiaJia
    PATTERN RECOGNITION, 2022, 132
  • [4] Tree-based BLSTM for mathematical expression recognition
    Zhang, Ting
    Mouchere, Harold
    Viard-Gaudin, Christian
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 914 - 919
  • [5] Stroke Based Posterior Attention for Online Handwritten Mathematical Expression Recognition
    Wu, Changjie
    Wang, Qing
    Zhang, Jianshu
    Du, Jun
    Wang, Jiaming
    Wu, Jiajia
    Hu, Jinshui
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2943 - 2949
  • [6] Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition
    Liu, Yutian
    Ke, Wenjun
    Wei, Jianguo
    arXiv,
  • [7] Visual-Textual Attribute Learning for Class-Incremental Facial Expression Recognition
    Lv, Yuanling
    Huang, Guangyu
    Yan, Yan
    Xue, Jing-Hao
    Chen, Si
    Wang, Hanzi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8038 - 8051
  • [8] Spatial Attention and Syntax Rule Enhanced Tree Decoder for Offline Handwritten Mathematical Expression Recognition
    Lin, Zihao
    Li, Jinrong
    Yang, Fan
    Huang, Shuangping
    Yang, Xu
    Lin, Jianmin
    Yang, Ming
    FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 : 213 - 227
  • [9] A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition
    Cheng, Xinhua
    Jia, Mengxi
    Wang, Qian
    Zhang, Jian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6994 - 7004
  • [10] SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition
    Zhang, Jianshu
    Du, Jun
    Yang, Yongxin
    Song, Yi-Zhe
    Dai, Lirong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2471 - 2480