Visual-Textual Attention for Tree-Based Handwritten Mathematical Expression Recognition

Authors
Liao, Wei [1]
Liu, Jiayi [1]
Chen, Jianghan [1]
Wang, Qiu-Feng [1]
Huang, Kaizhu [2]
Affiliations
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Handwritten mathematical expression recognition; Tree decoder; Visual-textual attention; Mutual learning; DECODER;
DOI
10.1007/978-981-97-1417-9_35
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Handwritten mathematical expression recognition (HMER) has attracted much attention and achieved remarkable progress under the encoder-decoder framework. However, it remains challenging due to complex structures and illegible handwriting. In this paper, we propose to refine the encoder-decoder framework for HMER. First, we propose a multi-scale visual and textual attention fusion mechanism to enhance the context drawn from both spatial and semantic information. Second, most HMER works simply treat HMER as a sequence-to-sequence problem (i.e., predicting a LaTeX string), ignoring the structural information in mathematical expressions. To overcome this issue, we utilize a tree decoder to capture such structural context. Furthermore, we propose a parent-children mutual learning method to enhance the learning of our encoder-decoder model. Extensive experiments on the HMER benchmark datasets CROHME 2014, 2016 and 2019 demonstrate the effectiveness of the proposed method.
Pages: 375-384
Number of pages: 10