Visual-Textual Attention for Tree-Based Handwritten Mathematical Expression Recognition

被引:0
|
作者
Liao, Wei [1 ]
Liu, Jiayi [1 ]
Chen, Jianghan [1 ]
Wang, Qiu-Feng [1 ]
Huang, Kaizhu [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten mathematical expression recognition; Tree decoder; Visual-textual attention; Mutual learning; DECODER;
D O I
10.1007/978-981-97-1417-9_35
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Handwritten mathematical expression recognition (HMER) has attracted much attention and achieved remarkable progress under the encoder-decoder framework. However, it is still challenging due to complex structures and illegible handwriting. In this paper, we propose to refine the encoder-decoder framework for HMER. Firstly, we propose a multi-scale vision and textual attention fusion mechanism to enhance the contexts from both spatial and semantic information. Next, most of HMER works simply regard the HMER as a sequence-to-sequence problem (i.e., Latex string), ignoring the structure information in the mathematical expressions. To overcome this issue, we utilize a tree decoder to capture such structure contexts. Furthermore, we propose a parent-children mutual learning method to enhance the learning of our encoder-decoder model. Extensive experiments on the HMER benchmark datasets of CROHME 2014, 2016 and 2019 demonstrate the effectiveness of the proposed method.
引用
收藏
页码:375 / 384
页数:10
相关论文
共 50 条
  • [41] A selective attention-based method for visual pattern recognition with application to handwritten digit recognition and face recognition
    Salah, AA
    Alpaydin, E
    Akarun, L
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (03) : 420 - 425
  • [42] Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer
    Zhao, Wenqi
    Gao, Liangcai
    Yan, Zuoyu
    Peng, Shuai
    Du, Lin
    Zhang, Ziyin
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 570 - 584
  • [43] An Attention-Aware Model for Human Action Recognition on Tree-Based Skeleton Sequences
    Ding, Runwei
    Liu, Chang
    Liu, Hong
    SOCIAL ROBOTICS, ICSR 2018, 2018, 11357 : 569 - 579
  • [44] Syntactic data generation for handwritten mathematical expression recognition
    Thanh-Nghia Truong
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    PATTERN RECOGNITION LETTERS, 2022, 153 : 83 - 91
  • [45] Survey of Mathematical Expression Recognition for Printed and Handwritten Documents
    Aggarwal, Ridhi
    Pandey, Shilpa
    Tiwari, Anil Kumar
    Harit, Gaurav
    IETE TECHNICAL REVIEW, 2022, 39 (06) : 1245 - 1253
  • [46] Primitive Contrastive Learning for Handwritten Mathematical Expression Recognition
    Guo, Hong-Yu
    Wang, Chuang
    Yin, Fei
    Liu, Heng-Ye
    Wu, Jin-Wen
    Liu, Cheng-Lin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 847 - 854
  • [47] Structural String Decoder for Handwritten Mathematical Expression Recognition
    Wu, Jiajia
    Hu, Jinshui
    Chen, Mingjun
    Dai, Lirong
    Niu, Xuejing
    Wang, Ning
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3246 - 3251
  • [48] Handwritten Mathematical Expression Recognition: An approach on data augmentation
    Khanh-Ngoc Bui
    Quoc-Kim-Hoang Nguyen
    Thanh-Sach Le
    2021 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP 2021), 2021, : 46 - 53
  • [49] Using Speech for Handwritten Mathematical Expression Recognition Disambiguation
    Medjkoune, Sofiane
    Mouchere, Harold
    Petitrenaud, Simon
    Viard-Gaudin, Christian
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 187 - 192
  • [50] Quality Assessment of Light Field Images Based on Contrastive Visual-Textual Model
    Wang, Han-Ling
    Ke, Xiao
    Jiang, Ao-Xin
    Guo, Wen-Zhong
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (10): : 3562 - 3577