Characters as graphs: Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer

被引:9
|
作者
Gan, Ji [1 ,2 ]
Chen, Yuyan [1 ]
Hu, Bo [1 ,2 ]
Leng, Jiaxu [1 ,2 ]
Wang, Weiqiang [3 ]
Gao, Xinbo [1 ,2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Inst Brain & Intelligence, Guangyang Bay Lab, Chongqing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten Chinese character Recognition; Transformer; Graph convolutional network; Pyramid graph; ONLINE; REPRESENTATION; EXTRACTION;
D O I
10.1016/j.patcog.2023.109317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is meaningful but challenging to teach machines to recognize handwritten Chinese characters. However, conventional approaches typically view handwritten Chinese characters as either static images or tempo-ral trajectories, which may ignore the inherent geometric semantics of characters. Instead, here we first propose to represent handwritten characters as skeleton graphs, explicitly considering the natural charac-teristics of characters (i.e., characters as graphs). Furthermore, we propose a novel Pyramid Graph Trans-former (PyGT) to specifically process the graph-structured characters, which fully integrates the advan-tages of Transformers and graph convolutional networks. Specifically, our PyGT can learn better graph fea-tures through (i) capturing the global information from all nodes with graph attention mechanism and (ii) modelling the explicit local adjacency structures of nodes with graph convolutions. Furthermore, the PyGT learns the multi-resolution features by constructing a progressive shrinking pyramid. Compared with ex-isting approaches, it is more interpretable to recognize characters as geometric graphs. Moreover, the pro-posed method is generic for both online and offline handwritten Chinese character recognition (HCCR), and it also can be feasibly extended to handwritten text recognition. Extensive experiments empirically demonstrate the superiority of PyGT over the prevalent approaches including 2D-CNN, RNN/1D-CNN, and Vision Transformer (ViT) for HCCR. The code is available at https://github.com/ganji15/PyGT-HCCR .& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A novel algorithm for handwritten Chinese character recognition
    Qi, F
    Deng, MH
    Qian, MP
    Zhu, XQ
    ADVANCES IN MULTIMODAL INTERFACES - ICMI 2000, PROCEEDINGS, 2000, 1948 : 379 - 385
  • [22] Wavelet analysis for handwritten Chinese character recognition
    Yang, J
    Yu, SY
    Zhao, RC
    1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT PROCESSING SYSTEMS, VOLS 1 & 2, 1997, : 1023 - 1026
  • [23] Handwritten Chinese character recognition by metasynthetic approach
    Hao, HW
    Xiao, XH
    Dai, RW
    PATTERN RECOGNITION, 1997, 30 (08) : 1321 - 1328
  • [24] Metasynthetic approach to the recognition of handwritten chinese character
    Xiao, Xuhong
    Dai, Ruwei
    Zidonghua Xuebao/Acta Automatica Sinica, 1997, 23 (05): : 621 - 627
  • [25] DenseRAN for Offline Handwritten Chinese Character Recognition
    Wang, Wenchao
    Zhang, Jianshu
    Du, Jun
    Wang, Zi-Rui
    Zhu, Yixing
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 104 - 109
  • [26] ONLINE HANDWRITTEN CHINESE CHARACTER-RECOGNITION VIA A FUZZY ATTRIBUTE REPRESENTATION
    CHEN, JW
    LEE, SY
    IMAGE AND VISION COMPUTING, 1994, 12 (10) : 669 - 681
  • [27] Handwritten Character Recognition using Hierarchical Graph Matching
    Al Mubarok, Abdulloh
    Nugroho, Hertog
    2016 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2016, : 454 - 459
  • [28] PF-ViT: Parallel and Fast Vision Transformer for Offline Handwritten Chinese Character Recognition
    Dan, Yongping
    Zhu, Zongnan
    Jin, Weishou
    Li, Zhuo
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [29] Graph based Method for Online Handwritten Character Recognition
    Zitouni, Rabiaa
    Bezine, Hala
    Arous, Najet
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 1: GRAPP, 2020, : 263 - 270
  • [30] RECOGNITION OF HANDWRITTEN ARABIC CHARACTERS VIA SEGMENTATION
    ALYOUSEFI, HS
    UDPA, SS
    ARAB GULF JOURNAL OF SCIENTIFIC RESEARCH, 1990, 8 (02): : 49 - 59