Two-step sequence transformer based method for Cham to Latin script transliteration

Cited by: 0
Authors
Tien-Nam Nguyen [1]
Burie, Jean-Christophe [1]
Thi-Lan Le [2]
Schweyer, Anne-Valerie [3]
Affiliations
[1] Lab Informat Image Interact L3i, La Rochelle, France
[2] Sch Elect & Elect Engn SEEE, Hanoi, Vietnam
[3] CNRS, Ctr Asie Sud Est CASE, Paris, France
Keywords
Transliteration; Historical documents; Cham manuscript images; Transformer; Sequence to Sequence
DOI
10.1145/3604951.3605525
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fusing visual and textual information is a promising way to obtain better feature representations. In this work, we propose a method for text-line transliteration of Cham manuscripts that combines the visual and textual modalities. Instead of the standard approach of directly recognizing the words in the image, we split the problem into two steps. First, we define a recognition scenario in which similar characters are treated as a single character. Second, a transformer model that considers both visual and contextual information adjusts the predictions at the positions of similar characters so that they can be distinguished. Following this two-step strategy, the proposed method consists of a sequence-to-sequence model and a multi-modal transformer, and so takes advantage of both. Extensive experiments show that the proposed method outperforms approaches from the literature on our Cham manuscripts dataset.
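The two-step idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the symbols `x1`, `x2`, `y1`, `y2`, `r` are arbitrary placeholders rather than real Cham characters, the group labels `<G1>` and `<G2>` are hypothetical, and the `score` callable merely stands in for the multi-modal transformer, whose actual architecture is not described in this record.

```python
from typing import Callable, Dict, List

# Hypothetical groups of look-alike symbols (placeholders, not real Cham data).
GROUPS: Dict[str, List[str]] = {
    "<G1>": ["x1", "x2"],
    "<G2>": ["y1", "y2"],
}
# Reverse lookup: symbol -> label of the group it belongs to.
MEMBER_TO_GROUP: Dict[str, str] = {
    member: label for label, members in GROUPS.items() for member in members
}

def collapse(tokens: List[str]) -> List[str]:
    """Step 1 view: replace every look-alike symbol with its group label."""
    return [MEMBER_TO_GROUP.get(tok, tok) for tok in tokens]

def refine(
    coarse: List[str],
    score: Callable[[int, str, List[str]], float],
) -> List[str]:
    """Step 2 view: at each group label, keep the best-scoring member.

    `score(position, candidate, coarse)` is a stand-in for a model that
    would use visual and contextual evidence; here it is any callable.
    """
    refined = []
    for pos, tok in enumerate(coarse):
        if tok in GROUPS:
            best = max(GROUPS[tok], key=lambda cand: score(pos, cand, coarse))
            refined.append(best)
        else:
            refined.append(tok)
    return refined
```

In this sketch, a first recognizer only has to predict the coarse sequence produced by `collapse`, and the ambiguity inside each group is left to `refine`.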
Pages: 25-30
Number of pages: 6