Transformer-Based Direct Hidden Markov Model for Machine Translation

被引:0
|
作者
Wang, Weiyue [1 ]
Yang, Zijian [1 ]
Gao, Yingbo [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Comp Sci Dept, Human Language Technol & Pattern Recognit Grp, Aachen, Germany
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The neural hidden Markov model has been proposed as an alternative to attention mechanism in machine translation with recurrent neural networks. However, since the introduction of the transformer models, its performance has been surpassed. This work proposes to introduce the concept of the hidden Markov model to the transformer architecture, which outperforms the transformer baseline. Interestingly, we find that the zero-order model already provides promising performance, giving it an edge compared to a model with first-order dependency, which performs similarly but is significantly slower in training and decoding.
引用
收藏
页码:23 / 32
页数:10
相关论文
共 50 条
  • [41] FAULT DIAGNOSIS APPROACH BASED ON HIDDEN MARKOV MODEL AND SUPPORT VECTOR MACHINE
    LIU Guanjun LIU Xinmin QIU Jing HU Niaoqing College of Mechatronics Engineering and Automation
    Chinese Journal of Mechanical Engineering, 2007, (05) : 92 - 95
  • [42] A Hidden Markov Model-Based Method for Virtual Machine Anomaly Detection
    Shi, Chaochen
    Yu, Jiangshan
    PROVABLE SECURITY, PROVSEC 2019, 2019, 11821 : 372 - 380
  • [43] On-line Fault Diagnosis of Electric Machine based on the Hidden Markov Model
    Zhang, Jiayuan
    Zhan, Wei
    Ehsani, Mehrdad
    2016 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE AND EXPO (ITEC), 2016,
  • [44] Detection of machine failure: Hidden Markov Model approach
    Tai, Allen H.
    Ching, Wai-Ki
    Chan, L. Y.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2009, 57 (02) : 608 - 619
  • [45] RM-Transformer: A Transformer-based Model for Mandarin Speech Recognition
    Lu, Xingyu
    Hu, Jianguo
    Li, Shenhao
    Ding, Yanyu
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 194 - 198
  • [46] Re-Transformer: A Self-Attention Based Model for Machine Translation
    Liu, Huey-Ing
    Chen, Wei-Lin
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 3 - 10
  • [47] Combined Medical Image Super-Resolution and Modality Translation Using GAN Transformer-Based Model
    Abdollahi, Melika
    Davoudi, Heidar
    Ebrahimi, Mehran
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1133 - 1138
  • [48] Towards an astronomical foundation model for stars with a transformer-based model
    Leung, Henry W.
    Bovy, Jo
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2024, 527 (01) : 1494 - 1520
  • [49] An Improved LA-Transformer Machine Translation Model
    Wang, Zumin
    Zhang, Chengye
    Bai, Fengbo
    Wang, Yingjie
    Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
  • [50] Research on Autoarrangement System of Accompaniment Chords Based on Hidden Markov Model with Machine Learning
    Shi, Shuo
    Xi, Shuting
    Tsai, Sang-Bing
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021