Multi-modal simultaneous machine translation fusion of image information

被引:1
|
作者
Huang, Yan [1 ]
Wanga, Zhanyang [1 ]
Zhang, TianYuan [1 ]
Xu, Chun [2 ]
Lianga, Hui [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Software Engn, Zhengzhou, Henan, Peoples R China
[2] Xinjiang Univ Finance & Econ, Coll Comp, Urumqi, Xinjiang, Peoples R China
来源
JOURNAL OF ENGINEERING RESEARCH | 2023年 / 11卷 / 02期
关键词
Simultaneous translation; Real-time; Surrounding scenes; Multi-modal; Image information;
D O I
10.1016/j.jer.2023.100085
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Simultaneous translation is to translate a sentence before people finish it, to understand the speaker's intention in real-time. At present, simultaneous machine translation still relies on text-to-text data resources. However, the output information from the encoder side is used for the decoder as the input data recourse in the pure text translation system. This information is only derived from the text content, and the input information is single, causing a shortage of decoding information at the decoder and the vocabulary is missed in translation. The translator will also visually capture the information of the surrounding scenes to assist himself in the translation work, based on this feature, we propose a multi-modal simultaneous machine translation of fusion image information. We extract information from the image, add the information to the decoder side of the translation system, increase the input data resource of the decoder, and help the system improve the translation quality. We use the Multi30K dataset for experimental verification. Compared with the translation system of plain text, the method we propose can translate more complete sentences, richer content, and better translation results.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Multi-layer, multi-modal medical image intelligent fusion
    Nair, Rekha R.
    Singh, Tripty
    Basavapattana, Abhinandan
    Pawar, Manasa M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42821 - 42847
  • [32] Multi-layer, multi-modal medical image intelligent fusion
    Rekha R. Nair
    Tripty Singh
    Abhinandan Basavapattana
    Manasa M. Pawar
    Multimedia Tools and Applications, 2022, 81 : 42821 - 42847
  • [33] Multi-modal brain image fusion using multi feature guided fusion network
    Shibu, Tom Michael
    Madan, Niranjan
    Paramanandham, Nirmala
    Kumar, Aakash
    Santosh, Ashwin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [34] MURF: Mutually Reinforcing Multi-Modal Image Registration and Fusion
    Xu, Han
    Yuan, Jiteng
    Ma, Jiayi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12148 - 12166
  • [35] Adaptive decomposition method for multi-modal medical image fusion
    Wang, Jing
    Li, Xiongfei
    Zhang, Yan
    Zhang, Xiaoli
    IET IMAGE PROCESSING, 2018, 12 (08) : 1403 - 1412
  • [36] Adherent Peanut Image Segmentation Based on Multi-Modal Fusion
    Wang, Yujing
    Ye, Fang
    Zeng, Jiusun
    Cai, Jinhui
    Huang, Wangsen
    SENSORS, 2024, 24 (14)
  • [37] Multi-modal Image Fusion Based on ROI and Laplacian Pyramid
    Gao, Xiong
    Zhang, Hong
    Chen, Hao
    Li, Jiafeng
    SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
  • [38] MULTI-MODAL MEDICAL IMAGE FUSION USING CURVELET ALGORITHM
    Mathiyalagan, P.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2453 - 2458
  • [39] SWT and PCA image fusion methods for multi-modal imagery
    Bashir, Rabia
    Junejo, Riaz
    Qadri, Nadia N.
    Fleury, Martin
    Qadri, Muhammad Yasir
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (02) : 1235 - 1263
  • [40] Fabric image retrieval based on multi-modal feature fusion
    Ning Zhang
    Yixin Liu
    Zhongjian Li
    Jun Xiang
    Ruru Pan
    Signal, Image and Video Processing, 2024, 18 : 2207 - 2217