Multi-modal simultaneous machine translation fusion of image information

被引：1

作者：

Huang, Yan ^{[1
]}

Wanga, Zhanyang ^{[1
]}

Zhang, TianYuan ^{[1
]}

Xu, Chun ^{[2
]}

Lianga, Hui ^{[1
]}

机构：

[1] Zhengzhou Univ Light Ind, Coll Software Engn, Zhengzhou, Henan, Peoples R China

[2] Xinjiang Univ Finance & Econ, Coll Comp, Urumqi, Xinjiang, Peoples R China

来源：

JOURNAL OF ENGINEERING RESEARCH | 2023年 / 11卷 / 02期

关键词：

Simultaneous translation; Real-time; Surrounding scenes; Multi-modal; Image information;

D O I：

10.1016/j.jer.2023.100085

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Simultaneous translation is to translate a sentence before people finish it, to understand the speaker's intention in real-time. At present, simultaneous machine translation still relies on text-to-text data resources. However, the output information from the encoder side is used for the decoder as the input data recourse in the pure text translation system. This information is only derived from the text content, and the input information is single, causing a shortage of decoding information at the decoder and the vocabulary is missed in translation. The translator will also visually capture the information of the surrounding scenes to assist himself in the translation work, based on this feature, we propose a multi-modal simultaneous machine translation of fusion image information. We extract information from the image, add the information to the decoder side of the translation system, increase the input data resource of the decoder, and help the system improve the translation quality. We use the Multi30K dataset for experimental verification. Compared with the translation system of plain text, the method we propose can translate more complete sentences, richer content, and better translation results.

引用

页数：7

共 50 条

[31] Multi-layer, multi-modal medical image intelligent fusion
Nair, Rekha R.
Singh, Tripty
Basavapattana, Abhinandan
Pawar, Manasa M.
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42821 - 42847
[32] Multi-layer, multi-modal medical image intelligent fusion
Rekha R. Nair
Tripty Singh
Abhinandan Basavapattana
Manasa M. Pawar
Multimedia Tools and Applications, 2022, 81 : 42821 - 42847
[33] Multi-modal brain image fusion using multi feature guided fusion network
Shibu, Tom Michael
Madan, Niranjan
Paramanandham, Nirmala
Kumar, Aakash
Santosh, Ashwin
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
[34] MURF: Mutually Reinforcing Multi-Modal Image Registration and Fusion
Xu, Han
Yuan, Jiteng
Ma, Jiayi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12148 - 12166
[35] Adaptive decomposition method for multi-modal medical image fusion
Wang, Jing
Li, Xiongfei
Zhang, Yan
Zhang, Xiaoli
IET IMAGE PROCESSING, 2018, 12 (08) : 1403 - 1412
[36] Adherent Peanut Image Segmentation Based on Multi-Modal Fusion
Wang, Yujing
Ye, Fang
Zeng, Jiusun
Cai, Jinhui
Huang, Wangsen
SENSORS, 2024, 24 (14)
[37] Multi-modal Image Fusion Based on ROI and Laplacian Pyramid
Gao, Xiong
Zhang, Hong
Chen, Hao
Li, Jiafeng
SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
[38] MULTI-MODAL MEDICAL IMAGE FUSION USING CURVELET ALGORITHM
Mathiyalagan, P.
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2453 - 2458
[39] SWT and PCA image fusion methods for multi-modal imagery
Bashir, Rabia
Junejo, Riaz
Qadri, Nadia N.
Fleury, Martin
Qadri, Muhammad Yasir
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (02) : 1235 - 1263
[40] Fabric image retrieval based on multi-modal feature fusion
Ning Zhang
Yixin Liu
Zhongjian Li
Jun Xiang
Ruru Pan
Signal, Image and Video Processing, 2024, 18 : 2207 - 2217

← 1 2 3 4 5 →