Construction of an Online Cloud Platform for Zhuang Speech Recognition and Translation with Edge-Computing-Based Deep Learning Algorithm

Cited by: 1

Authors:

Fan, Zeping ^{[1,2]}

Huang, Min ^{[1,2]}

Zhang, Xuejun ^{[1,2,3]}

Liu, Rongqi ^{[1,2]}

Lyu, Xinyi ^{[1]}

Duan, Taisen ^{[1,2]}

Bu, Zhaohui ^{[4]}

Liang, Jianghua ^{[5]}

Affiliations:

[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China

[2] Guangxi Univ, Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China

[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China

[4] Guangxi Univ, Sch Foreign Language, Nanning 530004, Peoples R China

[5] Guangxi Univ, Sch Journalism & Commun, Nanning 530004, Peoples R China

Source:

APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 22

Keywords:

automatic speech recognition; natural language processing; neural machine translation; transformer; cloud edge computing; network programming;

DOI:

10.3390/app132212184

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

The Zhuang ethnic minority in China possesses its own ethnic language but no ethnic script. Cultural exchange and transmission encounter hurdles because the Zhuang rely exclusively on oral communication, so an online cloud-based platform is required to enhance linguistic communication. First, a database of 200 h of annotated Zhuang speech is created by collecting standard Zhuang speeches, and database quality is improved by removing transcription inconsistencies and normalizing the text. Second, SAformerNet, a more efficient and accurate transformer-based automatic speech recognition (ASR) network, is obtained by inserting additional downsampling modules. Subsequently, a Neural Machine Translation (NMT) model for translating Zhuang into other languages is constructed by fine-tuning the BART model and applying a corpus filtering strategy. Finally, to make the network responsive to real-world needs, edge-computing techniques are applied to relieve network bandwidth pressure, and an edge-computing private cloud system based on FPGA acceleration is proposed to improve model operation efficiency. Experiments show that the most critical metric of the system, model accuracy, is above 93%, and inference time is reduced by 29%. The computational delay of the multi-head self-attention (MHSA) and feed-forward network (FFN) modules is reduced by factors of 7.1 and 1.9, respectively, and terminal response time is accelerated by 20% on average. Overall, the scheme provides a prototype tool for small-scale Zhuang remote natural language tasks in mountainous areas.

Pages: 19
