Vietnamese Voice2Text: A Web Application for Whisper Implementation in Vietnamese Automatic Speech Recognition Tasks

Cited by: 0

Authors:

Nguyen, Quangphuoc [1]

Nguyen, Ngocminh [1]

Dang, Thanhluan [1]

Tran, Vanha [1]

Affiliations:

[1] FPT University, Hanoi, Viet Nam

Source:

ACM International Conference Proceeding Series | 2023

Keywords:

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Application programming interfaces (API) - Character recognition - Codes (symbols) - Computer software reusability - High level languages - Scalability - Speech recognition - Structural design - User interfaces

Pages: 312-318
