Vietnamese Voice2Text: A Web Application for Whisper Implementation in Vietnamese Automatic Speech Recognition Tasks: Vietnamese Voice2Text

被引:0
|
作者
Nguyen, Quangphuoc [1 ]
Nguyen, Ngocminh [1 ]
Dang, Thanhluan [1 ]
Tran, Vanha [1 ]
机构
[1] Fpt University, Hanoi, Viet Nam
来源
ACM International Conference Proceeding Series | 2023年
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Application programming interfaces (API) - Character recognition - Codes (symbols) - Computer software reusability - High level languages - Scalability - Speech recognition - Structural design - User interfaces
引用
收藏
页码:312 / 318
相关论文
共 12 条
  • [1] Transformer-Based Joint Learning Approach for Text Normalization in Vietnamese Automatic Speech Recognition Systems
    Viet The Bui
    Tho Chi Luong
    Oanh Thi Tran
    CYBERNETICS AND SYSTEMS, 2024, 55 (07) : 1614 - 1630
  • [2] Speech Act Classification in Vietnamese Utterance and Its Application in Smart Mobile Voice Interaction
    Thi-Lan Ngo
    Quang-Vu Duong
    Son-Bao Pham
    Xuan-Hieu Phan
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 396 - 402
  • [3] Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language
    Le, Viet-Bac
    Besacier, Laurent
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1471 - 1482
  • [4] The First Vietnamese FOSD-Tacotron-2-based Text-to-Speech Model Dataset
    Tran, Duc Chung
    DATA IN BRIEF, 2020, 31
  • [5] Named Entity Recognition for Vietnamese Spoken Texts and Its Application in Smart Mobile Voice Interaction
    Phuong-Nam Tran
    Van-Duc Ta
    Quoc-Tuan Truong
    Quang-Vu Duong
    Thac-Thong Nguyen
    Xuan-Hieu Phan
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 170 - 180
  • [6] Exploring a Web-Based Application to Convert Tamil and Vietnamese Speech to Text without the Effect of Code-Switching and Code-Mixing
    K. Phung
    R. Ramachandran
    E. Ogunshile
    Programming and Computer Software, 2021, 47 : 757 - 764
  • [7] Exploring a Web-Based Application to Convert Tamil and Vietnamese Speech to Text without the Effect of Code-Switching and Code-Mixing
    Phung, K.
    Ramachandran, R.
    Ogunshile, E.
    PROGRAMMING AND COMPUTER SOFTWARE, 2021, 47 (08) : 757 - 764
  • [8] Deep Voice 2: Multi-Speaker Neural Text-to-Speech
    Arik, Sercan O.
    Diamos, Gregory
    Gibiansky, Andrew
    Miller, John
    Peng, Kainan
    Ping, Wei
    Raiman, Jonathan
    Zhou, Yanqi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [9] An Approach to accept input in Text Editor through voice and its Analysis, designing, development and implementation using Speech Recognition
    Surahio, Farhan Ali
    Jumani, Awais Khan
    Talpur, Sawan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (03): : 14 - 20
  • [10] EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion
    Miao, Chenfeng
    Zhu, Qingying
    Chen, Minchuan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1650 - 1661