Multimodal Seed Data Augmentation for Low-Resource Audio Latin Cuengh Language

被引:0
|
作者
Jiang, Lanlan [1 ]
Qin, Xingguo [2 ]
Zhang, Jingwei [2 ]
Li, Jun [2 ]
机构
[1] Guilin Univ Elect Technol, Sch Business, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 20期
基金
中国国家自然科学基金;
关键词
seed data augmentation; low-resource data; Latin Cuengh language; multimodal;
D O I
10.3390/app14209533
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Latin Cuengh is a low-resource dialect that is prevalent in select ethnic minority regions in China. This language presents unique challenges for intelligent research and preservation efforts, primarily due to its oral tradition and the limited availability of textual resources. Prior research has sought to bolster intelligent processing capabilities with regard to Latin Cuengh through data augmentation techniques leveraging scarce textual data, with modest success. In this study, we introduce an innovative multimodal seed data augmentation model designed to significantly enhance the intelligent recognition and comprehension of this dialect. After supplementing the pre-trained model with extensive speech data, we fine-tune its performance with a modest corpus of multilingual textual seed data, employing both Latin Cuengh and Chinese texts as bilingual seed data to enrich its multilingual properties. We then refine its parameters through a variety of downstream tasks. The proposed model achieves a commendable performance across both multi-classification and binary classification tasks, with its average accuracy and F1 measure increasing by more than 3%. Moreover, the model's training efficiency is substantially ameliorated through strategic seed data augmentation. Our research provides insights into the informatization of low-resource languages and contributes to their dissemination and preservation.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation
    Byambadorj, Zolzaya
    Nishimura, Ryota
    Ayush, Altangerel
    Ohta, Kengo
    Kitaoka, Norihide
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [42] Efficient Data Augmentation via lexical matching for boosting performance on Statistical Machine Translation for Indic and a Low-resource language
    Saxena, Shefali
    Gupta, Ayush
    Daniel, Philemon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64255 - 64269
  • [43] A Study on Low-resource Language Identification
    Qi, Zhaodi
    Ma, Yong
    Gu, Mingliang
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1897 - 1902
  • [44] Enhancing African low-resource languages: Swahili data for language modelling
    Shikali, Casper S.
    Mokhosi, Refuoe
    DATA IN BRIEF, 2020, 31
  • [45] On the scalability of data augmentation techniques for low-resource machine translation between Chinese and Vietnamese
    Vu, Huan
    Bui, Ngoc Dung
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2023, 7 (02) : 241 - 253
  • [46] A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation
    Liu, Yongkang
    Nie, Ercong
    Feng, Shi
    Hua, Zheng
    Ding, Zifeng
    Wang, Daling
    Zhang, Yifei
    Schuetze, Hinrich
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 162 - 177
  • [47] Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sami
    Saleva, Jonne
    Lignos, Constantine
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12949 - 12956
  • [48] Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation
    Byambadorj, Zolzaya
    Nishimura, Ryota
    Ayush, Altangerel
    Ohta, Kengo
    Kitaoka, Norihide
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 849 - 853
  • [49] Effectiveness of Data Augmentation and Pretraining for Improving Neural Headline Generation in Low-Resource Settings
    Martinc, Matej
    Montariol, Syrielle
    Pivovarova, Lidia
    Zosa, Elaine
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3561 - 3570
  • [50] Improving Neural Machine Translation for Low-resource English-Myanmar-Thai Language Pairs with SwitchOut Data Augmentation Algorithm
    San, Mya Ei
    Thu, Ye Kyaw
    Supnithi, Thepchai
    Usanavasin, Sasiporn
    2022 17TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2022) / 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (AIOT 2022), 2022,