Multimodal Seed Data Augmentation for Low-Resource Audio Latin Cuengh Language

被引：0

作者：

Jiang, Lanlan ^{[1
]}

Qin, Xingguo ^{[2
]}

Zhang, Jingwei ^{[2
]}

Li, Jun ^{[2
]}

机构：

[1] Guilin Univ Elect Technol, Sch Business, Guilin 541004, Peoples R China

[2] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 20期

基金：

中国国家自然科学基金;

关键词：

seed data augmentation; low-resource data; Latin Cuengh language; multimodal;

D O I：

10.3390/app14209533

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Latin Cuengh is a low-resource dialect that is prevalent in select ethnic minority regions in China. This language presents unique challenges for intelligent research and preservation efforts, primarily due to its oral tradition and the limited availability of textual resources. Prior research has sought to bolster intelligent processing capabilities with regard to Latin Cuengh through data augmentation techniques leveraging scarce textual data, with modest success. In this study, we introduce an innovative multimodal seed data augmentation model designed to significantly enhance the intelligent recognition and comprehension of this dialect. After supplementing the pre-trained model with extensive speech data, we fine-tune its performance with a modest corpus of multilingual textual seed data, employing both Latin Cuengh and Chinese texts as bilingual seed data to enrich its multilingual properties. We then refine its parameters through a variety of downstream tasks. The proposed model achieves a commendable performance across both multi-classification and binary classification tasks, with its average accuracy and F1 measure increasing by more than 3%. Moreover, the model's training efficiency is substantially ameliorated through strategic seed data augmentation. Our research provides insights into the informatization of low-resource languages and contributes to their dissemination and preservation.

引用

页数：13

共 50 条

[41] Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation
Byambadorj, Zolzaya
Nishimura, Ryota
Ayush, Altangerel
Ohta, Kengo
Kitaoka, Norihide
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
[42] Efficient Data Augmentation via lexical matching for boosting performance on Statistical Machine Translation for Indic and a Low-resource language
Saxena, Shefali
Gupta, Ayush
Daniel, Philemon
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64255 - 64269
[43] A Study on Low-resource Language Identification
Qi, Zhaodi
Ma, Yong
Gu, Mingliang
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1897 - 1902
[44] Enhancing African low-resource languages: Swahili data for language modelling
Shikali, Casper S.
Mokhosi, Refuoe
DATA IN BRIEF, 2020, 31
[45] On the scalability of data augmentation techniques for low-resource machine translation between Chinese and Vietnamese
Vu, Huan
Bui, Ngoc Dung
JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2023, 7 (02) : 241 - 253
[46] A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation
Liu, Yongkang
Nie, Ercong
Feng, Shi
Hua, Zheng
Ding, Zifeng
Wang, Daling
Zhang, Yifei
Schuetze, Hinrich
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 162 - 177
[47] Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sami
Saleva, Jonne
Lignos, Constantine
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12949 - 12956
[48] Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation
Byambadorj, Zolzaya
Nishimura, Ryota
Ayush, Altangerel
Ohta, Kengo
Kitaoka, Norihide
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 849 - 853
[49] Effectiveness of Data Augmentation and Pretraining for Improving Neural Headline Generation in Low-Resource Settings
Martinc, Matej
Montariol, Syrielle
Pivovarova, Lidia
Zosa, Elaine
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3561 - 3570
[50] Improving Neural Machine Translation for Low-resource English-Myanmar-Thai Language Pairs with SwitchOut Data Augmentation Algorithm
San, Mya Ei
Thu, Ye Kyaw
Supnithi, Thepchai
Usanavasin, Sasiporn
2022 17TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2022) / 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (AIOT 2022), 2022,

← 1 2 3 4 5 →