Leveraging translations for speech transcription in low-resource settings

Cited by: 0
Authors
Anastasopoulos, Antonios [1]
Chiang, David [1]
Affiliations
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
Funding
National Science Foundation (United States)
Keywords
neural multi-source models; speech transcription; endangered languages
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recently proposed data collection frameworks for endangered language documentation aim not only to collect speech in the language of interest, but also to collect translations into a high-resource language that will render the collected resource interpretable. We focus on this scenario and explore whether we can improve transcription quality under these extremely low-resource settings with the assistance of text translations. We present a neural multi-source model and evaluate several variations of it on three low-resource datasets. We find that our multi-source model with shared attention outperforms the baselines, reducing transcription character error rate by up to 12.3%.
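The abstract reports results as character error rate (CER). For readers unfamiliar with the metric, the sketch below shows one common way to compute it: the character-level Levenshtein distance between a reference and a hypothesis transcription, divided by the reference length. It is an illustration only. The function names `edit_distance` and `character_error_rate` are hypothetical, and the record does not describe the authors' actual evaluation code or any normalization they applied.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance between ref and hyp."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalized by reference length (assumed definition)."""
    if not ref:
        raise ValueError("reference transcription must be non-empty")
    return edit_distance(ref, hyp) / len(ref)
```

For example, `character_error_rate("kitten", "sitting")` is 0.5: three edits against a six-character reference. Because the denominator is the reference length, CER can exceed 1.0 when the hypothesis is much longer than the reference.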
Pages: 1279-1283
Number of pages: 5