Leveraging translations for speech transcription in low-resource settings

Cited by: 0
Authors
Anastasopoulos, Antonios [1]
Chiang, David [1]
Affiliations
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
Source
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018
Funding
National Science Foundation (USA)
Keywords
neural multi-source models; speech transcription; endangered languages
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently proposed data collection frameworks for endangered language documentation aim not only to collect speech in the language of interest, but also to collect translations into a high-resource language that will render the collected resource interpretable. We focus on this scenario and explore whether we can improve transcription quality under these extremely low-resource settings with the assistance of text translations. We present a neural multi-source model and evaluate several variations of it on three low-resource datasets. We find that our multi-source model with shared attention outperforms the baselines, reducing transcription character error rate by up to 12.3%.
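For readers unfamiliar with the metric named in the abstract, character error rate is commonly computed as the character-level edit distance between a reference transcription and a system output, divided by the reference length. The following is a minimal sketch of that common definition, not the authors' evaluation code; the function names `edit_distance` and `character_error_rate` are hypothetical, and the record does not say whether the reported 12.3% is an absolute or a relative reduction.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty string
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete r
                            curr[j - 1] + 1,      # insert h
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalized by reference length (assumed definition)."""
    if not ref:
        raise ValueError("reference transcription must be non-empty")
    return edit_distance(ref, hyp) / len(ref)
```

Under this definition, a hypothesis that gets one character wrong out of a four-character reference scores 0.25, and the value can exceed 1.0 when the hypothesis is much longer than the reference.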
Pages: 1279-1283
Page count: 5