Using Syntax in Large-Scale Audio Document Translation

Citations: 0
Authors
Zheng, Jing [1]
Ayan, Necip Fazil [1]
Wang, Wen [1]
Burkett, David [2]
Affiliations
[1] SRI Int, Speech Technol & Res Lab, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
[2] Univ Calif Berkeley, EECS Dept, Berkeley, CA 94720 USA
Keywords
syntax; machine translation; audio document
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, the use of syntax has very effectively improved machine translation (MT) quality in many text translation tasks. However, using syntax in speech translation poses additional challenges, both because of disfluencies and other spoken-language phenomena and because of errors introduced by automatic speech recognition (ASR). In this paper, we investigate the effect of using syntax in a large-scale audio document translation task targeting broadcast news and broadcast conversations. We do so by comparing the performance of three translation approaches based on synchronous context-free grammars: 1) hierarchical phrase-based translation, 2) syntax-augmented MT, and 3) string-to-dependency MT. The results show a positive effect of explicitly using syntax when translating broadcast news, but no benefit when translating broadcast conversations. The results indicate that improving the robustness of syntactic systems to conversational language style is important to their success and requires future effort.
Pages: 444 / +
Page count: 2