Discriminative Training and Unsupervised Adaptation for Labeling Prosodic Events with Limited Training Data

被引:0
|
作者
Fernandez, Raul [1 ]
Ramabhadran, Bhuvana [1 ]
机构
[1] IBM Corp, TJ Watson Res Lab, Yorktown Hts, NY 10598 USA
关键词
prosody labeling; conditional random fields;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications of spoken-language systems can benefit from having access to annotations of prosodic events. Unfortunately, obtaining human annotations of these events, even sensible amounts to train a supervised system, can become a laborious and costly effort. In this paper we explore applying conditional random fields to automatically label major and minor break indices and pitch accents from a corpus of recorded and transcribed speech using a large set of fully automatically-extracted acoustic and linguistic features. We demonstrate the robustness of these features when used in a discriminative training framework as a function of reducing the amount of training data. We also explore adapting the baseline system in an unsupervised fashion to a target dataset for which no prosodic labels are available, and show how, when operating at point where only limited amounts of data are available, an unsupervised approach can offer up to an additional 3% improvement.
引用
收藏
页码:1429 / 1432
页数:4
相关论文
共 50 条
  • [21] Active clustering for labeling training data
    Lutz, Quentin
    de Panafieu, Elie
    Scott, Alex
    Stein, Maya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [22] Tri-Training for Authorship Attribution with Limited Training Data
    Qian, Tieyun
    Liu, Bing
    Chen, Li
    Peng, Zhiyong
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 345 - 351
  • [23] On learning control with limited training data
    Ou, Y
    Xu, Y
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, : 4148 - 4153
  • [24] Automatic lipreading with limited training data
    Wang, S. L.
    Lau, W. H.
    Liew, A. W. C.
    Leung, S. H.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 881 - +
  • [25] Unsupervised corrupt data detection for text training
    Liu, Peiyang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [26] Unsupervised training of Bayesian networks for data clustering
    Pham, Duc Truong
    Ruz, Gonzalo A.
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2009, 465 (2109): : 2927 - 2948
  • [27] Class-Level Adaptation Network with Self Training for Unsupervised Domain Adaptation
    Jin, Yuncheng
    Chen, Zhihong
    Cheng, Zhaowei
    Chen, Chao
    Jin, Xinyu
    Sun, Bin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 137 - 143
  • [28] Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
    Reddy, Arun
    Paul, William
    Rivera, Corban
    Shah, Ketul
    de Melo, Celso M.
    Chellappa, Rama
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18919 - 18929
  • [29] Unsupervised data labeling and incremental cross-domain training for enhanced hybrid eye gaze estimation
    de la Santa, Alejandro Garcia
    Muguerza, Javier
    Perez, David Lopez
    Elordi, Unai
    Unzueta, Luis
    Villanueva, Arantxa
    PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, ETRA 2024, 2024,
  • [30] Heterogeneous separation consistency training for adaptation of unsupervised speech separation
    Jiangyu Han
    Yanhua Long
    EURASIP Journal on Audio, Speech, and Music Processing, 2023