Testing acoustic voice quality classification across languages and speech styles

Cited by: 1
Authors
Braun, Bettina [1]
Dehe, Nicole [1]
Einfeldt, Marieke [1]
Wochner, Daniela [1]
Zahner-Ritter, Katharina [2]
Affiliations
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
Source
Keywords
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; INFANT-DIRECTED SPEECH; PERCEPTION; EMOTION; BREATHY; FEMALE;
DOI
10.21437/Interspeech.2021-315
Chinese Library Classification (CLC)
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement and testing it on speech samples from a different language and from a different speech style. Annotations were made on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest on Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model reached an accuracy of 78.7% on unseen data from the same sample; the most important variables were harmonics-to-noise ratio, cepstral peak prominence, and H1-A2. This model was then used to classify data from a different language (Icelandic, Experiment 2) and from a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but lower for German IDS (71.7% accuracy). The accuracy on recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting, that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
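The accuracy figures in the abstract compare classifier output with perceptual annotations over three categories. The following is a minimal sketch of how such an accuracy and a per-class recall could be computed; the function names and the toy labels are hypothetical illustrations, not the authors' code or data. On a balanced three-class set, random guessing would be expected to reach about 33% accuracy, which gives a reference point for the reported values.

```python
from collections import Counter

# The three voice quality categories named in the abstract.
LABELS = ("modal", "breathy", "glottalized")


def accuracy(gold, predicted):
    """Share of tokens whose predicted label matches the perceptual annotation."""
    if len(gold) != len(predicted) or not gold:
        raise ValueError("gold and predicted must be non-empty and equally long")
    hits = sum(g == p for g, p in zip(gold, predicted))
    return hits / len(gold)


def per_class_recall(gold, predicted):
    """Recall per category: correctly classified tokens / annotated tokens."""
    totals = Counter(gold)
    correct = Counter(g for g, p in zip(gold, predicted) if g == p)
    return {label: correct[label] / totals[label] for label in LABELS if totals[label]}


# Hypothetical toy annotations and classifier outputs (NOT data from the study).
gold = ["modal", "modal", "breathy", "breathy", "glottalized", "glottalized"]
predicted = ["modal", "breathy", "breathy", "breathy", "glottalized", "modal"]

chance = 1 / len(LABELS)  # expected accuracy of random guessing on a balanced set
print(f"accuracy: {accuracy(gold, predicted):.3f} (chance: {chance:.3f})")
print(per_class_recall(gold, predicted))
```

Per-class recall is included because an overall accuracy can hide a category that is classified poorly, which matters when a model is transferred to a new language or speech style.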
Pages: 3920-3924
Number of pages: 5