Assessing speaker independence on a speech-based depression level estimation system

被引:20
|
作者
Lopez-Otero, Paula [1 ]
Docio-Fernandez, Laura [1 ]
Garcia-Mateo, Carmen [1 ]
机构
[1] Univ Vigo, AtlantTIC Res Ctr, EE Telecomunicac, Vigo 36310, Spain
关键词
Soft biometrics; Depression; iVectors; Speaker independence; INVENTORY;
D O I
10.1016/j.patrec.2015.05.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Soft biometrics refers to traits that provide valuable information about an individual without being sufficient for their authentication, as they lack uniqueness and distinctiveness. This definition includes features related to the psychological state of individuals, such as emotions or mental health disorders like depression. Depression has recently been attracting the attention of speech researchers, with audio/visual emotion challenge (AVEC) 2013 and 2014 organized to encourage researchers to develop approaches to accurately estimate speaker depression level. The evaluation frameworks provided for these evaluations do not take speaker independence into account in experiment design, despite this being an important factor in developing a robust speech based system. We assess the influence of prior knowledge of the speakers in a depression estimation experiment, using an iVector-based state-of-the-art approach to depression level estimation to perform a speaker-dependent experiment and a speaker-independent experiment. We conclude that having previous information about the depression level of a given speaker dramatically improves system performance. Hence, we suggest that experimental frameworks must be carefully designed in order to serve as a genuinely useful resource for the development of robust depression estimation systems. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:343 / 350
页数:8
相关论文
共 50 条
  • [41] Analysis of Phonetic Markedness and Gestural Effort Measures for Acoustic Speech-Based Depression Classification
    Stasak, Brian
    Epps, Julien
    Lawson, Aaron
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2017, : 165 - 170
  • [42] "Assistance-On-Demand": a Speech-Based Assistance System for Urban Intersections
    Schoemig, Nadja
    Maag, Christian
    Heckmann, Martin
    Neukum, Alexandra
    Wersing, Heiko
    AUTOMOTIVEUI 2016: 8TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2016, : 51 - 56
  • [43] Deploying a speech-based information system as a research platform for speech recognition research in real environments
    Nishimura, R
    Nishihara, Y
    Tsurumi, R
    Lee, A
    Saruwatari, H
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (12): : 43 - 54
  • [44] Speaker recognition system based on pitch estimation
    Ben Jdira, Makrem
    Jemaa, Imen
    Ouni, Kais
    2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
  • [45] An adaptive speech enhancement system based on noise level estimation and lateral inhibition
    Choi, Jae Seung
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2007, 93 (04) : 632 - 644
  • [46] Speech-Based L2 Call System for English Foreign Speakers
    Ateeq, Mohammad
    Hanani, Abualsoud
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 43 - 53
  • [47] SPEECH BANDWIDTH EXTENSION BASED ON SPEECH PHONETIC CONTENT AND SPEAKER VOCAL TRACT SHAPE ESTIMATION
    Katsir, Itai
    Cohen, Israel
    Malah, David
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 461 - 465
  • [48] Fuzzy Estimation System on Gait Independence Level by Footprint Dynamics
    Yagi, Takamoto
    Takeda, Takahiro
    Sueyoshi, Katsunori
    Ohshiro, Yoshitetsu
    Hata, Yutaka
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1265 - 1268
  • [49] Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection
    Huang, Zhaocheng
    Epps, Julien
    Joachim, Dale
    Sethu, Vidhyasaharan
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 435 - 448
  • [50] Speaker Authentication System Based on Voice Biometrics and Speech Recognition
    Dovydaitis, Laurynas
    Rasymas, Tomas
    Rudzionis, Vytautas
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 79 - 84