Assessing speaker independence on a speech-based depression level estimation system

Cited by: 20
Authors
Lopez-Otero, Paula [1]
Docio-Fernandez, Laura [1]
Garcia-Mateo, Carmen [1]
Affiliations
[1] Univ Vigo, AtlantTIC Res Ctr, EE Telecomunicac, Vigo 36310, Spain
Keywords
Soft biometrics; Depression; iVectors; Speaker independence; INVENTORY
DOI
10.1016/j.patrec.2015.05.017
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Soft biometrics refers to traits that provide valuable information about an individual without being sufficient for their authentication, as they lack uniqueness and distinctiveness. This definition includes features related to the psychological state of individuals, such as emotions or mental health disorders like depression. Depression has recently been attracting the attention of speech researchers, with the Audio/Visual Emotion Challenge (AVEC) 2013 and 2014 organized to encourage researchers to develop approaches to accurately estimate speaker depression level. The evaluation frameworks provided for these evaluations do not take speaker independence into account in experiment design, despite this being an important factor in developing a robust speech-based system. We assess the influence of prior knowledge of the speakers in a depression estimation experiment, using an iVector-based state-of-the-art approach to depression level estimation to perform a speaker-dependent experiment and a speaker-independent experiment. We conclude that having previous information about the depression level of a given speaker dramatically improves system performance. Hence, we suggest that experimental frameworks must be carefully designed in order to serve as a genuinely useful resource for the development of robust depression estimation systems. © 2015 Elsevier B.V. All rights reserved.
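
The distinction the abstract draws between speaker-dependent and speaker-independent experiments comes down to how recordings are partitioned. The following is a minimal sketch of a speaker-disjoint split, not the authors' protocol: the function names, the toy speaker labels, and the 25% test fraction are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def speaker_independent_split(speaker_ids, test_fraction=0.3, seed=0):
    """Split recording indices so that no speaker is in both partitions.

    speaker_ids[i] is the (hypothetical) speaker label of recording i.
    Whole speakers, not individual recordings, are assigned to the test set.
    """
    by_speaker = defaultdict(list)
    for idx, spk in enumerate(speaker_ids):
        by_speaker[spk].append(idx)

    # Shuffle the speakers themselves, then hold out a fraction of them.
    speakers = sorted(by_speaker)
    random.Random(seed).shuffle(speakers)
    n_test = max(1, round(len(speakers) * test_fraction))
    test_speakers = set(speakers[:n_test])

    train_idx = [i for i, s in enumerate(speaker_ids) if s not in test_speakers]
    test_idx = [i for i, s in enumerate(speaker_ids) if s in test_speakers]
    return train_idx, test_idx


def shared_speakers(speaker_ids, train_idx, test_idx):
    """Return the speakers that appear in both partitions (leakage check)."""
    train = {speaker_ids[i] for i in train_idx}
    test = {speaker_ids[i] for i in test_idx}
    return train & test


# Toy data: four hypothetical speakers with two recordings each.
speaker_ids = ["s1", "s1", "s2", "s2", "s3", "s3", "s4", "s4"]
train_idx, test_idx = speaker_independent_split(speaker_ids, test_fraction=0.25)
leak = shared_speakers(speaker_ids, train_idx, test_idx)
```

A speaker-dependent design would instead shuffle the recordings directly, so the same speaker can land in both partitions and `shared_speakers` would typically be non-empty. That overlap is the kind of prior speaker knowledge the abstract reports as inflating performance.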
Pages: 343-350
Number of pages: 8