Assessing speaker independence on a speech-based depression level estimation system

Cited by: 20

Authors:

Lopez-Otero, Paula [1]

Docio-Fernandez, Laura [1]

Garcia-Mateo, Carmen [1]

Affiliations:

[1] Univ Vigo, AtlantTIC Res Ctr, EE Telecomunicac, Vigo 36310, Spain

Source:

PATTERN RECOGNITION LETTERS | 2015 / Vol. 68

Keywords:

Soft biometrics; Depression; iVectors; Speaker independence; INVENTORY;

DOI:

10.1016/j.patrec.2015.05.017

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Soft biometrics refers to traits that provide valuable information about an individual without being sufficient for their authentication, as they lack uniqueness and distinctiveness. This definition includes features related to the psychological state of individuals, such as emotions or mental health disorders like depression. Depression has recently been attracting the attention of speech researchers, with the Audio/Visual Emotion Challenge (AVEC) 2013 and 2014 organized to encourage researchers to develop approaches to accurately estimate speaker depression level. The evaluation frameworks provided for these challenges do not take speaker independence into account in experiment design, despite this being an important factor in developing a robust speech-based system. We assess the influence of prior knowledge of the speakers in a depression estimation experiment, using an iVector-based state-of-the-art approach to depression level estimation to perform a speaker-dependent experiment and a speaker-independent experiment. We conclude that having previous information about the depression level of a given speaker dramatically improves system performance. Hence, we suggest that experimental frameworks must be carefully designed in order to serve as a genuinely useful resource for the development of robust depression estimation systems. (C) 2015 Elsevier B.V. All rights reserved.
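The distinction between a speaker-dependent and a speaker-independent experiment can be illustrated with a minimal sketch. This is not the authors' protocol or the AVEC partitioning. The function names, the toy speaker and recording identifiers, and the 50% test fraction are all assumptions made for illustration. The sketch only shows what it means for a train/test partition to share (or not share) speakers.

```python
import random
from collections import defaultdict


def speaker_independent_split(recordings, test_fraction=0.5, seed=0):
    """Split (speaker_id, recording_id) pairs so that no speaker
    appears in both the training and the test partition."""
    by_speaker = defaultdict(list)
    for speaker_id, recording_id in recordings:
        by_speaker[speaker_id].append((speaker_id, recording_id))

    # Shuffle speakers (not recordings), then assign whole speakers to a side.
    speakers = sorted(by_speaker)
    random.Random(seed).shuffle(speakers)
    n_test = max(1, round(len(speakers) * test_fraction))
    test_speakers = set(speakers[:n_test])

    train = [r for s in speakers if s not in test_speakers for r in by_speaker[s]]
    test = [r for s in speakers if s in test_speakers for r in by_speaker[s]]
    return train, test


def shared_speakers(train, test):
    """Return the set of speakers present in both partitions."""
    return {s for s, _ in train} & {s for s, _ in test}


# Hypothetical toy data: 4 speakers with 3 recordings each.
recordings = [(f"spk{i}", f"rec{i}_{j}") for i in range(4) for j in range(3)]

# Speaker-independent: whole speakers are held out.
train, test = speaker_independent_split(recordings)

# Speaker-dependent (for contrast): recordings are split without regard
# to speaker identity, so the same speakers end up on both sides.
dep_train, dep_test = recordings[0::2], recordings[1::2]

print("independent split shares:", shared_speakers(train, test))
print("dependent split shares:", sorted(shared_speakers(dep_train, dep_test)))
```

In the speaker-dependent partition a system can benefit from having seen other recordings of the test speakers, which is the effect the abstract reports as dramatically improving performance.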


Pages: 343-350

Page count: 8
