Assessing speaker independence on a speech-based depression level estimation system

被引：20

作者：

Lopez-Otero, Paula ^{[1
]}

Docio-Fernandez, Laura ^{[1
]}

Garcia-Mateo, Carmen ^{[1
]}

机构：

[1] Univ Vigo, AtlantTIC Res Ctr, EE Telecomunicac, Vigo 36310, Spain

来源：

PATTERN RECOGNITION LETTERS | 2015年 / 68卷

关键词：

Soft biometrics; Depression; iVectors; Speaker independence; INVENTORY;

D O I：

10.1016/j.patrec.2015.05.017

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Soft biometrics refers to traits that provide valuable information about an individual without being sufficient for their authentication, as they lack uniqueness and distinctiveness. This definition includes features related to the psychological state of individuals, such as emotions or mental health disorders like depression. Depression has recently been attracting the attention of speech researchers, with audio/visual emotion challenge (AVEC) 2013 and 2014 organized to encourage researchers to develop approaches to accurately estimate speaker depression level. The evaluation frameworks provided for these evaluations do not take speaker independence into account in experiment design, despite this being an important factor in developing a robust speech based system. We assess the influence of prior knowledge of the speakers in a depression estimation experiment, using an iVector-based state-of-the-art approach to depression level estimation to perform a speaker-dependent experiment and a speaker-independent experiment. We conclude that having previous information about the depression level of a given speaker dramatically improves system performance. Hence, we suggest that experimental frameworks must be carefully designed in order to serve as a genuinely useful resource for the development of robust depression estimation systems. (C) 2015 Elsevier B.V. All rights reserved.

引用

页码：343 / 350

页数：8

共 50 条

[41] Analysis of Phonetic Markedness and Gestural Effort Measures for Acoustic Speech-Based Depression Classification
Stasak, Brian
Epps, Julien
Lawson, Aaron
2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2017, : 165 - 170
[42] "Assistance-On-Demand": a Speech-Based Assistance System for Urban Intersections
Schoemig, Nadja
Maag, Christian
Heckmann, Martin
Neukum, Alexandra
Wersing, Heiko
AUTOMOTIVEUI 2016: 8TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2016, : 51 - 56
[43] Deploying a speech-based information system as a research platform for speech recognition research in real environments
Nishimura, R
Nishihara, Y
Tsurumi, R
Lee, A
Saruwatari, H
Shikano, K
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (12): : 43 - 54
[44] Speaker recognition system based on pitch estimation
Ben Jdira, Makrem
Jemaa, Imen
Ouni, Kais
2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
[45] An adaptive speech enhancement system based on noise level estimation and lateral inhibition
Choi, Jae Seung
ACTA ACUSTICA UNITED WITH ACUSTICA, 2007, 93 (04) : 632 - 644
[46] Speech-Based L2 Call System for English Foreign Speakers
Ateeq, Mohammad
Hanani, Abualsoud
SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 43 - 53
[47] SPEECH BANDWIDTH EXTENSION BASED ON SPEECH PHONETIC CONTENT AND SPEAKER VOCAL TRACT SHAPE ESTIMATION
Katsir, Itai
Cohen, Israel
Malah, David
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 461 - 465
[48] Fuzzy Estimation System on Gait Independence Level by Footprint Dynamics
Yagi, Takamoto
Takeda, Takahiro
Sueyoshi, Katsunori
Ohshiro, Yoshitetsu
Hata, Yutaka
6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1265 - 1268
[49] Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection
Huang, Zhaocheng
Epps, Julien
Joachim, Dale
Sethu, Vidhyasaharan
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 435 - 448
[50] Speaker Authentication System Based on Voice Biometrics and Speech Recognition
Dovydaitis, Laurynas
Rasymas, Tomas
Rudzionis, Vytautas
BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 79 - 84

← 1 2 3 4 5 →