Exploring Modulation Spectrum Features for Speech-Based Depression Level Classification

被引:0
|
作者
Bozkurt, Elif [1 ]
Toledo-Ronen, Orith [2 ]
Sorin, Alexander [2 ]
Hoory, Ron [2 ]
机构
[1] Koc Univ, Multimedia Vis & Graph Lab, Istanbul, Turkey
[2] Haifa Univ Mt Carmel, IBM Res Haifa, Haifa, Israel
关键词
depression assessment; modulation spectrum; prosody; feature fusion; decision fusion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a Modulation Spectrum-based manageable feature set for detection of depressed speech. Modulation Spectrum (MS) is obtained from the conventional speech spectrogram by spectral analysis along the temporal trajectories of the acoustic frequency bins. While MS representation of speech provides rich and high-dimensional joint frequency information, extraction of discriminative features from it remains as an open question. We propose a lower dimensional representation, which first employs a Mel frequency filterbank in the acoustic frequency domain and Discrete Cosine Transform in the modulation frequency domain, and then applies feature selection in both domains. We compare and fuse the proposed feature set with other complementary prosodic and spectral features at the feature and decision levels. In our experiments, we use Support Vector Machines for discriminating the depressed speech in a speaker-independent fashion. Feature-level fusion of the proposed MS-based features with other prosodic and spectral features after dimension reduction provides up to 9% improvement over the baseline results and also correlates the most with clinical ratings of patients' depression level.
引用
收藏
页码:1243 / 1247
页数:5
相关论文
共 50 条
  • [1] SPEECH-BASED STRESS CLASSIFICATION BASED ON MODULATION SPECTRAL FEATURES AND CONVOLUTIONAL NEURAL NETWORKS
    Avila, Anderson R.
    Kshirsagar, Shruti R.
    Tiwari, Abhishek
    Lafond, Daniel
    O'Shaughnessy, Douglas
    Falk, Tiago H.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [2] GLOTTAL FEATURES FOR SPEECH-BASED COGNITIVE LOAD CLASSIFICATION
    Yap, Tet Fei
    Epps, Julien
    Choi, Eric H. C.
    Ambikairajah, Eliathamby
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5234 - 5237
  • [3] Glottal Source Features for Automatic Speech-based Depression Assessment
    Simantiraki, Olympia
    Charonyktakis, Paulos
    Pampouchidou, Anastasia
    Tsiknakis, Manolis
    Cooker, Martin
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2700 - 2704
  • [4] Avoiding dominance of speaker features in speech-based depression detection
    Zuo, Lishi
    Mak, Man-Wai
    PATTERN RECOGNITION LETTERS, 2023, 173 : 50 - 56
  • [5] Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features
    Cummins, Nicholas
    Vlasenko, Bogdan
    Sagha, Hesam
    Schuller, Bjoern
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2017, 2017, 10259 : 209 - 214
  • [6] Siamese Neural Network for Speech-Based Depression Classification and Severity Assessment
    Ntalampiras, Stavros
    Qi, Wen
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024, : 577 - 593
  • [7] Differential Performance of Automatic Speech-Based Depression Classification Across Smartphones
    Stasak, Brian
    Epps, Julien
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2017, : 171 - 175
  • [8] Assessing speaker independence on a speech-based depression level estimation system
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Garcia-Mateo, Carmen
    PATTERN RECOGNITION LETTERS, 2015, 68 : 343 - 350
  • [9] Analysis of Phonetic Markedness and Gestural Effort Measures for Acoustic Speech-Based Depression Classification
    Stasak, Brian
    Epps, Julien
    Lawson, Aaron
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2017, : 165 - 170
  • [10] Speech-based Evaluation of Emotions-Depression Correlation
    Verde, Laura
    Campanile, Lelio
    Marulli, Fiammetta
    Marrone, Stefano
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 324 - 329