Exploring Modulation Spectrum Features for Speech-Based Depression Level Classification

被引:0
|
作者
Bozkurt, Elif [1 ]
Toledo-Ronen, Orith [2 ]
Sorin, Alexander [2 ]
Hoory, Ron [2 ]
机构
[1] Koc Univ, Multimedia Vis & Graph Lab, Istanbul, Turkey
[2] Haifa Univ Mt Carmel, IBM Res Haifa, Haifa, Israel
来源
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014年
关键词
depression assessment; modulation spectrum; prosody; feature fusion; decision fusion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a Modulation Spectrum-based manageable feature set for detection of depressed speech. Modulation Spectrum (MS) is obtained from the conventional speech spectrogram by spectral analysis along the temporal trajectories of the acoustic frequency bins. While MS representation of speech provides rich and high-dimensional joint frequency information, extraction of discriminative features from it remains as an open question. We propose a lower dimensional representation, which first employs a Mel frequency filterbank in the acoustic frequency domain and Discrete Cosine Transform in the modulation frequency domain, and then applies feature selection in both domains. We compare and fuse the proposed feature set with other complementary prosodic and spectral features at the feature and decision levels. In our experiments, we use Support Vector Machines for discriminating the depressed speech in a speaker-independent fashion. Feature-level fusion of the proposed MS-based features with other prosodic and spectral features after dimension reduction provides up to 9% improvement over the baseline results and also correlates the most with clinical ratings of patients' depression level.
引用
收藏
页码:1243 / 1247
页数:5
相关论文
共 50 条
  • [41] MODULATION SPECTRUM BASED BEAMFORMING FOR SPEECH ENHANCEMENT
    Karimian-Azari, Sam
    Falk, Tiago H.
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 91 - 95
  • [42] Imagined speech classification exploiting EEG power spectrum features
    Hossain, Arman
    Khan, Protima
    Kader, Md. Fazlul
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (08) : 2529 - 2544
  • [43] Modulation classification based on multifractal features
    He Tao
    Zhou Zheng-ou
    Li Xi-rong
    2006 6TH INTERNATIONAL CONFERENCE ON ITS TELECOMMUNICATIONS PROCEEDINGS, 2006, : 152 - +
  • [44] Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features
    Yetkin, Ahmet Kemal
    Kose, Hatice
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [45] Investigating Speech-based Features in Differentiating Obstructive Sleep Apnea in People Experiencing Homelessness
    Taghibeyglou, B.
    Chow, A.
    Mclaurin, P.
    Yasokaran, O.
    Adams, R.
    Mohammed, M.
    Singh, M.
    Ayas, N.
    Pendharkar, S. R.
    Almeida, F. R.
    Rac, V.
    Saha, S.
    Yadollahi, A.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2024, 209
  • [46] Acoustic Features for Classification Based Speech Separation
    Wang, Yuxuan
    Han, Kun
    Wang, DeLiang
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1530 - 1533
  • [47] SPEECH-BASED EMOTION CLASSIFICATION USING MULTICLASS SVM WITH HYBRID KERNEL AND THRESHOLDING FUSION
    Yang, N.
    Muraleedharan, R.
    Kohl, J.
    Demirkol, I.
    Heinzelman, W.
    Sturge-Apple, M.
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 455 - 460
  • [48] Critical analysis of the impact of glottal features in the classification of clinical depression in speech
    Moore, Elliot, II
    Clements, Mark A.
    Peifer, John W.
    Weisser, Lydia
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (01) : 96 - 107
  • [49] Evaluation of objective features for classification of clinical depression in speech by genetic programming
    Torres, Juan
    Saad, Ashraf
    Moore, Elliot
    SOFT COMPUTING IN INDUSTRIAL APPLICATIONS: RECENT AND EMERGING METHODS AND TECHNIQUES, 2007, 39 : 132 - +
  • [50] Exploring the Potential of Speech-based Virtual Assistants in Mixed Reality Applications for People with Cognitive Disabilities
    Vona, Francesco
    Torelli, Emanuele
    Beccaluva, Eleonora
    Garzotto, Franca
    PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2020, 2020,