Depressive Tendency Recognition by Fusing Speech and Text Features: A Comparative Analysis

被引:0
|
作者
He, Yimin [1 ]
Lu, Xiaoyong [1 ]
Yuan, Jingyi [1 ]
Pan, Tao [1 ]
Wang, Yafan [1 ]
机构
[1] Northwest Normal Univ Lanzhou, Lanzhou, Gansu, Peoples R China
来源
2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2022年
基金
美国国家科学基金会;
关键词
Depression recognition; Deep learning; GRU; Multimodal;
D O I
10.1109/ISCSLP57327.2022.10038078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression will be accompanied by long-term depression, loss of interest, excessive guilt, and other states, seriously affecting people's physical and mental health, causing certain harm to the individual family and society. Depressed people speak slowly, single tone, and the pause time is longer. At the same time, the expression of emotion is often accompanied by many negative words. Therefore, the combination of speech and text information using Gated Recurrent Neural Network for depression prediction, this fusion method from the signal layer to the language layer to analyze the data. It is more comprehensive than only use a single speech feature or text feature model. At the same time, the comparative analysis of different speech types, three kinds of emotional stimulus corpus, and gender was carried out. The multimodal system is more effective than a single modality, and speech type, emotional stimulus, and gender have certain effects on depressive recognition.
引用
收藏
页码:344 / 348
页数:5
相关论文
共 50 条
  • [1] Depressive Tendency Recognition Using the Gated Recurrent Unit From Speech and Text Features
    Yuan, Jingyi
    Lu, Xiaoyong
    Liu, Yang
    Shi, Daimin
    Pan, Tao
    Li, Yanqin
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 42 - 46
  • [2] System for judgment of depressive tendency from speech analysis
    Suzuki, Tomoko
    Tamura, Naoyoshi
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2007, 90 (10): : 103 - 114
  • [3] Improving Speech Translation by Fusing Speech and Text
    Yin, Wenbiao
    Liu, Zhicheng
    Zhao, Chengqi
    Wang, Tao
    Tong, Jian
    Ye, Rong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 6262 - 6273
  • [4] Comparative Analysis of Using Different Text Features, Models, and Methods in Text Author Recognition
    Azimov, R. B.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2024, 60 (05) : 711 - 725
  • [5] Speech-Visual Emotion Recognition by Fusing Shared and Specific Features
    Chen, Guanghui
    Jiao, Shuang
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 678 - 682
  • [6] Comparative analysis of Dysarthric speech recognition: multiple features and robust templates
    Revathi, Arunachalam
    Nagakrishnan, R.
    Sasikaladevi, N.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 31245 - 31259
  • [7] Comparative analysis of Dysarthric speech recognition: multiple features and robust templates
    Arunachalam Revathi
    R. Nagakrishnan
    N. Sasikaladevi
    Multimedia Tools and Applications, 2022, 81 : 31245 - 31259
  • [8] Depression recognition using a proposed speech chain model fusing speech production and perception features
    Du, Minghao
    Liu, Shuang
    Wang, Tao
    Zhang, Wenquan
    Ke, Yufeng
    Chen, Long
    Ming, Dong
    JOURNAL OF AFFECTIVE DISORDERS, 2023, 323 : 299 - 308
  • [9] SPEECH VS. TEXT: A COMPARATIVE ANALYSIS OF FEATURES FOR DEPRESSION DETECTION SYSTEMS
    Morales, Michelle Renee
    Levitan, Rivka
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 136 - 143
  • [10] Medical Named Entity Recognition Fusing Part-of-Speech and Stroke Features
    Yi, Fen
    Liu, Hong
    Wang, You
    Wu, Sheng
    Sun, Cheng
    Feng, Peng
    Zhang, Jin
    APPLIED SCIENCES-BASEL, 2023, 13 (15):