Cross-modal dynamic sentiment annotation for speech sentiment analysis

被引:0
|
作者
Chen, Jincai [1 ]
Sun, Chao [1 ]
Zhang, Sheng [1 ]
Zeng, Jiangfeng [2 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[2] Cent China Normal Univ, Sch Informat Management, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech sentiment analysis; Multi-modal video; Sentiment profiles; Cross-modal annotation;
D O I
10.1016/j.compeleceng.2023.108598
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, one single hard label determines the sentiment label of an entire utterance for speech sentiment analysis. It obviously ignores the inherent dynamic and ambiguity of speech sentiments. Moreover, there are few segment-level ground truth labels in the most existing sentiment corpora, due to the label ambiguity and annotation cost. In this work, to capture segment-level sentiment fluctuations across one utterance, we propose sentiment profiles (SPs) to express segment-level soft labels. Meanwhile, we introduce massive multi-modal wild video data to solve the data shortage problem, and facial expression knowledge is used to guide audio segments generate soft labels through the Cross-modal Sentiment Annotation Module. Then, we design a Speech Encoder Module to encode audio segments into SPs. We further exploit the sentiment profile purifier (SPP) to iteratively improve the accuracy of SPs. Numerous experiments show that our model achieves state-of-the-art performance on CH-SIMS and IEMOCAP datasets with unlabeled data respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Target-Oriented Sentiment Classification with Sequential Cross-Modal Semantic Graph
    Huang, Yufeng
    Chen, Zhuo
    Chen, Jiaoyan
    Pan, Jeff Z.
    Yao, Zhen
    Zhang, Wen
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 587 - 599
  • [42] Automatic Sentiment Annotation of Idiomatic Expressions for Sentiment Analysis Task
    Tahayna, Bashar M. A.
    Ayyasamy, Ramesh Kumar
    Akbar, Rehan
    IEEE ACCESS, 2022, 10 : 122234 - 122242
  • [43] Multimodal sentiment analysis model based on multi-task learning and stacked cross-modal Transformer
    Chen Q.-H.
    Sun J.-J.
    Lou Y.-B.
    Fang Z.-J.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (12): : 2421 - 2429
  • [44] Text-dominant multimodal perception network for sentiment analysis based on cross-modal semantic enhancements
    Li, Zuhe
    Liu, Panbo
    Pan, Yushan
    Yu, Jun
    Liu, Weihua
    Chen, Haoran
    Luo, Yiming
    Wang, Hao
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [45] Cross-Modal Multitask Transformer for End-to-End Multimodal Aspect-Based Sentiment Analysis
    Yang, Li
    Na, Jin-Cheon
    Yu, Jianfei
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (05)
  • [46] Deep Coordinated Textual and Visual Network for Sentiment-Oriented Cross-Modal Retrieval
    Fu, Jiamei
    She, Dongyu
    Yao, Xingxu
    Zhang, Yuxiang
    Yang, Jufeng
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 684 - 696
  • [47] Social Image-Text Sentiment Classification With Cross-Modal Consistency and Knowledge Distillation
    Liu, Huan
    Li, Ke
    Fan, Jianping
    Yan, Caixia
    Qin, Tao
    Zheng, Qinghua
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3332 - 3344
  • [48] Cross-Modal Sentiment Sensing with Visual-Augmented Representation and Diverse Decision Fusion
    Zhang, Sun
    Li, Bo
    Yin, Chunyong
    SENSORS, 2022, 22 (01)
  • [49] Annotation of a Corpus of Tweets for Sentiment Analysis
    dos Santos, Allisfrank
    Barros Junior, Jorge Daniel
    Camargo, Heloisa de Arruda
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 294 - 302
  • [50] A Text-Centered Shared-Private Framework via Cross-Modal Prediction for Multimodal Sentiment Analysis
    Wu, Yang
    Lin, Zijie
    Zhao, Yanyan
    Qin, Bing
    Zhu, Li-Nan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4730 - 4738