Cross-modal dynamic sentiment annotation for speech sentiment analysis

被引：0

作者：

Chen, Jincai ^{[1
]}

Sun, Chao ^{[1
]}

Zhang, Sheng ^{[1
]}

Zeng, Jiangfeng ^{[2
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China

[2] Cent China Normal Univ, Sch Informat Management, Wuhan 430079, Peoples R China

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2023年 / 106卷

基金：

中国国家自然科学基金;

关键词：

Speech sentiment analysis; Multi-modal video; Sentiment profiles; Cross-modal annotation;

D O I：

10.1016/j.compeleceng.2023.108598

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Traditionally, one single hard label determines the sentiment label of an entire utterance for speech sentiment analysis. It obviously ignores the inherent dynamic and ambiguity of speech sentiments. Moreover, there are few segment-level ground truth labels in the most existing sentiment corpora, due to the label ambiguity and annotation cost. In this work, to capture segment-level sentiment fluctuations across one utterance, we propose sentiment profiles (SPs) to express segment-level soft labels. Meanwhile, we introduce massive multi-modal wild video data to solve the data shortage problem, and facial expression knowledge is used to guide audio segments generate soft labels through the Cross-modal Sentiment Annotation Module. Then, we design a Speech Encoder Module to encode audio segments into SPs. We further exploit the sentiment profile purifier (SPP) to iteratively improve the accuracy of SPs. Numerous experiments show that our model achieves state-of-the-art performance on CH-SIMS and IEMOCAP datasets with unlabeled data respectively.

引用

页数：14

共 50 条

[41] Target-Oriented Sentiment Classification with Sequential Cross-Modal Semantic Graph
Huang, Yufeng
Chen, Zhuo
Chen, Jiaoyan
Pan, Jeff Z.
Yao, Zhen
Zhang, Wen
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 587 - 599
[42] Automatic Sentiment Annotation of Idiomatic Expressions for Sentiment Analysis Task
Tahayna, Bashar M. A.
Ayyasamy, Ramesh Kumar
Akbar, Rehan
IEEE ACCESS, 2022, 10 : 122234 - 122242
[43] Multimodal sentiment analysis model based on multi-task learning and stacked cross-modal Transformer
Chen Q.-H.
Sun J.-J.
Lou Y.-B.
Fang Z.-J.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (12): : 2421 - 2429
[44] Text-dominant multimodal perception network for sentiment analysis based on cross-modal semantic enhancements
Li, Zuhe
Liu, Panbo
Pan, Yushan
Yu, Jun
Liu, Weihua
Chen, Haoran
Luo, Yiming
Wang, Hao
APPLIED INTELLIGENCE, 2025, 55 (02)
[45] Cross-Modal Multitask Transformer for End-to-End Multimodal Aspect-Based Sentiment Analysis
Yang, Li
Na, Jin-Cheon
Yu, Jianfei
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (05)
[46] Deep Coordinated Textual and Visual Network for Sentiment-Oriented Cross-Modal Retrieval
Fu, Jiamei
She, Dongyu
Yao, Xingxu
Zhang, Yuxiang
Yang, Jufeng
PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 684 - 696
[47] Social Image-Text Sentiment Classification With Cross-Modal Consistency and Knowledge Distillation
Liu, Huan
Li, Ke
Fan, Jianping
Yan, Caixia
Qin, Tao
Zheng, Qinghua
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3332 - 3344
[48] Cross-Modal Sentiment Sensing with Visual-Augmented Representation and Diverse Decision Fusion
Zhang, Sun
Li, Bo
Yin, Chunyong
SENSORS, 2022, 22 (01)
[49] Annotation of a Corpus of Tweets for Sentiment Analysis
dos Santos, Allisfrank
Barros Junior, Jorge Daniel
Camargo, Heloisa de Arruda
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 294 - 302
[50] A Text-Centered Shared-Private Framework via Cross-Modal Prediction for Multimodal Sentiment Analysis
Wu, Yang
Lin, Zijie
Zhao, Yanyan
Qin, Bing
Zhu, Li-Nan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4730 - 4738

← 1 2 3 4 5 →