Deep learning-based multimodal emotion recognition from audio, visual, and text modalities: A systematic review of recent advancements and future prospects
被引:46
|
作者:
Zhang, Shiqing
论文数: 0引用数: 0
h-index: 0
机构:
Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Zhang, Shiqing
[1
]
Yang, Yijiao
论文数: 0引用数: 0
h-index: 0
机构:
Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Yang, Yijiao
[1
]
Chen, Chen
论文数: 0引用数: 0
h-index: 0
机构:
Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Chen, Chen
[1
]
Zhang, Xingnan
论文数: 0引用数: 0
h-index: 0
机构:
Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Zhang, Xingnan
[1
]
Leng, Qingming
论文数: 0引用数: 0
h-index: 0
机构:
Jiujiang Univ, Sch Elect & Informat Engn, Jiujiang 332005, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Leng, Qingming
[2
]
Zhao, Xiaoming
论文数: 0引用数: 0
h-index: 0
机构:
Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R ChinaTaizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
Zhao, Xiaoming
[1
]
机构:
[1] Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Zhejiang, Peoples R China
[2] Jiujiang Univ, Sch Elect & Informat Engn, Jiujiang 332005, Peoples R China
Multimodal emotion recognition;
Deep learning;
Feature extraction;
Multimodal information fusion;
review;
FACIAL EXPRESSION RECOGNITION;
INFORMATION FUSION;
AFFECTIVE FEATURES;
SENTIMENT ANALYSIS;
NEURAL-NETWORKS;
SPEECH;
DATABASES;
MODEL;
DIMENSIONALITY;
SIGNALS;
D O I:
10.1016/j.eswa.2023.121692
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Emotion recognition has recently attracted extensive interest due to its significant applications to human-computer interaction. The expression of human emotion depends on various verbal and non-verbal languages like audio, visual, text, etc. Emotion recognition is thus well suited as a multimodal rather than single-modal learning problem. Owing to the powerful feature learning capability, extensive deep learning methods have been recently leveraged to capture high-level emotional feature representations for multimodal emotion recognition (MER). Therefore, this paper makes the first effort in comprehensively summarize recent advances in deep learning-based multimodal emotion recognition (DL-MER) involved in audio, visual, and text modalities. We focus on: (1) MER milestones are given to summarize the development tendency of MER, and conventional multimodal emotional datasets are provided; (2) The core principles of typical deep learning models and its recent advancements are overviewed; (3) A systematic survey and taxonomy is provided to cover the state-of-theart methods related to two key steps in a MER system, including feature extraction and multimodal information fusion; (4) The research challenges and open issues in this field are discussed, and promising future directions are given.
机构:
National Institute of Technology Rourkela, Department of Electronics and Communication Engineering, Odisha, Rourkela,769008, IndiaNational Institute of Technology Rourkela, Department of Electronics and Communication Engineering, Odisha, Rourkela,769008, India
Diya, V.A.
论文数: 引用数:
h-index:
机构:
Meher, Sukadev
论文数: 引用数:
h-index:
机构:
Panda, Rutuparna
Abraham, Ajith
论文数: 0引用数: 0
h-index: 0
机构:
Machine Intelligence Research Laboratories, Machine Intelligence Research Department, Auburn,WA,98071, United States
Innopolis University, Center for Artificial Intelligence, Innopolis,420500, RussiaNational Institute of Technology Rourkela, Department of Electronics and Communication Engineering, Odisha, Rourkela,769008, India
机构:
Aging Research Center, Karolinska Institutet, SwedenAging Research Center, Karolinska Institutet, Sweden
Javeed, Ashir
Khan, Shafqat Ullah
论文数: 0引用数: 0
h-index: 0
机构:
Department of Electrical Engineering, University of Science and Technology, Bannu, PakistanAging Research Center, Karolinska Institutet, Sweden
Khan, Shafqat Ullah
Ali, Liaqat
论文数: 0引用数: 0
h-index: 0
机构:
Department of Electronics, University of Buner, Buner, PakistanAging Research Center, Karolinska Institutet, Sweden
Ali, Liaqat
Ali, Sardar
论文数: 0引用数: 0
h-index: 0
机构:
School of Engineering and Applied Sciences, Isra University, Islamabad Campus, PakistanAging Research Center, Karolinska Institutet, Sweden
Ali, Sardar
Imrana, Yakubu
论文数: 0引用数: 0
h-index: 0
机构:
School of Engineering, University of Development Studies, Tamale, Ghana
School of Computer Science and Engineering, University of Electronic Science and Technology of China (UESTC), Chengdu, ChinaAging Research Center, Karolinska Institutet, Sweden
Imrana, Yakubu
Rahman, Atiqur
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science, University of Science and Technology, Bannu, PakistanAging Research Center, Karolinska Institutet, Sweden