Estimation of affective dimensions using CNN-based features of audiovisual data

Cited by: 2

Authors:

Basnet, Ramesh [1]

Islam, Mohammad Tariqul [2]

Howlader, Tamanna [3]

Rahman, S. M. Mahbubur [2]

Hatzinakos, Dimitrios [4]

Affiliations:

[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada

[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1205, Bangladesh

[3] Univ Dhaka, Inst Stat Res & Training, Dhaka 1000, Bangladesh

[4] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 2E4, Canada

Source:

PATTERN RECOGNITION LETTERS | 2019, Volume 128

Keywords:

Convolutional neural network; Affective features; Emotional dimensions; Recognition

DOI:

10.1016/j.patrec.2019.09.015

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Automatic estimation of emotional state has been of great interest because emotion is an important component of user-oriented interactive technologies. This paper investigates the use of feed-forward convolutional neural networks (CNNs), and of features extracted from such networks, for predicting the dimensions of continuous-level emotional states. In this context, a two-stream CNN architecture is proposed in which the video and audio data are learned simultaneously. End-to-end mapping of audiovisual data to emotional dimensions reveals that the two-stream network performs better than its single-stream counterpart. The representations learned by the CNNs are refined through a minimum redundancy maximum relevance statistical selection method. Support vector regression applied to the selected CNN-based features then estimates the instantaneous values of the emotional dimensions. The proposed method is trained and tested using the audiovisual conversations of the well-known RECOLA and SEMAINE databases. Experiments verify that regression on the CNN-based features outperforms both the traditional audiovisual affective features and the end-to-end CNN mapping. Generalization experiments also show that the learned representations are robust enough to provide acceptable prediction performance when the settings of the training and testing datasets are widely different. © 2019 Elsevier B.V. All rights reserved.
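The feature-refinement step named in the abstract (minimum redundancy maximum relevance selection ahead of a regressor) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no details of their selection criterion, so the sketch assumes the greedy "difference" form of mRMR and swaps in absolute Pearson correlation as a stand-in for the mutual information normally used. The names `pearson`, `mrmr_select`, `features`, and `y`, as well as the toy data, are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def mrmr_select(columns, target, k):
    """Greedy mRMR, difference form: pick up to k feature indices.

    Assumption: |Pearson correlation| replaces mutual information for both
    relevance (feature vs. target) and redundancy (feature vs. chosen features).
    """
    relevance = [abs(pearson(col, target)) for col in columns]
    selected = []
    remaining = list(range(len(columns)))
    while remaining and len(selected) < k:
        best, best_score = None, -math.inf
        for j in remaining:
            if selected:
                # Mean absolute correlation with the features already chosen.
                redundancy = sum(
                    abs(pearson(columns[j], columns[s])) for s in selected
                ) / len(selected)
            else:
                redundancy = 0.0
            score = relevance[j] - redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (hypothetical): three mutually orthogonal, zero-mean patterns.
a = [1, 1, -1, -1, 1, 1, -1, -1]
b = [1, -1, 1, -1, 1, -1, 1, -1]
c = [1, 1, 1, 1, -1, -1, -1, -1]

y = [2 * ai + bi for ai, bi in zip(a, b)]          # target depends on a and b
features = [
    a,                                             # index 0: strongly relevant
    [ai + 0.1 * ci for ai, ci in zip(a, c)],       # index 1: near-duplicate of 0
    b,                                             # index 2: weaker but complementary
]

chosen = mrmr_select(features, y, k=2)
print(chosen)
```

On this toy data the near-duplicate feature is passed over in favour of the complementary one, which is the behaviour the redundancy penalty is meant to produce. In a pipeline like the one the abstract describes, the selected columns would then be handed to a support vector regressor (for example scikit-learn's `sklearn.svm.SVR`), a step left out here to keep the sketch dependency-free.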


Pages: 290-297

Page count: 8

