Estimation of affective dimensions using CNN-based features of audiovisual data

被引:2
|
作者
Basnet, Ramesh [1 ]
Islam, Mohammad Tariqul [2 ]
Howlader, Tamanna [3 ]
Rahman, S. M. Mahbubur [2 ]
Hatzinakos, Dimitrios [4 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1205, Bangladesh
[3] Univ Dhaka, Inst Stat Res & Training, Dhaka 1000, Bangladesh
[4] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 2E4, Canada
关键词
Convolutional neural network; Affective features; Emotional dimensions; RECOGNITION;
D O I
10.1016/j.patrec.2019.09.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic estimation of emotional state has been of great interest as emotion is an important component in user-oriented interactive technologies. This paper investigates the usage of feed-forward convolutional neural network (CNN) and features extracted from such networks for predicting dimensions of continuous-level emotional states. In this context, a two-stream CNN architecture wherein the video and audio data are learned simultaneously, is proposed. End-to-end mapping of audiovisual data to emotional dimensions reveals that the two-stream network performs better than its single-stream counterpart. The representations learned by the CNNs are refined through a minimum redundancy maximum relevance statistical selection method. Then, the support vector regression applied to selected CNN-based features estimates the instantaneous values of emotional dimensions. The proposed method is trained and tested using the audiovisual conversations of well-known RECOLA and SEMAINE databases. Experimentally it is verified that the regression of the CNN-based features outperforms the traditional audiovisual affective features as well as the end-to-end CNN mapping. Through generalization experiments, it is also observed that the learned representations are robust enough to provide an acceptable prediction performance, when the settings of training and testing datasets are widely different. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:290 / 297
页数:8
相关论文
共 50 条
  • [21] CNN-based severity prediction of neurodegenerative diseases using gait data
    Erdas, Cagatay Berke
    Sumer, Emre
    Kibaroglu, Seda
    DIGITAL HEALTH, 2022, 8
  • [22] CNN-based burned area mapping using radar and optical data
    Belenguer-Plomer, Miguel A.
    Tanase, Mihai A.
    Chuvieco, Emilio
    Bovolo, Francesca
    REMOTE SENSING OF ENVIRONMENT, 2021, 260
  • [23] CNN-based Note Onset Detection using Synthetic Data Augmentation
    Mounir, Mina
    Karsmakers, Peter
    van Waterschoot, Toon
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 171 - 175
  • [24] CNN-based features for retrieval and classification of food images
    Ciocca, Gianluigi
    Napoletano, Paolo
    Schettini, Raimondo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 176 : 70 - 77
  • [25] Learning CNN-based Features for Retrieval of Food Images
    Ciocca, Gianluigi
    Napoletano, Paolo
    Schettini, Raimondo
    NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2017, 2017, 10590 : 426 - 434
  • [26] CNN-Based Models for Emotion and Sentiment Analysis Using Speech Data
    Madan, Anjum
    Kumar, Devender
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (10)
  • [27] Lung Nodule Synthesis Using CNN-Based Latent Data Representation
    Oliveira, Dario Augusto Borges
    Viana, Matheus Palhares
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, 2018, 11037 : 111 - 118
  • [28] CNN-Based Local Features for Navigation Near an Asteroid
    Knuuttila, Olli
    Kestila, Antti
    Kallio, Esa
    IEEE ACCESS, 2024, 12 : 16652 - 16672
  • [29] Data Augmentation in CNN-based Periocular Authentication
    Dellana, Ryan
    Roy, Kaushik
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND MANAGEMENT (ICICM 2016), 2016, : 141 - 145
  • [30] CNN-based Channel Estimation using NOMA for mmWave Massive MIMO System
    Anu, T. S.
    Raveendran, Tara
    2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 349 - 353