Estimation of affective dimensions using CNN-based features of audiovisual data

被引:2
|
作者
Basnet, Ramesh [1 ]
Islam, Mohammad Tariqul [2 ]
Howlader, Tamanna [3 ]
Rahman, S. M. Mahbubur [2 ]
Hatzinakos, Dimitrios [4 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1205, Bangladesh
[3] Univ Dhaka, Inst Stat Res & Training, Dhaka 1000, Bangladesh
[4] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 2E4, Canada
关键词
Convolutional neural network; Affective features; Emotional dimensions; RECOGNITION;
D O I
10.1016/j.patrec.2019.09.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic estimation of emotional state has been of great interest as emotion is an important component in user-oriented interactive technologies. This paper investigates the usage of feed-forward convolutional neural network (CNN) and features extracted from such networks for predicting dimensions of continuous-level emotional states. In this context, a two-stream CNN architecture wherein the video and audio data are learned simultaneously, is proposed. End-to-end mapping of audiovisual data to emotional dimensions reveals that the two-stream network performs better than its single-stream counterpart. The representations learned by the CNNs are refined through a minimum redundancy maximum relevance statistical selection method. Then, the support vector regression applied to selected CNN-based features estimates the instantaneous values of emotional dimensions. The proposed method is trained and tested using the audiovisual conversations of well-known RECOLA and SEMAINE databases. Experimentally it is verified that the regression of the CNN-based features outperforms the traditional audiovisual affective features as well as the end-to-end CNN mapping. Through generalization experiments, it is also observed that the learned representations are robust enough to provide an acceptable prediction performance, when the settings of training and testing datasets are widely different. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:290 / 297
页数:8
相关论文
共 50 条
  • [1] Statistical Selection of CNN-Based Audiovisual Features for Instantaneous Estimation of Human Emotional States
    Basnet, Ramesh
    Islam, Mohammad Tariqul
    Howlader, Tamanna
    Rahman, S. M. Mahbubur
    Hatzinakos, Dimitrios
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 50 - 54
  • [2] A CNN-Based Model for the Estimation of Vertical Scale of Fluctuation Using CPT Data
    Sharma, P.
    Pain, Anindya
    INDIAN GEOTECHNICAL JOURNAL, 2024, 54 (04) : 1271 - 1285
  • [3] Object Viewpoint Estimation using CNN-based Classifier
    Bong, Eunsoo
    Lee, Eunho
    Hwang, Youngbae
    2022 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON22), 2022, : 80 - 85
  • [4] CNN-based Rescaling Factor Estimation
    Liu, Chang
    Kirchner, Matthias
    IH&MMSEC '19: PROCEEDINGS OF THE ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, 2019, : 119 - 124
  • [5] Hybrid Facial Emotion Recognition Using CNN-Based Features
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Akram, Sheeraz
    Alhajlah, Mousa
    Mahmood, Awais
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [6] CNN-BASED INITIAL BACKGROUND ESTIMATION
    Halfaoui, Ibrahim
    Bouzaraa, Fahd
    Urfalioglu, Onay
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 101 - 106
  • [7] Rice yield estimation using a CNN-based image-driven data assimilation framework
    Han, Jingye
    Shi, Liangsheng
    Yang, Qi
    Chen, Zhuowei
    Yu, Jin
    Zha, Yuanyuan
    FIELD CROPS RESEARCH, 2022, 288
  • [8] Time Delay Estimation for Speaker Localization Using CNN-Based Parametrized GCC-PHAT Features
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    INTERSPEECH 2021, 2021, : 1479 - 1483
  • [9] Multi-target tracking using CNN-based features: CNNMTT
    Nima Mahmoudi
    Seyed Mohammad Ahadi
    Mohammad Rahmati
    Multimedia Tools and Applications, 2019, 78 : 7077 - 7096
  • [10] IMPROVING CNN-BASED VISEME RECOGNITION USING SYNTHETIC DATA
    Mattos, Andrea Britto
    Borges Oliveira, Dario Augusto
    Morais, Edmilson da Silva
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,