Estimation of affective dimensions using CNN-based features of audiovisual data

被引:2
|
作者
Basnet, Ramesh [1 ]
Islam, Mohammad Tariqul [2 ]
Howlader, Tamanna [3 ]
Rahman, S. M. Mahbubur [2 ]
Hatzinakos, Dimitrios [4 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1205, Bangladesh
[3] Univ Dhaka, Inst Stat Res & Training, Dhaka 1000, Bangladesh
[4] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 2E4, Canada
关键词
Convolutional neural network; Affective features; Emotional dimensions; RECOGNITION;
D O I
10.1016/j.patrec.2019.09.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic estimation of emotional state has been of great interest as emotion is an important component in user-oriented interactive technologies. This paper investigates the usage of feed-forward convolutional neural network (CNN) and features extracted from such networks for predicting dimensions of continuous-level emotional states. In this context, a two-stream CNN architecture wherein the video and audio data are learned simultaneously, is proposed. End-to-end mapping of audiovisual data to emotional dimensions reveals that the two-stream network performs better than its single-stream counterpart. The representations learned by the CNNs are refined through a minimum redundancy maximum relevance statistical selection method. Then, the support vector regression applied to selected CNN-based features estimates the instantaneous values of emotional dimensions. The proposed method is trained and tested using the audiovisual conversations of well-known RECOLA and SEMAINE databases. Experimentally it is verified that the regression of the CNN-based features outperforms the traditional audiovisual affective features as well as the end-to-end CNN mapping. Through generalization experiments, it is also observed that the learned representations are robust enough to provide an acceptable prediction performance, when the settings of training and testing datasets are widely different. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:290 / 297
页数:8
相关论文
共 50 条
  • [31] Modulation Format Recognition and OSNR Estimation Using CNN-Based Deep Learning
    Wang, Danshi
    Zhang, Min
    Li, Ze
    Li, Jin
    Fu, Meixia
    Cui, Yue
    Chen, Xue
    IEEE PHOTONICS TECHNOLOGY LETTERS, 2017, 29 (19) : 1667 - 1670
  • [32] Image dehazing with scattering coefficient estimation using cnn-based image regression
    Chung W.Y.
    Kim S.Y.
    Park C.G.
    Kang C.H.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (11) : 890 - 896
  • [33] CNN-based estimation of heading direction of vehicle using automotive radar sensor
    Lim, Sohee
    Jung, Jaehoon
    Lee, Byeong-ho
    Kim, Seong-Cheol
    Lee, Seongwook
    IET RADAR SONAR AND NAVIGATION, 2021, 15 (06): : 618 - 626
  • [34] Handling Object Symmetries in CNN-based Pose Estimation
    Richter-Klug, Jesse
    Frese, Udo
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13850 - 13856
  • [35] Blood Pressure Estimation with Phonocardiogram on CNN-Based Approach
    Kokkhunthod, Kasidit
    Phapatanaburi, Khomdet
    Pathonsuwan, Wongsathon
    Jumphoo, Talit
    Anchuen, Patikorn
    Nimkuntod, Porntip
    Uthansakul, Monthippa
    Uthansakul, Peerapong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 1775 - 1794
  • [36] A Survey of CNN-Based Techniques for Scene Flow Estimation
    Muthu, Sundaram
    Tennakoon, Ruwan
    Hoseinnezhad, Reza
    Bab-Hadiashar, Alireza
    IEEE ACCESS, 2023, 11 : 99289 - 99303
  • [37] CNN-SkelPose: a CNN-based skeleton estimation algorithm for clinical applications
    Zavala-Mondragon, Luis A.
    Lamichhane, Bishal
    Zhang, Lu
    de Haan, Gerard
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (06) : 2369 - 2380
  • [38] CNN-SkelPose: a CNN-based skeleton estimation algorithm for clinical applications
    Luis A. Zavala-Mondragon
    Bishal Lamichhane
    Lu Zhang
    Gerard de Haan
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 2369 - 2380
  • [39] USING CNN-BASED HIGH-LEVEL FEATURES FOR REMOTE SENSING SCENE CLASSIFICATION
    Fang, Zhengzheng
    Li, Wei
    Zou, Jinyi
    Du, Qian
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2610 - 2613
  • [40] Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
    El-Shafie, Al-Hussein A.
    Zaki, Mohamed
    Habib, S. E. D.
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1476 - 1481