Convolutional Features-Based Broad Learning With LSTM for Multidimensional Facial Emotion Recognition in Human-Robot Interaction

被引:3
|
作者
Chen, Luefeng [1 ,2 ]
Li, Min [1 ,2 ]
Wu, Min [1 ,2 ]
Pedrycz, Witold [3 ,4 ,5 ]
Hirota, Kaoru [6 ]
机构
[1] China Univ Geosci, Sch Automat, Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Engn Res Ctr Intelligent Technol Geoexplorat, Minist Educ, Wuhan 430074, Peoples R China
[3] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2R3, Canada
[4] Polish Acad Sci, Syst Res Inst, PL-00901 Warsaw, Poland
[5] Istinye Univ, Dept Comp Engn, TR-34396 Sariyer Istanbul, Turkiye
[6] Tokyo Inst Technol, Tokyo 2268502, Japan
基金
中国国家自然科学基金;
关键词
emotion recognition; human-robot interaction; long short-term memory (LSTM); EXPRESSION RECOGNITION; NETWORK; REGRESSION; FRAMEWORK; SYSTEM;
D O I
10.1109/TSMC.2023.3301001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional feature-based broad learning with long short-term memory (CBLSTM) is proposed to recognize multidimensional facial emotions in human-robot interaction. The CBLSTM model consists of convolution and pooling layers, broad learning (BL), and long-and short-term memory network. It aims to obtain the depth, width, and time scale information of facial emotion through three parts of the model, so as to realize multidimensional facial emotion recognition. CBLSTM adopts the structure of BL after processing was done at the convolution and pooling layer to replace the original random mapping method and extract features with more representation ability, which significantly reduces the computational time of the facial emotion recognition network. Moreover, we adopted incremental learning, which can quickly reconstruct the model without a complete retraining process. Experiments on three databases are developed, including CK+, MMI, and SFEW2.0 databases. The experimental results show that the proposed CBLSTM model using multidimensional information produces higher recognition accuracy than that without time scale information. It is 1.30% higher on the CK+ database and 1.06% higher on the MMI database. The computation time is 9.065 s, which is significantly shorter than the time reported for the convolutional neural network (CNN). In addition, the proposed method obtains improvement compared to the state-of-the-art methods. It improves the recognition rate by 3.97%, 1.77%, and 0.17% compared to that of CNN-SIPS, HOG-TOP, and CMACNN in the CK+ database, 5.17%, 5.14%, and 3.56% compared to TLMOS, ALAW, and DAUGN in the MMI database, and 7.08% and 2.98% compared to CNNVA and QCNN in the SFEW2.0 database.
引用
收藏
页码:64 / 75
页数:12
相关论文
共 50 条
  • [41] Human-robot interaction based on gesture and movement recognition
    Li, Xing
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 81
  • [42] Approach to hand posture recognition based on hand shape features for human-robot interaction
    Qi, Jing
    Xu, Kun
    Ding, Xilun
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (04) : 2825 - 2842
  • [43] Learning Speaker Recognition Models through Human-Robot Interaction
    Martinson, E.
    Lawson, W.
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [44] Learning Multimodal Confidence for Intention Recognition in Human-Robot Interaction
    Zhao, Xiyuan
    Li, Huijun
    Miao, Tianyuan
    Zhu, Xianyi
    Wei, Zhikai
    Tan, Lifen
    Song, Aiguo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7819 - 7826
  • [45] Feedback Interpretation based on Facial Expressions in Human-Robot Interaction
    Lang, Christian
    Hanheide, Marc
    Lohse, Manja
    Wersing, Heiko
    Sagerer, Gerhard
    RO-MAN 2009: THE 18TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1 AND 2, 2009, : 915 - +
  • [46] Speech Emotion Recognition Using an Enhanced Kernel Isomap for Human-Robot Interaction
    Zhang, Shiqing
    Zhao, Xiaoming
    Lei, Bicheng
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [47] Speech emotion recognition in real static and dynamic human-robot interaction scenarios
    Grageda, Nicolas
    Busso, Carlos
    Alvarado, Eduardo
    Garcia, Ricardo
    Mahu, Rodrigo
    Huenupan, Fernando
    Yoma, Nestor Becerra
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [48] On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
    Lakomkin, Egor
    Zamani, Mohammad Ali
    Weber, Cornelius
    Magg, Sven
    Wermter, Stefan
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 854 - 860
  • [49] Interactive Emotion Recognition Using Support Vector Machine for Human-Robot Interaction
    Tsai, Ching-Chih
    Chen, You-Zhu
    Liao, Ching-Wen
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 407 - 412
  • [50] Facial Communicative SignalsValence Recognition in Task-Oriented Human-Robot Interaction
    Christian Lang
    Sven Wachsmuth
    Marc Hanheide
    Heiko Wersing
    International Journal of Social Robotics, 2012, 4 : 249 - 262