Multi-stream CNN for facial expression recognition in limited training data

Cited by: 23
Authors
Aghamaleki, Javad Abbasi [1]
Chenarlogh, Vahid Ashkani [2]
Affiliations
[1] Damghan Univ, Fac Engn Dept, Damghan, Iran
[2] Islamic Azad Univ, Sci & Res Branch, ECE Dept, Tehran, Iran
Keywords
Facial expression recognition; Convolutional neural network; Limited data; Multi-stream structure; FACE;
DOI
10.1007/s11042-019-7530-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Training Convolutional Neural Networks (CNNs) with limited data is challenging, and acquiring a database at the required scale is not straightforward. In this paper, handcrafted features combined with a multi-stream structure are proposed to improve CNN performance when training data is limited. Three handcrafted features are extracted, using a local binary pattern (LBP) code extractor and the Sobel edge detection operator in the horizontal and vertical directions of the images, and are applied to the multi-stream CNN model. Our model is based on two distinct structures: a three-stream structure and a single-stream structure. The three-stream structure can be employed to improve the recognition rate of facial expression classifiers when the training data is limited. In the three-stream structure, each information channel is fed to its own stream. Furthermore, transfer learning is employed, and the behaviour of the VGG16 architecture trained with limited data is studied for comparison with the proposed method. In addition, the input data is expanded by means of rotation, cropping, and flipping. The three-stream and single-stream structures are then examined using both the limited and the expanded training data. We evaluate the system against state-of-the-art methods on the CK+ and MUG databases, with both limited and expanded data. The results indicate that, with limited data, the proposed strategy improves recognition accuracy (92.19 versus 88.95 on the CK+ database and 85.4 versus 82.5 on the MUG database). Additionally, performance improves in comparison with benchmark methods.
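The three handcrafted channels named in the abstract (an LBP code map plus Sobel responses in two directions) can be illustrated with a minimal sketch. This is not the authors' implementation: the 3x3 neighbourhood, the `>=` threshold rule, the bit ordering, the zeroed border, and every function name here (`filter3x3`, `lbp`, `handcrafted_channels`) are assumptions chosen for illustration, and the CNN streams themselves are not modelled.

```python
# Illustrative sketch only: basic 3x3 Sobel and 8-neighbour LBP on a
# grayscale image stored as a list of rows. Border pixels are left at 0
# (an assumption; the abstract does not say how borders are handled).

SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]   # gradient along the horizontal direction
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]   # gradient along the vertical direction

# Clockwise neighbour offsets (row, col), starting at the top-left pixel.
# The ordering is an arbitrary choice made for this sketch.
LBP_OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def filter3x3(img, kernel):
    """Cross-correlate img with a 3x3 kernel on interior pixels."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            out[r][c] = sum(
                kernel[i][j] * img[r + i - 1][c + j - 1]
                for i in range(3)
                for j in range(3)
            )
    return out


def lbp(img):
    """8-bit LBP code per interior pixel: a bit is set when neighbour >= centre."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            code = 0
            for bit, (dr, dc) in enumerate(LBP_OFFSETS):
                if img[r + dr][c + dc] >= img[r][c]:
                    code |= 1 << bit
            out[r][c] = code
    return out


def handcrafted_channels(img):
    """Return the three feature maps, one per (hypothetical) input stream."""
    return lbp(img), filter3x3(img, SOBEL_X), filter3x3(img, SOBEL_Y)


# Tiny example: a sharp intensity step between columns 1 and 2.
image = [
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [10, 10, 200, 200],
]
lbp_map, sobel_x_map, sobel_y_map = handcrafted_channels(image)
```

In a three-stream arrangement of the kind the abstract describes, each of the three maps would be passed to its own stream; a single-stream variant would presumably stack them as channels of one input, though the abstract does not spell this out.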
Pages: 22861-22882
Number of pages: 22