Multi-stream CNN for facial expression recognition in limited training data

Cited by: 23
Authors
Aghamaleki, Javad Abbasi [1]
Chenarlogh, Vahid Ashkani [2]
Affiliations
[1] Damghan Univ, Fac Engn Dept, Damghan, Iran
[2] Islamic Azad Univ, Sci & Res Branch, ECE Dept, Tehran, Iran
Keywords
Facial expression recognition; Convolutional neural network; Limited data; Multi-stream structure; FACE;
DOI
10.1007/s11042-019-7530-7
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Limited training data is a challenging problem when training Convolutional Neural Networks (CNNs), and acquiring a database at the required scale is not a straightforward task. In this paper, handcrafted features combined with a multi-stream structure are proposed as a way to improve CNN performance when training data is limited. Three handcrafted features are extracted from the images, using a local binary pattern code extractor and the Sobel edge detection operator in the horizontal and vertical directions, and are fed to the multi-stream CNN model. Our model is based on two distinct structures: a three-stream structure and a single-stream structure. The three-stream structure can be employed to improve the recognition rate of facial expression classifiers when the training data is limited. In the three-stream structure, each information channel is fed to its own stream separately. Furthermore, transfer learning and the behaviour of the VGG16 architecture trained with limited data are studied for comparison with the proposed method. In addition, the input data is expanded by means of rotation, cropping, and flipping. The three-stream and single-stream structures are then examined with both limited and expanded training data. We evaluate the system against state-of-the-art methods on the CK+ and MUG databases, in both the limited-data and expanded-data settings. The results indicate that, with limited data, recognition accuracy is improved by the proposed strategy (92.19 versus 88.95 on the CK+ database and 85.4 versus 82.5 on the MUG database). Additionally, performance is improved in comparison with benchmark methods.
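The three handcrafted inputs named in the abstract (a local binary pattern map plus horizontal and vertical Sobel responses) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the 3x3 neighbourhood, the bit ordering of the LBP code, the border clamping, and every function name below are assumptions made for illustration only.

```python
# Illustrative sketch only: names, LBP bit order, and border handling are assumptions.

# 8 neighbours of a pixel, clockwise from the top-left (arbitrary bit order).
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]

# Standard 3x3 Sobel kernels (applied as cross-correlation).
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]    # responds to horizontal intensity change
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]    # responds to vertical intensity change


def _pixel(image, r, c):
    """Read a pixel, clamping coordinates to the image border."""
    rows, cols = len(image), len(image[0])
    return image[min(max(r, 0), rows - 1)][min(max(c, 0), cols - 1)]


def lbp_map(image):
    """Basic 8-neighbour local binary pattern code for every pixel."""
    out = []
    for r in range(len(image)):
        row = []
        for c in range(len(image[0])):
            center = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(NEIGHBOURS):
                # Set the bit when the neighbour is at least as bright as the center.
                if _pixel(image, r + dr, c + dc) >= center:
                    code |= 1 << bit
            row.append(code)
        out.append(row)
    return out


def sobel_map(image, kernel):
    """Apply a 3x3 kernel at every pixel of a grayscale image."""
    out = []
    for r in range(len(image)):
        row = []
        for c in range(len(image[0])):
            total = 0
            for i in range(3):
                for j in range(3):
                    total += kernel[i][j] * _pixel(image, r + i - 1, c + j - 1)
            row.append(total)
        out.append(row)
    return out


def handcrafted_features(image):
    """Return the three maps separately, one per stream of a three-stream model."""
    return lbp_map(image), sobel_map(image, SOBEL_X), sobel_map(image, SOBEL_Y)


if __name__ == "__main__":
    # Toy grayscale image with a vertical step edge between columns 2 and 3.
    toy = [[0, 0, 0, 1, 1, 1] for _ in range(5)]
    lbp, grad_x, grad_y = handcrafted_features(toy)
    print(grad_x[2])
    print(grad_y[2])
    print(lbp[2])
```

The three maps are returned separately rather than stacked, mirroring the abstract's statement that each information channel goes to its own stream; a single-stream variant would presumably combine them into one multi-channel input.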
Pages: 22861-22882
Page count: 22