LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR PREDICTING FACE ATTRIBUTES IN THE WILD

Cited by: 0
Authors
Zhong, Yang [1]
Sullivan, Josephine [1]
Li, Haibo [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
Keywords
deep learning; mid-level deep representation; face attribute prediction; face recognition
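DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract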
Predicting facial attributes from faces in the wild is very challenging due to pose and lighting variations in the real world. The key to this problem is to build proper feature representations that cope with these unfavourable conditions. Given the success of Convolutional Neural Networks (CNNs) in image classification, high-level CNN features, as an intuitive and reasonable choice, have been widely used for this problem. In this paper, however, we consider mid-level CNN features as an alternative to the high-level ones for attribute prediction. This is based on the observation that face attributes differ in nature: some are locally oriented while others are globally defined. Our investigations reveal that the mid-level deep representations achieve higher prediction accuracy than the (fine-tuned) high-level abstractions. We empirically demonstrate that the mid-level representations achieve state-of-the-art prediction performance on the CelebA and LFWA datasets. Our investigations also show that, by using the mid-level representations, one can employ a single deep network for both face recognition and attribute prediction.
Pages: 3239-3243
Number of pages: 5