LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR PREDICTING FACE ATTRIBUTES IN THE WILD

Cited by: 0
Authors
Zhong, Yang [1]
Sullivan, Josephine [1]
Li, Haibo [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
Keywords
deep learning; mid-level deep representation; face attribute prediction; face recognition
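DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract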
Predicting facial attributes from faces in the wild is very challenging due to pose and lighting variations in the real world. The key to this problem is to build feature representations that cope with these unfavourable conditions. Given the success of Convolutional Neural Networks (CNNs) in image classification, high-level CNN features, as an intuitive and reasonable choice, have been widely used for this problem. In this paper, however, we consider mid-level CNN features as an alternative to the high-level ones for attribute prediction. This is based on the observation that face attributes differ in nature: some are locally oriented while others are globally defined. Our investigations reveal that the mid-level deep representations achieve higher prediction accuracy than the (fine-tuned) high-level abstractions. We empirically demonstrate that the mid-level representations achieve state-of-the-art prediction performance on the CelebA and LFWA datasets. Our investigations also show that, by using the mid-level representations, one can employ a single deep network for both face recognition and attribute prediction.
Pages: 3239-3243
Number of pages: 5