Deep Multi-Input Multi-Stream Ordinal Model for age estimation: Based on spatial attention learning

被引:4
|
作者
Kong, Chang [1 ,2 ]
Wang, Haitao [1 ]
Luo, Qiuming [1 ]
Mao, Rui [1 ]
Chen, Guoliang [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[2] SUNIQUECo Ltd, Shenzhen, Peoples R China
关键词
D2MO; Spatial attention; Multi -hot vector; Age estimation; Multi-input; Multi-stream;
D O I
10.1016/j.future.2022.10.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Face aging process is non-stationary since human matures in different ways. This property makes age estimation is an attractive and challenging research topic in the computer vision community. Most of previous work conventionally estimate age from the center area of the aligned face image. However, these methods ignore spatial context information and cannot pay attention to particular domain features due to the uncertainty in deep learning. In this work, we propose a novel Deep Multi-Input Multi-Stream Ordinal (D2MO) Model for facial age estimation, which learns deep fusion feature through a specific spatial attention mechanism. Our approach is motivated by the observations that there are some universal changes, like hair color turning to white and wrinkles increasing, for individuals during aging process. In order to focus these spatial features, our D2MO uses four scales of receptive fields for global and contextual feature learning, and meanwhile, four cropped face patches are utilized for local and detailed feature extraction. Benefiting from a multi-stream CNN architecture, differentiated feature maps are learned separately through each branch and then aggregated together by concatenate layer. We also introduce a novel representation for age label using a multi-hot vector and the final predicted age can be calculated by summing the vector. This representation cast age estimation task to solve a series of binary classification subproblems which is easier to learn and more consistent with human cognition rather than to regress a single age value directly. Finally, we employ a joint training loss to supervise our model to learn the ordinal ranking, label distribution and regression information simultaneously. Extensive experiments show that our D2MO model significantly outperforms other state-of-the-art age estimation methods on MORPH II, FG-NET and UAGD datasets.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [21] Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis
    Coto-Jimenez, Marvin
    BIOMIMETICS, 2021, 6 (01) : 1 - 15
  • [22] MINA: Multi-Input Network Augmentation for Enhancing Tiny Deep Learning
    Ha, Seokhyeon
    Kim, Yeongmo
    Lee, Jungwoo
    IEEE ACCESS, 2023, 11 : 106289 - 106298
  • [23] Multi-channel multi-task optical performance monitoring based multi-input multi-output deep learning and transfer learning for SDM
    Yang, Shuailong
    Yang, Liu
    Luo, Fengguang
    Wang, Xiaobo
    Li, Bin
    Du, Yuting
    Liu, Deming
    OPTICS COMMUNICATIONS, 2021, 495
  • [24] Part-based Multi-stream Model for Vehicle Searching
    Sun, Ya
    Li, Minxian
    Lu, Jianfeng
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1372 - 1377
  • [25] Elderly fall detection based on multi-stream deep convolutional networks
    Chadia Khraief
    Faouzi Benzarti
    Hamid Amiri
    Multimedia Tools and Applications, 2020, 79 : 19537 - 19560
  • [26] Elderly fall detection based on multi-stream deep convolutional networks
    Khraief, Chadia
    Benzarti, Faouzi
    Amiri, Hamid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19537 - 19560
  • [27] Visual Object Tracking Via Multi-Stream Deep Similarity Learning Networks
    Li, Kunpeng
    Kong, Yu
    Fu, Yun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3311 - 3320
  • [28] A novel multi-stream method for violent interaction detection using deep learning
    Li, Hongchang
    Wang, Jing
    Han, Jianjun
    Zhang, Jinmin
    Yang, Yushan
    Zhao, Yue
    MEASUREMENT & CONTROL, 2020, 53 (5-6): : 796 - 806
  • [29] Deep-Learning-Based Amplitude Variation with Angle Inversion with Multi-Input Neural Networks
    Tao, Shiping
    Guo, Yintong
    Huang, Haoyong
    Li, Junfeng
    Chen, Liqing
    Gui, Junchuan
    Zhao, Guokai
    PROCESSES, 2024, 12 (10)
  • [30] Multi-Input Autonomous Driving Based on Deep Reinforcement Learning With Double Bias Experience Replay
    Cui, Jianping
    Yuan, Liang
    He, Li
    Xiao, Wendong
    Ran, Teng
    Zhang, Jianbo
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 11253 - 11261