Deep Multi-Input Multi-Stream Ordinal Model for age estimation: Based on spatial attention learning

被引:4
|
作者
Kong, Chang [1 ,2 ]
Wang, Haitao [1 ]
Luo, Qiuming [1 ]
Mao, Rui [1 ]
Chen, Guoliang [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[2] SUNIQUECo Ltd, Shenzhen, Peoples R China
关键词
D2MO; Spatial attention; Multi -hot vector; Age estimation; Multi-input; Multi-stream;
D O I
10.1016/j.future.2022.10.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Face aging process is non-stationary since human matures in different ways. This property makes age estimation is an attractive and challenging research topic in the computer vision community. Most of previous work conventionally estimate age from the center area of the aligned face image. However, these methods ignore spatial context information and cannot pay attention to particular domain features due to the uncertainty in deep learning. In this work, we propose a novel Deep Multi-Input Multi-Stream Ordinal (D2MO) Model for facial age estimation, which learns deep fusion feature through a specific spatial attention mechanism. Our approach is motivated by the observations that there are some universal changes, like hair color turning to white and wrinkles increasing, for individuals during aging process. In order to focus these spatial features, our D2MO uses four scales of receptive fields for global and contextual feature learning, and meanwhile, four cropped face patches are utilized for local and detailed feature extraction. Benefiting from a multi-stream CNN architecture, differentiated feature maps are learned separately through each branch and then aggregated together by concatenate layer. We also introduce a novel representation for age label using a multi-hot vector and the final predicted age can be calculated by summing the vector. This representation cast age estimation task to solve a series of binary classification subproblems which is easier to learn and more consistent with human cognition rather than to regress a single age value directly. Finally, we employ a joint training loss to supervise our model to learn the ordinal ranking, label distribution and regression information simultaneously. Extensive experiments show that our D2MO model significantly outperforms other state-of-the-art age estimation methods on MORPH II, FG-NET and UAGD datasets.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [31] Parameter estimation for multirate multi-input systems using auxiliary model and multi-innovation
    Han, Lili
    Ding, Feng
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2010, 21 (06) : 1079 - 1083
  • [32] Accelerating Estimation of a Multi-Input Multi-Output Model of the Hippocampus with a Parallel Computing Strategy
    She, Xiwei
    Robinson, Brian S.
    Berger, Theodore W.
    Song, Dong
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 2479 - 2482
  • [33] Parameter estimation for multirate multi-input systems using auxiliary model and multi-innovation
    Lili Han and Feng Ding School of IoT Engineering
    JournalofSystemsEngineeringandElectronics, 2010, 21 (06) : 1079 - 1083
  • [34] QualityNet: A multi-stream fusion framework with spatial and channel attention for blind image quality assessment
    Aslam, Muhammad Azeem
    Wei, Xu
    Khalid, Hassan
    Ahmed, Nisar
    Zhu, Shuangtong
    Liu, Xin
    Xu, Yimei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [35] Multi-scale spatial-spectral fusion based on multi-input fusion calculation and coordinate attention for hyperspectral image classification
    Yang, Lina
    Zhang, Fengqi
    Wang, Patrick Shen-Pei
    Li, Xichun
    Meng, Zuqiang
    PATTERN RECOGNITION, 2022, 122
  • [36] Self-Attention-Based Convolutional Parallel Network: An Efficient Multi-Input Deep Learning Model for Endpoint Prediction of High-Carbon BOF Steelmaking
    Xie, Tian-yi
    Zhang, Fei
    Li, Yi-ren
    Zhang, Quan
    Wang, Yan-wei
    Shang, Hao
    METALLURGICAL AND MATERIALS TRANSACTIONS B-PROCESS METALLURGY AND MATERIALS PROCESSING SCIENCE, 2024, 55 (06): : 4271 - 4290
  • [37] Health indicator construction method of multi-input hybrid deep learning network
    Sun S.
    Zhang G.
    Tian F.
    Liang W.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (10): : 2390 - 2398
  • [38] Model-less multi-input analysis of pulmonary blood flow using deep learning convolution
    Saka, Tomoki
    Iwasawa, Tae
    Tsuzuki, Marcos S. G.
    IFAC JOURNAL OF SYSTEMS AND CONTROL, 2024, 29
  • [39] Ingredient analysis of biological wastewater using hybrid multi-stream deep learning framework
    Bakht, Ahtesham
    Nawaz, Alam
    Lee, Moonyong
    Lee, Hyunsoo
    COMPUTERS & CHEMICAL ENGINEERING, 2022, 168
  • [40] Ventricular Arrhythmia Classification Using Similarity Maps and Hierarchical Multi-Stream Deep Learning
    Lin, Qing
    Oglic, Dino
    Curtis, Michael J.
    Lam, Hak-Keung
    Cvetkovic, Zoran
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2025, 72 (03) : 1148 - 1159