Deep Multi-Input Multi-Stream Ordinal Model for age estimation: Based on spatial attention learning

被引：4

作者：

Kong, Chang ^{[1
,2
]}

Wang, Haitao ^{[1
]}

Luo, Qiuming ^{[1
]}

Mao, Rui ^{[1
]}

Chen, Guoliang ^{[1
]}

机构：

[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China

[2] SUNIQUECo Ltd, Shenzhen, Peoples R China

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023年 / 140卷

关键词：

D2MO; Spatial attention; Multi -hot vector; Age estimation; Multi-input; Multi-stream;

D O I：

10.1016/j.future.2022.10.009

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Face aging process is non-stationary since human matures in different ways. This property makes age estimation is an attractive and challenging research topic in the computer vision community. Most of previous work conventionally estimate age from the center area of the aligned face image. However, these methods ignore spatial context information and cannot pay attention to particular domain features due to the uncertainty in deep learning. In this work, we propose a novel Deep Multi-Input Multi-Stream Ordinal (D2MO) Model for facial age estimation, which learns deep fusion feature through a specific spatial attention mechanism. Our approach is motivated by the observations that there are some universal changes, like hair color turning to white and wrinkles increasing, for individuals during aging process. In order to focus these spatial features, our D2MO uses four scales of receptive fields for global and contextual feature learning, and meanwhile, four cropped face patches are utilized for local and detailed feature extraction. Benefiting from a multi-stream CNN architecture, differentiated feature maps are learned separately through each branch and then aggregated together by concatenate layer. We also introduce a novel representation for age label using a multi-hot vector and the final predicted age can be calculated by summing the vector. This representation cast age estimation task to solve a series of binary classification subproblems which is easier to learn and more consistent with human cognition rather than to regress a single age value directly. Finally, we employ a joint training loss to supervise our model to learn the ordinal ranking, label distribution and regression information simultaneously. Extensive experiments show that our D2MO model significantly outperforms other state-of-the-art age estimation methods on MORPH II, FG-NET and UAGD datasets.(c) 2022 Elsevier B.V. All rights reserved.

引用

页码：173 / 184

页数：12

共 50 条

[21] Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis
Coto-Jimenez, Marvin
BIOMIMETICS, 2021, 6 (01) : 1 - 15
[22] MINA: Multi-Input Network Augmentation for Enhancing Tiny Deep Learning
Ha, Seokhyeon
Kim, Yeongmo
Lee, Jungwoo
IEEE ACCESS, 2023, 11 : 106289 - 106298
[23] Multi-channel multi-task optical performance monitoring based multi-input multi-output deep learning and transfer learning for SDM
Yang, Shuailong
Yang, Liu
Luo, Fengguang
Wang, Xiaobo
Li, Bin
Du, Yuting
Liu, Deming
OPTICS COMMUNICATIONS, 2021, 495
[24] Part-based Multi-stream Model for Vehicle Searching
Sun, Ya
Li, Minxian
Lu, Jianfeng
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1372 - 1377
[25] Elderly fall detection based on multi-stream deep convolutional networks
Chadia Khraief
Faouzi Benzarti
Hamid Amiri
Multimedia Tools and Applications, 2020, 79 : 19537 - 19560
[26] Elderly fall detection based on multi-stream deep convolutional networks
Khraief, Chadia
Benzarti, Faouzi
Amiri, Hamid
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19537 - 19560
[27] Visual Object Tracking Via Multi-Stream Deep Similarity Learning Networks
Li, Kunpeng
Kong, Yu
Fu, Yun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3311 - 3320
[28] A novel multi-stream method for violent interaction detection using deep learning
Li, Hongchang
Wang, Jing
Han, Jianjun
Zhang, Jinmin
Yang, Yushan
Zhao, Yue
MEASUREMENT & CONTROL, 2020, 53 (5-6): : 796 - 806
[29] Deep-Learning-Based Amplitude Variation with Angle Inversion with Multi-Input Neural Networks
Tao, Shiping
Guo, Yintong
Huang, Haoyong
Li, Junfeng
Chen, Liqing
Gui, Junchuan
Zhao, Guokai
PROCESSES, 2024, 12 (10)
[30] Multi-Input Autonomous Driving Based on Deep Reinforcement Learning With Double Bias Experience Replay
Cui, Jianping
Yuan, Liang
He, Li
Xiao, Wendong
Ran, Teng
Zhang, Jianbo
IEEE SENSORS JOURNAL, 2023, 23 (11) : 11253 - 11261

← 1 2 3 4 5 →