Masked autoencoder of multi-scale convolution strategy combined with knowledge distillation for facial beauty prediction

被引:0
|
作者
Gan, Junying [1 ]
Xiong, Junling [1 ]
机构
[1] Wuyi Univ, Sch Elect Informat Engn, Jiangmen 529020, Guangdong, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国国家自然科学基金;
关键词
D O I
10.1038/s41598-025-86831-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Facial beauty prediction (FBP) is a leading area of research in artificial intelligence. Currently, there is a small amount of labeled data and a large amount of unlabeled data in the FBP database. The features extracted by the model based on supervised training are limited, resulting in low prediction accuracy. Masked autoencoder (MAE) is a self-supervised learning method that outperforms supervised learning methods without relying on large-scale databases. The MAE can improve the feature extraction ability of the model effectively. The multi-scale convolution strategy can expand the receptive field and combine the attention mechanism of the MAE to capture the dependency between distant pixels and acquire shallow and deep image features. Knowledge distillation can take the abundant knowledge from the teacher net to the student net, reduce the number of parameters, and compress the model. In this paper, the MAE of the multi-scale convolution strategy is combined with knowledge distillation for FBP. First, the MAE model with a multi-scale convolution strategy is constructed and used in the teacher net for pretraining. Second, the MAE model is constructed for the student net. Finally, the teacher net performs knowledge distillation, and the student net receives the loss function transmitted from the teacher net for optimization. The experimental results show that the proposed method outperforms other methods on the FBP task, improves FBP accuracy, and can be widely applied in tasks such as image classification.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Using Multi-Scale Convolution Fusion and Memory-Augmented Adversarial Autoencoder to Detect Diverse Anomalies in Multivariate Time Series
    Ning, Zefei
    Miao, Hao
    Jiang, Zhuolun
    Wang, Li
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (01): : 234 - 246
  • [42] Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer
    Zhang, Han
    Li, Jun
    Tang, Rui
    Shi, Zhiping
    Bu, Aojie
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 38 - 43
  • [43] MD-GCN: A Multi-Scale Temporal Dual Graph Convolution Network for Traffic Flow Prediction
    Huang, Xiaohui
    Wang, Junyang
    Lan, Yuanchun
    Jiang, Chaojie
    Yuan, Xinhua
    SENSORS, 2023, 23 (02)
  • [44] Incremental rotating machinery fault diagnosis method based on multi-scale knowledge distillation and label smoothing
    Xia, Yifei
    Gao, Jun
    Shao, Xing
    Wang, Cuixiang
    Xiang, Jiawei
    Lin, Hang
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (04)
  • [45] Defect Detection in Freight Trains Using a Lightweight and Effective Multi-Scale Fusion Framework with Knowledge Distillation
    Ma, Ziqin
    Zhou, Shijie
    Lin, Chunyu
    ELECTRONICS, 2025, 14 (05):
  • [46] Multi-scale Internet Traffic Prediction Using Wavelet Neural Network Combined Model
    Chen Di
    Feng Hai-liang
    Lin Qing-jia
    Chen Chun-xiao
    2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
  • [47] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
    Changchen ZHAO
    Hongsheng WANG
    Yuanjing FENG
    虚拟现实与智能硬件(中英文), 2023, 5 (02) : 124 - 141
  • [48] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
    Zhao, Changchen
    Wang, Hongsheng
    Feng, Yuanjing
    Virtual Reality and Intelligent Hardware, 2023, 5 (02): : 124 - 141
  • [49] Facial Beauty Prediction Combined with Multi-Task Learning of Adaptive Sharing Policy and Attentional Feature Fusion
    Gan, Junying
    Luo, Heng
    Xiong, Junling
    Xie, Xiaoshan
    Li, Huicong
    Liu, Jianqiang
    ELECTRONICS, 2024, 13 (01)
  • [50] Study of Knowledge Fusion-Based Multi-Scale Convolution Neural Network on Feature Extraction in Hydrocracking Process
    Wang, Chen
    Luo, Wenshan
    Liu, Jianhua
    Lu, Pengfei
    Cao, Xiaohong
    Lan, Xinzhi
    Chen, Hanbing
    Shiyou Xuebao, Shiyou Jiagong/Acta Petrolei Sinica (Petroleum Processing Section), 2023, 39 (03): : 532 - 543