Masked autoencoder of multi-scale convolution strategy combined with knowledge distillation for facial beauty prediction

被引:0
|
作者
Gan, Junying [1 ]
Xiong, Junling [1 ]
机构
[1] Wuyi Univ, Sch Elect Informat Engn, Jiangmen 529020, Guangdong, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国国家自然科学基金;
关键词
D O I
10.1038/s41598-025-86831-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Facial beauty prediction (FBP) is a leading area of research in artificial intelligence. Currently, there is a small amount of labeled data and a large amount of unlabeled data in the FBP database. The features extracted by the model based on supervised training are limited, resulting in low prediction accuracy. Masked autoencoder (MAE) is a self-supervised learning method that outperforms supervised learning methods without relying on large-scale databases. The MAE can improve the feature extraction ability of the model effectively. The multi-scale convolution strategy can expand the receptive field and combine the attention mechanism of the MAE to capture the dependency between distant pixels and acquire shallow and deep image features. Knowledge distillation can take the abundant knowledge from the teacher net to the student net, reduce the number of parameters, and compress the model. In this paper, the MAE of the multi-scale convolution strategy is combined with knowledge distillation for FBP. First, the MAE model with a multi-scale convolution strategy is constructed and used in the teacher net for pretraining. Second, the MAE model is constructed for the student net. Finally, the teacher net performs knowledge distillation, and the student net receives the loss function transmitted from the teacher net for optimization. The experimental results show that the proposed method outperforms other methods on the FBP task, improves FBP accuracy, and can be widely applied in tasks such as image classification.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Rolling Bearing Health Indicator Extraction and RUL Prediction Based on Multi-Scale Convolutional Autoencoder
    Ye, Zijian
    Zhang, Qiang
    Shao, Siyu
    Niu, Tianlin
    Zhao, Yuwei
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [32] MULTI-SCALE AND MULTI-REGION FACIAL DISCRIMINATIVE REPRESENTATION FOR AUTOMATIC DEPRESSION LEVEL PREDICTION
    Niu, Mingyue
    Tao, Jianhua
    Liu, Bin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1325 - 1329
  • [33] DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation
    Fang, Houzhang
    Liao, Zikai
    Wang, Lu
    Li, Qingshan
    Chang, Yi
    Yan, Luxin
    Wang, Xuhua
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2121 - 2130
  • [34] Optimization of process-specific catalytic packing in catalytic distillation process: A multi-scale strategy
    Wang, Qinglian
    Yang, Chen
    Wang, Hongxing
    Qiu, Ting
    CHEMICAL ENGINEERING SCIENCE, 2017, 174 : 472 - 486
  • [35] Multimodal hate speech detection via multi-scale visual kernels and knowledge distillation architecture
    Chhabra, Anusha
    Vishwakarma, Dinesh Kumar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [36] A rolling bearing fault diagnosis method based on multi-scale knowledge distillation and continual learning
    Xia, Yifei
    Gao, Jun
    Shao, Xing
    Wang, Cuixiang
    Zhendong yu Chongji/Journal of Vibration and Shock, 2024, 43 (12): : 276 - 285
  • [37] A Multi-scale Convolution and Gated Recurrent Unit Based Network for Limit Order Book Prediction
    Xu, Borui
    Zhang, Tong
    Liu, Weiguo
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 71 - 84
  • [38] MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction
    Dang, Lingwei
    Nie, Yongwei
    Long, Chengjiang
    Zhang, Qing
    Li, Guiqing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11447 - 11456
  • [39] A Multi-Scale Convolutional Neural Network with Self-Knowledge Distillation for Bearing Fault Diagnosis
    Yu, Jiamao
    Hu, Hexuan
    MACHINES, 2024, 12 (11)
  • [40] Multi-scale combined prediction model of dissolved oxygen based on EEMD and ELM
    Li, Zhenbo (zhenboli@126.com), 1600, Asian Association for Agricultural Engineering (26):