Lightweight parser-free virtual try-on based on mixed knowledge distillation and feature enhancement techniques

被引:0
|
作者
Hou, Jue [1 ,2 ]
Ding, Huan [1 ]
Yang, Yang [1 ,2 ]
Lu, Yinwen [1 ]
Yu, Lingjie [3 ]
Liu, Zheng [2 ,4 ]
机构
[1] School of Fashion Design & Engineering, Zhejiang Sci-Tech University, Zhejiang, Hangzhou,310018, China
[2] Key Laboratory of Silk Culture Inheritance and Digital Technology of Product Design, Ministry of Culture and Tourism, Zhejiang, Hangzhou,310018, China
[3] School of Textile Science and Engineering, Xi'an Polytechnic University, Shaanxi, Xi'an,710048, China
[4] International Institute of Fashion Technology, Zhejiang Sci-Tech University, Zhejiang, Hangzhou,310018, China
来源
关键词
Clothes - Depth indicators - Hosiery manufacture - Image texture - Pixels - Tachometers - Time difference of arrival - Virtual addresses;
D O I
10.13475/j.fzxb.20230904501
中图分类号
学科分类号
摘要
Objective In order to address the issues of low accuracy in clothing deformation, texture distortion, and high computational costs in image-based virtual try-on systems, this paper proposes a lightweight parser-free virtual try-on based on mixed knowledge distillation and feature enhancement techniques. Method Firstly, by integrating global features and calibrating the results of flow computation at different scales, an improved appearance flow estimation method was proposed to enhance the accuracy of appearance flow estimation. Moreover, a lightweight try-on network based on depth separable convolution was constructed by decoupling image segmentation results and virtual try-on processes using knowledge distillation. Finally, a garment complexity index GTC (garment texture complexity) based on the pixel-wise average gradient was proposed to quantitatively analyze the texture complexity of clothing. Based on this, the VITON dataset is divided into a simple texture set, a moderately complex texture set, and a highly complex texture set. Results This paper used the VITON dataset to verify and analyze the proposed model. Compared with the SOTA (state-of-art) model, the number of parameters and computational complexity (flops) was decreased by 70.12% and 42.38%, respectively, suggesting a faster and better model to meet the deployment requirements of the mobile Internet. Moreover, the experimental results showed that the scores of the proposed model in image quality evaluation indicators (FID, LPIPS, PSNR, KID) were increased by 5.06%, 28.57%, 3.71%, and 33.33%, respectively, compared with the SOTA model. In the segmentation analysis of clothing complexity, the score of KID and LPIPS in this model was 48.08%, 30.45%, 1.03%, 35.54%, 30.41%, and 12.94% higher than that of the SOTA model, respectively, proving that the method proposed is superior to other methods in restoring and preserving original clothing details when warping clothing images with complex textures. Conclusion A lightweight parser-free virtual try-on based on mixed knowledge distillation and feature enhancement techniques is proposed, which uses an efficient appearance flow estimation method to reduce registration errors, complex texture loss, and distortion during the clothing distortion process. In addition, the method proposed is shown to reduce the size and computational complexity of the final model by mixing distillation and using depth-separable convolution effectively and speeding up the running of the model. Finally, a quantitative index used for characterizing the complexity of clothing texture is proposed and the VITON test set is divided into samples. Compared with other virtual try-on methods, the experimental results show that on the VITON test set, the evaluation index results obtained from the proposed method are better than the current virtual try-on method with the best performance, and the ability of the proposed method to deal with clothing with complex patterns is also better than other methods. In addition, the ablation experiment proves that the proposed method has an obvious improvement on the final virtual try-on result. © 2024 China Textile Engineering Society. All rights reserved.
引用
收藏
页码:164 / 174
相关论文
共 21 条
  • [21] KD-PAR: A knowledge distillation-based pedestrian attribute recognition model with multi-label mixed feature learning network
    Wu, Peishu
    Wang, Zidong
    Li, Han
    Zeng, Nianyin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237