FA-VTON: A Feature Alignment-Based Model for Virtual Try-On

被引:0
|
作者
Wan, Yan [1 ]
Ding, Ning [1 ]
Yao, Li [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 12期
关键词
deep learning; virtual try-on; image generation; knowledge distillation;
D O I
10.3390/app14125255
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The virtual try-on technology based on 2D images aims to seamlessly transfer provided garments onto target person images. Prior methods mainly concentrated on warping garments and generating images, overlooking the influence of feature alignment on the try-on results. In this study, we initially analyze the distortions present by existing methods and elucidate the critical role of feature alignment in the extraction stage. Building on this, we propose a novel feature alignment-based model (FA-VTON). Specifically, FA-VTON aligns the upsampled higher-level features from both person and garment images to acquire precise boundary information, which serves as guidance for subsequent garment warping. Concurrently, the Efficient Channel Attention mechanism (ECA) is introduced to generate the final result in the try-on generation module. This mechanism enables adaptive adjustment of channel feature weights to extract important features and reduce artifact generation. Furthermore, to make the student network focus on salient regions of each channel, we utilize channel-wise distillation (CWD) to minimize the Kullback-Leibler (KL) divergence between the channel probability maps of the two networks. The experiments show that our model achieves better results in both qualitative and quantitative analyses compared to current methods on the popular virtual try-on datasets.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Self-supervised feature matched virtual try-on
    Jiang, Shiyi
    Xu, Yang
    Li, Danyang
    Fan, Runze
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (05) : 1958 - 1969
  • [22] M3D-VTON: A Monocular-to-3D Virtual Try-On Network
    Zhao, Fuwei
    Xie, Zhenyu
    Kampffmeyer, Michael
    Dong, Haoye
    Han, Songfang
    Zheng, Tianxiang
    Zhang, Tao
    Liang, Xiaodan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13219 - 13229
  • [23] NL-VTON: a non-local virtual try-on network with feature preserving of body and clothes (vol 11, 19950, 2021)
    Tan, Ze Lin
    Bai, Jing
    Zhang, Shao Min
    Qin, Fei Wei
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [24] CloTH-VTON plus : Clothing Three-Dimensional Reconstruction for Hybrid Image-Based Virtual Try-ON
    Minar, Matiur Rahman
    Tuan, Thai Thanh
    Ahn, Heejune
    IEEE ACCESS, 2021, 9 : 30960 - 30978
  • [25] PF-VTON: Toward High-Quality Parser-Free Virtual Try-On Network
    Chang, Yuan
    Peng, Tao
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 28 - 40
  • [26] Attention-based Video Virtual Try-On
    Tsai, Wen-Jiin
    Tien, Yi-Cheng
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 209 - 216
  • [27] Optimization based Garment Transfer for Virtual Try-on
    Xie, Hao-Yang
    Zhong, Yue-Qi
    Yu, Zhi-Cai
    TEXTILE BIOENGINEERING AND INFORMATICS SYMPOSIUM (TBIS) PROCEEDINGS, 2020, 2020, : 251 - 258
  • [28] Image-Based Virtual Try-On: A Survey
    Song, Dan
    Zhang, Xuanpu
    Zhou, Juan
    Nie, Weizhi
    Tong, Ruofeng
    Kankanhalli, Mohan
    Liu, An-An
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2692 - 2720
  • [29] VTNCT: an image-based virtual try-on network by combining feature with pixel transformation
    Chang, Yuan
    Peng, Tao
    Yu, Feng
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    VISUAL COMPUTER, 2023, 39 (07): : 2583 - 2596
  • [30] VTNCT: an image-based virtual try-on network by combining feature with pixel transformation
    Yuan Chang
    Tao Peng
    Feng Yu
    Ruhan He
    Xinrong Hu
    Junping Liu
    Zili Zhang
    Minghua Jiang
    The Visual Computer, 2023, 39 : 2583 - 2596