FA-VTON: A Feature Alignment-Based Model for Virtual Try-On

被引:0
|
作者
Wan, Yan [1 ]
Ding, Ning [1 ]
Yao, Li [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 12期
关键词
deep learning; virtual try-on; image generation; knowledge distillation;
D O I
10.3390/app14125255
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The virtual try-on technology based on 2D images aims to seamlessly transfer provided garments onto target person images. Prior methods mainly concentrated on warping garments and generating images, overlooking the influence of feature alignment on the try-on results. In this study, we initially analyze the distortions present by existing methods and elucidate the critical role of feature alignment in the extraction stage. Building on this, we propose a novel feature alignment-based model (FA-VTON). Specifically, FA-VTON aligns the upsampled higher-level features from both person and garment images to acquire precise boundary information, which serves as guidance for subsequent garment warping. Concurrently, the Efficient Channel Attention mechanism (ECA) is introduced to generate the final result in the try-on generation module. This mechanism enables adaptive adjustment of channel feature weights to extract important features and reduce artifact generation. Furthermore, to make the student network focus on salient regions of each channel, we utilize channel-wise distillation (CWD) to minimize the Kullback-Leibler (KL) divergence between the channel probability maps of the two networks. The experiments show that our model achieves better results in both qualitative and quantitative analyses compared to current methods on the popular virtual try-on datasets.
引用
收藏
页数:22
相关论文
共 50 条
  • [11] D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion Based Virtual Try-On
    Yang, Zhaotong
    Jiang, Zicheng
    Li, Xinzhe
    Zhou, Huiyu
    Dong, Junyu
    Zhang, Huaidong
    Du, Yong
    COMPUTER VISION-ECCV 2024, PT XLVI, 2025, 15104 : 36 - 52
  • [12] UF-VTON: Toward User-Friendly Virtual Try-On Network
    Chang, Yuan
    Peng, Tao
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 313 - 321
  • [13] MT-VTON: Multilevel Transformation-Based Virtual Try-On for Enhancing Realism of Clothing
    Lee, Jaeyoung
    Lee, Moonhyun
    Kim, Younghoon
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [14] DP-VTON: TOWARD DETAIL-PRESERVING IMAGE-BASED VIRTUAL TRY-ON NETWORK
    Chang, Yuan
    Peng, Tao
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2295 - 2299
  • [15] Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
    Ye, Jianglei
    Wang, Yigang
    Xie, Fengmao
    Wang, Qin
    Gu, Xiaoling
    Wu, Zizhao
    VISUAL COMPUTER, 2024, : 3297 - 3308
  • [16] ST-VTON: Self-supervised vision transformer for image-based virtual try-on
    Chong, Zheng
    Mo, Lingfei
    IMAGE AND VISION COMPUTING, 2022, 127
  • [17] LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
    Morelli, Davide
    Baldrati, Alberto
    Cartella, Giuseppe
    Cornia, Marcella
    Bertini, Marco
    Cucchiara, Rita
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8580 - 8589
  • [18] VTON-HF: High Fidelity Virtual Try-on Network via Semantic Adaptation
    Du, Chenghu
    Yu, Feng
    Chen, Yadong
    Jiang, Minghua
    Wei, Xiong
    Peng, Tao
    Hu, Xinrong
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 224 - 231
  • [19] PG-VTON: A Novel Image-Based Virtual Try-On Method via Progressive Inference Paradigm
    Fang, Naiyu
    Qiu, Lemiao
    Zhang, Shuyou
    Wang, Zili
    Hu, Kerui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6595 - 6608
  • [20] SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on
    Hu, Bingwen
    Liu, Ping
    Zheng, Zhedong
    Ren, Mingwu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1233 - 1246