Attention-based Video Virtual Try-On

被引:1
|
作者
Tsai, Wen-Jiin [1 ]
Tien, Yi-Cheng [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Hsinchu, Taiwan
关键词
Virtual try-on; attention; parsing free;
D O I
10.1145/3591106.3592252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a video virtual try-on model which is based on appearance flow warping and is parsing-free. In this model, we utilized attention methods from Transformer [15] and proposed three attention-based modules: a Person-Cloth Transformer, a Self-Attention Generator, and a Cloth Refinement Transformer. The Person-Cloth Transformer enables clothing features to refer to person information, which is beneficial for style vector calculation and also improves the style warping process to estimate better appearance flows. The Self-Attention Generator utilizes a self-attention mechanism at the deepest feature layer, which enables the feature map to learn global context from all the other pixels, helping it synthesize more realistic results. The Cloth Refinement Transformer utilizes two cross-attention modules: one enables the current warped clothes to refer to previously warped clothes to ensure it is temporally consistent, and the other enables the current warped clothes to refer to person information to ensure it is spatially aligned. Our ablation study shows that each proposed module contributes to the improvement of the results. Experiment results show that our model can generate realistic try-on videos with high quality and perform better than existing methods.
引用
收藏
页码:209 / 216
页数:8
相关论文
共 50 条
  • [21] MIRROR: Towards Generalizable On-Device Video Virtual Try-On for Mobile Shopping
    Kang, Dong-Sig
    Baek, Eunsu
    Son, Sungwook
    Lee, Youngki
    Gong, Taesik
    Kim, Hyung-Sin
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2023, 7 (04):
  • [22] A Virtual Try-On System for Prescription Eyeglasses
    Zhang, Qian
    Guo, Yu
    Laffont, Pierre-Yves
    Martin, Tobias
    Gross, Markus
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2017, 37 (04) : 84 - +
  • [23] Virtual try-on by replacing the person in image
    Li, Jun
    Zhang, Mingmin
    Pan, Zhigeng
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2015, 27 (09): : 1694 - 1700
  • [24] HRAM-VITON: High-Resolution Virtual Try-On with Attention Mechanism
    Chen, Yue
    Liang, Xiaoman
    Lin, Mugang
    Zhang, Fachao
    Zhao, Huihuang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (02): : 2753 - 2768
  • [26] Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
    Ye, Jianglei
    Wang, Yigang
    Xie, Fengmao
    Wang, Qin
    Gu, Xiaoling
    Wu, Zizhao
    VISUAL COMPUTER, 2024, : 3297 - 3308
  • [27] Cloth Interactive Transformer for Virtual Try-On
    Ren, Bin
    Tang, Hao
    Meng, Fanyang
    Ding Runwei
    Torr, Philip H. S.
    Sebe, Nicu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [28] Significance of Anatomical Constraints in Virtual Try-On
    Roy, Debapriya
    Santra, Sanchayan
    Mukherjee, Diganta
    Chanda, Bhabatosh
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1853 - 1864
  • [29] IMAGE-BASED VIRTUAL TRY-ON NETWORK WITH STRUCTURAL COHERENCE
    Sun, Feng
    Guo, Jiaming
    Su, Zhuo
    Gao, Chengying
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 519 - 523
  • [30] Style-Based Global Appearance Flow for Virtual Try-On
    He, Sen
    Song, Yi-Zhe
    Xiang, Tao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3460 - 3469