A Markov Chain approach for video-based virtual try-on with denoising diffusion generative adversarial network

被引:1
|
作者
Hou, Jue [1 ,2 ]
Lu, Yinwen [1 ,2 ]
Wang, Mingjie [3 ]
Ouyang, Wenbing [4 ]
Yang, Yang [1 ,2 ]
Zou, Fengyuan [1 ,2 ]
Gu, Bingfei [1 ,2 ]
Liu, Zheng [2 ,5 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Fash Design & Engn, CN-310018 Hangzhou, Zhejiang, Peoples R China
[2] Minist Culture & Tourism, Key Lab Silk Culture Heritage & Prod Design Digita, CN-310018 Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Sch Sci, Dept Math, CN-310018 Hangzhou, Zhejiang, Peoples R China
[4] Amazon Inc, 410 Terry Ave N, Seattle, WA 98109 USA
[5] Zhejiang Sci Tech Univ, Sch Int Educ, CN-310018 Hangzhou, Zhejiang, Peoples R China
关键词
Markov Chain; Diffusion model; Video synthesis; Virtual try -on;
D O I
10.1016/j.knosys.2024.112233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based virtual try-ons have attracted unprecedented attention owing to the development of e-commerce. However, this problem is very challenging because of the arbitrary poses of persons and the demand for temporary consistency of frames, particularly when attempting to synthesize high-quality virtual try-on videos using single images. Specifically, there are two key challenges. 1) The existing video-based virtual try-on methods are based on generative adversarial networks (GAN), which are limited by unstable training and a lack of realism in generated details. 2) The explicit building of stronger constraints of generated frames, which aims to increase the coherence of generated videos. To address these challenges, this study proposed a novel framework, Extended Markov Chain Based Denoising Diffusion Generative Adversarial Network (EMC-DDGAN), which was derived from a denoising diffusion GAN, which is a diffusion model with efficient sampling. Moreover, we proposed an extended Markov chain that used a diffusion model to synthesize frames via sequential generation. With a carefully designed network and learning objects, the proposed approach achieved outstanding performance on public datasets. Rigorous experiments demonstrated that EMC-DDGAN could synthesize higher-quality videos compared to other state-of-the-art methods and validated the effectiveness of the proposed approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A new blind image denoising method based on asymmetric generative adversarial network
    Wang, Yiming
    Chang, Dongxia
    Zhao, Yao
    IET IMAGE PROCESSING, 2021, 15 (06) : 1260 - 1272
  • [42] Bathymetric Data Processing based on Denoising Autoencoder Wasserstein Generative Adversarial Network
    Zhang, Ruichen
    Chen, Yongbing
    Bian, Shaofeng
    Gao, Duanyang
    GLOBAL INTELLIGENCE INDUSTRY CONFERENCE (GIIC 2018), 2018, 10835
  • [43] A novel image denoising algorithm based on least square generative adversarial network
    Mohammed, Sharfuddin Waseem
    Murugan, Brindha
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
  • [44] An integrated method of seismic data reconstruction and denoising based on generative adversarial network
    Zhang, Yan
    Zhang, Yiming
    Dong, Hongli
    Song, Liwei
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2024, 59 (04): : 714 - 723
  • [45] SP-VITON: shape-preserving image-based virtual try-on network
    Song, Dan
    Li, Tianbao
    Mao, Zhendong
    Liu, An-An
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33757 - 33769
  • [46] CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM
    Chen, Jinguang
    Zhang, Xin
    Ma, Lili
    Yang, Bo
    Zhang, Kaibing
    VISUAL COMPUTER, 2025, 41 (01): : 563 - 577
  • [47] Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
    Ye, Jianglei
    Wang, Yigang
    Xie, Fengmao
    Wang, Qin
    Gu, Xiaoling
    Wu, Zizhao
    VISUAL COMPUTER, 2024, : 3297 - 3308
  • [48] A Novel Conditional Generative Adversarial Network Based On Graph Attention Network For Moving Image Denoising
    Shen, Weihong
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06): : 831 - 841
  • [49] TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection
    Aslam, Nazia
    Kolekar, Maheshkumar H.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [50] Unsupervised Transfer Learning For Video Prediction Based on Generative Adversarial Network
    Shi, Jiwen
    Zhu, Qiuguo
    Wu, Jun
    2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,