CartoonDiff: Training-free Cartoon Image Generation with Diffusion Transformer Models

被引:1
|
作者
He, Feihong [1 ]
Li, Gang [2 ,3 ]
Si, Lingyu [2 ]
Yan, Leilei [1 ]
Hou, Shimeng [4 ]
Dong, Hongwei [2 ]
Li, Fanzhang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] Chinese Acad Sci, Inst Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Northwestern Polytech Univ, Fremont, CA USA
基金
中国博士后科学基金; 中国国家自然科学基金; 国家重点研发计划;
关键词
Diffusion models; cartoon image generation; training-free cartoonization;
D O I
10.1109/ICASSP48485.2024.10447821
中图分类号
学科分类号
摘要
Image cartoonization has attracted significant interest in the field of image generation. However, most of the existing image cartoonization techniques require re-training models using images of cartoon style. In this paper, we present CartoonDiff, a novel training-free sampling approach which generates image cartoonization using diffusion transformer models. Specifically, we decompose the reverse process of diffusion models into the semantic generation phase and the detail generation phase. Furthermore, we implement the image cartoonization process by normalizing high-frequency signal of the noisy image in specific denoising steps. CartoonDiff doesn't require any additional reference images, complex model designs, or the tedious adjustment of multiple parameters. Extensive experimental results show the powerful ability of our CartoonDiff. The project page is available at: https://cartoondiff.github.io/
引用
收藏
页码:3825 / 3829
页数:5
相关论文
共 50 条
  • [21] Towards Training-Free Appearance-Based Localization: Probabilistic Models for Whole-Image Descriptors
    Lowry, Stephanie M.
    Wyeth, Gordon F.
    Milford, Michael J.
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 711 - 717
  • [22] FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
    Yu, Jiwen
    Wang, Yinhuai
    Zhao, Chen
    Ghanem, Bernard
    Zhang, Jian
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23117 - 23127
  • [23] Diffusion Model is Secretly a Training-Free Open Vocabulary Semantic Segmenter
    Wang, Jinglong
    Li, Xiawei
    Zhang, Jing
    Xu, Qingyuan
    Zhou, Qin
    Yu, Qian
    Sheng, Lu
    Xu, Dong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1895 - 1907
  • [24] Training-free prior guided diffusion model for zero-reference low-light image enhancement
    Shang, Kai
    Shao, Mingwen
    Wang, Chao
    Qiao, Yuanjian
    Wan, Yecong
    NEUROCOMPUTING, 2025, 617
  • [25] A training-free recursive multiresolution framework for diffeomorphic deformable image registration
    Sheikhjafari, Ameneh
    Noga, Michelle
    Punithakumar, Kumaradevan
    Ray, Nilanjan
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12546 - 12555
  • [26] Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning
    Miaol, Zichen
    Wang, Jiang
    Wang, Ze
    Yang, Zhengyuan
    Wang, Lijuan
    Qiu, Qiang
    Liu, Zicheng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10844 - 10853
  • [27] A training-free recursive multiresolution framework for diffeomorphic deformable image registration
    Ameneh Sheikhjafari
    Michelle Noga
    Kumaradevan Punithakumar
    Nilanjan Ray
    Applied Intelligence, 2022, 52 : 12546 - 12555
  • [28] TRAINING-FREE LOCATION-AWARE TEXT-TO-IMAGE SYNTHESIS
    Mao, Jiafeng
    Wang, Xueting
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 995 - 999
  • [29] An Efficient and Training-Free Blind Image Blur Assessment in the Spatial Domain
    Bong, David B. L.
    Khoo, Bee Ee
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (07): : 1864 - 1871
  • [30] LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models
    Javaheripi, Mojan
    de Rosa, Gustavo H.
    Mukherjee, Subhabrata
    Shah, Shital
    Religa, Tomasz L.
    Mendes, Caio C. T.
    Bubeck, Sebastien
    Koushanfar, Farinaz
    Dey, Debadeepta
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,