Diffusion Self-Guidance for Controllable Image Generation

Authors
Epstein, Dave [1, 2]
Jabri, Allan [1]
Poole, Ben [2]
Efros, Alexei A. [1]
Holynski, Aleksander [1, 2]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Google Res, Mountain View, CA 94043 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that provides greater control over generated images by guiding the internal representations of diffusion models. We demonstrate that properties such as the shape, location, and appearance of objects can be extracted from these representations and used to steer the sampling process. Self-guidance operates similarly to standard classifier guidance, but uses signals present in the pretrained model itself, requiring no additional models or training. We show how a simple set of properties can be composed to perform challenging image manipulations, such as modifying the position or size of specific objects, merging the appearance of objects in one image with the layout of another, composing objects from multiple images into one, and more. We also show that self-guidance can be used for editing real images. See our project page for results and an interactive demo: https://dave.ml/selfguidance
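The idea of steering sampling with a property read off an internal representation can be illustrated with a toy sketch. This is not the paper's implementation: a 5x5 grid of logits stands in for an internal attention map, the "location" property is assumed to be the map's centroid, and the logits are updated directly, whereas in a diffusion model the gradient would be taken with respect to the noisy latent through the network. All function names and parameters (softmax2d, centroid, energy, energy_grad, guide, scale, steps) are hypothetical.

```python
import math


def softmax2d(logits):
    """Normalize a 2D grid of logits into a map that sums to 1."""
    peak = max(v for row in logits for v in row)
    exps = [[math.exp(v - peak) for v in row] for row in logits]
    total = sum(sum(row) for row in exps)
    return [[e / total for e in row] for row in exps]


def centroid(attn):
    """Expected (row, col) position under a normalized map."""
    cy = sum(i * a for i, row in enumerate(attn) for a in row)
    cx = sum(j * a for row in attn for j, a in enumerate(row))
    return cy, cx


def energy(logits, target):
    """Squared distance between the map's centroid and the target position."""
    cy, cx = centroid(softmax2d(logits))
    return (cy - target[0]) ** 2 + (cx - target[1]) ** 2


def energy_grad(logits, target):
    """Analytic gradient of the energy with respect to each logit.

    Uses d(cy)/d(logit_ij) = attn_ij * (i - cy), and likewise for cx with j.
    """
    attn = softmax2d(logits)
    cy, cx = centroid(attn)
    dy, dx = cy - target[0], cx - target[1]
    return [
        [2 * a * (dy * (i - cy) + dx * (j - cx)) for j, a in enumerate(row)]
        for i, row in enumerate(attn)
    ]


def guide(logits, target, scale=0.5, steps=200):
    """Repeatedly step the logits down the energy gradient (toy guidance)."""
    for _ in range(steps):
        grad = energy_grad(logits, target)
        logits = [
            [v - scale * g for v, g in zip(row, grow)]
            for row, grow in zip(logits, grad)
        ]
    return logits


if __name__ == "__main__":
    start = [[0.0] * 5 for _ in range(5)]  # uniform map, centroid at (2, 2)
    target = (3.0, 3.0)                    # assumed desired object location
    moved = guide(start, target)
    print("energy before:", energy(start, target))
    print("energy after: ", energy(moved, target))
    print("centroid after:", centroid(softmax2d(moved)))
```

The same pattern would extend to other properties (for example a size or appearance term) by swapping in a different energy function and summing the gradients, which is one way to read the abstract's remark that simple properties can be composed.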
Pages: 18