GazeFusion: Saliency-Guided Image Generation

被引:0
|
作者
Zhang, Yunxiang [1 ]
Wu, Nan [2 ]
Lin, Connor Z. [2 ]
Wetzstein, Gordon [2 ]
Sun, Qi [1 ]
机构
[1] NYU, Brooklyn, NY 11201 USA
[2] Stanford Univ, Stanford, CA USA
关键词
Human Visual Attention; Perceptual Computer Graphics; Controllable Image Generation; VISUAL-ATTENTION; ALLOCATION; MODEL;
D O I
10.1145/3694969
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Diffusion models offer unprecedented image generation power given just a text prompt. While emerging approaches for controlling diffusion models have enabled users to specify the desired spatial layouts of the generated content, they cannot predict or control where viewers will pay more attention due to the complexity of human vision. Recognizing the significance of attention-controllable image generation in practical applications, we present a saliency-guided framework to incorporate the data priors of human visual attention mechanisms into the generation process. Given a user-specified viewer attention distribution, our control module conditions a diffusion model to generate images that attract viewers' attention toward the desired regions. To assess the efficacy of our approach, we performed an eye-tracked user study and a large-scale model-based saliency analysis. The results evidence that both the cross-user eye gaze distributions and the saliency models' predictions align with the desired attention distributions. Lastly, we outline several applications, including interactive design of saliency guidance, attention suppression in unwanted regions, and adaptive generation for varied display/viewing conditions.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] DEEP ACTIVE LEARNING BASED ON SALIENCY-GUIDED DATA AUGMENTATION FOR IMAGE CLASSIFICATION
    Liu, Ying
    Pang, Yuliang
    Zhang, Weidong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 815 - 819
  • [32] A Saliency-Guided Method for Automatic Photo Refocusing
    Liu, Na
    Ju, Ran
    Ren, Tongwei
    Wu, Gangshan
    8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, : 264 - 267
  • [33] A SPARSE LINEAR MODEL FOR SALIENCY-GUIDED DECOLORIZATION
    Liu, Chun-Wei
    Liu, Tyng-Luh
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1105 - 1109
  • [34] Remote Sensing Image Super-Resolution via Saliency-Guided Feedback GANs
    Wu, Hanlin
    Zhang, Libao
    Ma, Jie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [35] Closed-Form Optimization on Saliency-Guided Image Compression for HEVC-MSP
    Li, Shengxi
    Xu, Mai
    Ren, Yun
    Wang, Zulin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (01) : 155 - 170
  • [36] Boosting Factorization Machines via Saliency-Guided Mixup
    Wu, Chenwang
    Lian, Defu
    Ge, Yong
    Zhou, Min
    Chen, Enhong
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4443 - 4459
  • [37] SSiT: Saliency-Guided Self-Supervised Image Transformer for Diabetic Retinopathy Grading
    Huang, Yijin
    Lyu, Junyan
    Cheng, Pujin
    Tam, Roger
    Tang, Xiaoying
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 2806 - 2817
  • [38] Saliency-guided Selective Magnification for Company Logo Detection
    Eggert, Christian
    Winschel, Anton
    Zecha, Dan
    Lienhart, Rainer
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 651 - 656
  • [39] Saliency-guided improvement for hand posture detection and recognition
    Chuang, Yuelong
    Chen, Ling
    Chen, Gencai
    NEUROCOMPUTING, 2014, 133 : 404 - 415
  • [40] Saliency-Guided Object Candidates Based on Gestalt Principles
    Werner, Thomas
    Martin-Garcia, German
    Frintrop, Simone
    COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 34 - 44