DISA: Disentangled Dual-Branch Framework for Affordance-Aware Human Insertion

被引:0
|
作者
Cao, Xuanqing [1 ]
Zhou, Wengang [1 ]
Sun, Qi [1 ]
Wang, Weilun [1 ]
Li, Li [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-scene interaction; controllable human synthesis; diffusion models;
D O I
10.1145/3715140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Affordance-aware human insertion is a controllable human synthesis task aimed at seamlessly integrating a person into a scene while aligning human pose with contextual scene affordance and preserving human visual identity. Previous methods, typically reliant on a general framework of inpainting that injects all conditional information into a single branch, often struggle with the complexities of real-world contexts and the nuanced attributes of human figures. To this end, we present a novel Disentangled dual-branch framework for Affordance-aware human insertion task (DISA), which focuses on both scene context comprehension and precise person attribute extraction. Specifically, our dual-branch design facilitates diffusion models to ensure disentangled and precise manipulations: one branch utilizes an additional network for deep scene context comprehension and control, while the other branch employs a parallel encoder to extract the feature of the reference person and injects this information through cross-attention mechanism. Furthermore, to comprehensively evaluate affordance-aware human insertion task, we introduce a new metric to assess the preservation of visual identity. We conduct a broad variety of evaluation experiments and validate the diversity and robustness of our method in different settings and downstream applications. Both qualitative and quantitative experimental analysis demonstrates that our approach outperforms previous methods in terms of image quality, pose accuracy, and visual identity preservation.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Dual-branch framework: AUV-based target recognition method for marine survey
    Yu, Fei
    He, Bo
    Liu, Jixin
    Wang, Qi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 115
  • [22] Dual-Branch Interactive Networks on Multichannel Time Series for Human Activity Recognition
    Tang, Yin
    Zhang, Lei
    Wu, Hao
    He, Jun
    Song, Aiguo
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (10) : 5223 - 5234
  • [23] DBCA-Net: A Dual-Branch Context-Aware Algorithm for Cattle Face Segmentation and Recognition
    Feng, Xiaopu
    Zhang, Jiaying
    Qi, Yongsheng
    Liu, Liqiang
    Li, Yongting
    AGRICULTURE-BASEL, 2025, 15 (05):
  • [24] HRNet Encoder and Dual-Branch Decoder Framework-Based Scene Text Recognition Model
    Li, Meiling
    Li, Xiumei
    Sun, Junmei
    Dong, Yujin
    INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2022, 2022
  • [25] A Dual-Branch Framework With Prior Knowledge for Precise Segmentation of Lung Nodules in Challenging CT Scans
    Jiang, Wujun
    Zhi, Lijia
    Zhang, Shaomin
    Zhou, Tao
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1540 - 1551
  • [26] A dual-branch multi-feature deep fusion network framework for hyperspectral image classification
    Liu, Linfeng
    Zhang, Chengcai
    Luo, Weiran
    GEOCARTO INTERNATIONAL, 2022, 37 (27) : 18692 - 18715
  • [27] Local dual-branch attention feature learning framework from UAVs for visual defect detection
    Xu, Jianbing
    Zhou, Jiangxin
    Xu, Dongxu
    Chen, Yu
    VISUAL COMPUTER, 2025,
  • [28] Cumulative dual-branch network framework for long-tailed multi-class classification
    Fan, Saite
    Zhang, Xinmin
    Song, Zhihuan
    Shao, Weiming
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [29] 3d human pose estimation based on conditional dual-branch diffusion
    Li, Jinghua
    Bai, Zhuowei
    Kong, Dehui
    Chen, Dongpan
    Li, Qianxing
    Yin, Baocai
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [30] DCA-Net: Dual-branch contextual-aware network for auxiliary localization and segmentation of parathyroid glands
    Liu, Qian
    Ding, Feng
    Li, Jiyu
    Ji, Shuxia
    Liu, Kailin
    Geng, Chong
    Lyu, Lei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84