DISA: Disentangled Dual-Branch Framework for Affordance-Aware Human Insertion

被引:0
|
作者
Cao, Xuanqing [1 ]
Zhou, Wengang [1 ]
Sun, Qi [1 ]
Wang, Weilun [1 ]
Li, Li [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-scene interaction; controllable human synthesis; diffusion models;
D O I
10.1145/3715140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Affordance-aware human insertion is a controllable human synthesis task aimed at seamlessly integrating a person into a scene while aligning human pose with contextual scene affordance and preserving human visual identity. Previous methods, typically reliant on a general framework of inpainting that injects all conditional information into a single branch, often struggle with the complexities of real-world contexts and the nuanced attributes of human figures. To this end, we present a novel Disentangled dual-branch framework for Affordance-aware human insertion task (DISA), which focuses on both scene context comprehension and precise person attribute extraction. Specifically, our dual-branch design facilitates diffusion models to ensure disentangled and precise manipulations: one branch utilizes an additional network for deep scene context comprehension and control, while the other branch employs a parallel encoder to extract the feature of the reference person and injects this information through cross-attention mechanism. Furthermore, to comprehensively evaluate affordance-aware human insertion task, we introduce a new metric to assess the preservation of visual identity. We conduct a broad variety of evaluation experiments and validate the diversity and robustness of our method in different settings and downstream applications. Both qualitative and quantitative experimental analysis demonstrates that our approach outperforms previous methods in terms of image quality, pose accuracy, and visual identity preservation.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
    Kulal, Sumith
    Brooks, Tim
    Aiken, Alex
    Wu, Jiajun
    Yang, Jimei
    Lu, Jingwan
    Efros, Alexei A.
    Singh, Krishna Kumar
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17089 - 17099
  • [2] Harmonizing the Cacophony with MIC: An Affordance-aware Framework for Platform Moderation
    Bajpai T.
    Asher D.
    Goswami A.
    Chandrasekharan E.
    Proceedings of the ACM on Human-Computer Interaction, 2022, 6
  • [3] Affordance-Aware Handovers With Human Arm Mobility Constraints
    Ardon, Paola
    Cabrera, Maria E.
    Pairet, Eric
    Petrick, Ronald P. A.
    Ramamoorthy, Subramanian
    Lohan, Katrin S.
    Cakmak, Maya
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3136 - 3143
  • [4] Text2Place: Affordance-Aware Text Guided Human Placement
    Parihar, Rishubh
    Gupta, Harsh
    Sachidanand, V. S.
    Babu, R. Venkatesh
    COMPUTER VISION - ECCV 2024, PT III, 2025, 15061 : 57 - 77
  • [5] Dual-Branch Residual Disentangled Adversarial Learning Network for Facial Expression Recognition
    Chen, Puhua
    Wang, Zhe
    Mao, Shasha
    Hui, Xinyue
    Ning, Huyan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1840 - 1844
  • [6] Physics-aware dual-branch architectures for accurate weather predictions
    Zhang, Cuilian
    Zhou, Weijun
    GEOINFORMATICA, 2025,
  • [7] Inconsistency-Aware Wavelet Dual-Branch Network for Face Forgery Detection
    Jia G.
    Zheng M.
    Hu C.
    Ma X.
    Xu Y.
    Liu L.
    Deng Y.
    He R.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (03): : 308 - 319
  • [8] Dual-Branch Deep Point Cloud Registration Framework for Unconstrained Rotation
    Fu, Kexue
    Li, Zhihao
    Xu, Mingye
    Luo, Xiaoyuan
    Wang, Manning
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (07) : 7851 - 7861
  • [9] DBGNet: Dual-Branch Gate-Aware Network for Infrared Small Target Detection
    Chi, Weijian
    Liu, Jiahang
    Wang, Xiaozhen
    Feng, Ruilei
    Cui, Jian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [10] Crowd counting by the dual-branch scale-aware network with ranking loss constraints
    Wu, Qin
    Yan, Fangfang
    Chai, Zhilei
    Guo, Guodong
    IET COMPUTER VISION, 2020, 14 (03) : 101 - 109