Object-level Scene Deocclusion

被引:0
|
作者
Liu, Zhengzhe [1 ]
Liu, Qing [2 ]
Chang, Chirui [3 ]
Zhang, Jianming [2 ]
Pakhomov, Daniil [2 ]
Zheng, Haitian [2 ]
Lin, Zhe [2 ]
Cohen-Or, Daniel [4 ]
Fu, Chi-Wing [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Adobe, San Jose, CA USA
[3] Univ Hong Kong, Hong Kong, Peoples R China
[4] Tel Aviv Univ, Tel Aviv, Israel
关键词
scene deocclusion; object completion; image recomposition;
D O I
10.1145/3641519.3657409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deoccluding the hidden portions of objects in a scene is a formidable task, particularly when addressing real-world scenes. In this paper, we present a new self-supervised PArallel visible-to-COmplete diffusion framework, named PACO, a foundation model for object-level scene deocclusion. Leveraging the rich prior of pre-trained models, we first design the parallel variational autoencoder, which produces a full-view feature map that simultaneously encodes multiple complete objects, and the visible-to-complete latent generator, which learns to implicitly predict the full-view feature map from partial-view feature map and text prompts extracted from the incomplete objects in the input image. To train PACO, we create a large-scale dataset with 500k samples to enable self-supervised learning, avoiding tedious annotations of the amodal masks and occluded regions. At inference, we devise a layer-wise deocclusion strategy to improve efficiency while maintaining the deocclusion quality. Extensive experiments on COCOA and various real-world scenes demonstrate the superior capability of PACO for scene deocclusion, surpassing the state of the arts by a large margin. Our method can also be extended to cross-domain scenes and novel categories that are not covered by the training set. Further, we demonstrate the deocclusion applicability of PACO in single-view 3D scene reconstruction and object recomposition. Project page: https://liuzhengzhe.github.io/Deocclude-Any-Object.github.io/.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Object-Level Scene Context Prediction
    Qiao, Xiaotian
    Zheng, Quanlong
    Cao, Ying
    Lau, Rynson W. H.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5280 - 5292
  • [2] Unsupervised Object-Level Representation Learning from Scene Images
    Xie, Jiahao
    Zhan, Xiaohang
    Liu, Ziwei
    Ong, Yew Soon
    Loy, Chen Change
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [3] Object-level Proposals
    Ma, Jianxiang
    Ming, Anlong
    Huang, Zilong
    Wang, Xinggang
    Zhou, Yu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4931 - 4939
  • [4] Tell Me Where I Am: Object-level Scene Context Prediction
    Qiao, Xiaotian
    Zheng, Quanlong
    Cao, Ying
    Lau, Rynson W. H.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2628 - 2636
  • [5] Detecting Object-Level Scene Changes in Images with Viewpoint Differences Using Graph Matching
    Doi, Kento
    Hamaguchi, Ryuhei
    Iwasawa, Yusuke
    Onishi, Masaki
    Matsuo, Yutaka
    Sakurada, Ken
    REMOTE SENSING, 2022, 14 (17)
  • [6] Object-Level Unknown Obstacle Detection
    Huang, Chuan-Yuan
    Chen, Cheng-Tsung
    Chen, Yu-An
    Chen, Kuan-Wen
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5722 - 5729
  • [7] Object-Level Priors for Stixel Generation
    Cordts, Marius
    Schneider, Lukas
    Enzweiler, Markus
    Franke, Uwe
    Roth, Stefan
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 172 - 183
  • [8] Leveraging Object Proposals for Object-Level Change Detection
    Takuma, Sugimoto
    Kanji, Tanaka
    Kousuke, Yamaguchi
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 397 - 402
  • [9] ObjTest: Object-Level Mutation for Testing Object Detection Systems
    Liu, Zixi
    Feng, Yang
    Xu, Jiali
    Xu, Baowen
    PROCEEDINGS OF THE 15TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE, INTERNETWARE 2024, 2024, : 61 - 70
  • [10] Object-level change detection for autonomous sensemaking
    LeDuc, Dominic
    Fisher, Taber
    Engle, Isaiah
    Vadlamudi, Avinash K.
    Reisman, Matthew D.
    GEOSPATIAL INFORMATICS XII, 2022, 12099