Object-level Scene Deocclusion

被引:0
|
作者
Liu, Zhengzhe [1 ]
Liu, Qing [2 ]
Chang, Chirui [3 ]
Zhang, Jianming [2 ]
Pakhomov, Daniil [2 ]
Zheng, Haitian [2 ]
Lin, Zhe [2 ]
Cohen-Or, Daniel [4 ]
Fu, Chi-Wing [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Adobe, San Jose, CA USA
[3] Univ Hong Kong, Hong Kong, Peoples R China
[4] Tel Aviv Univ, Tel Aviv, Israel
关键词
scene deocclusion; object completion; image recomposition;
D O I
10.1145/3641519.3657409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deoccluding the hidden portions of objects in a scene is a formidable task, particularly when addressing real-world scenes. In this paper, we present a new self-supervised PArallel visible-to-COmplete diffusion framework, named PACO, a foundation model for object-level scene deocclusion. Leveraging the rich prior of pre-trained models, we first design the parallel variational autoencoder, which produces a full-view feature map that simultaneously encodes multiple complete objects, and the visible-to-complete latent generator, which learns to implicitly predict the full-view feature map from partial-view feature map and text prompts extracted from the incomplete objects in the input image. To train PACO, we create a large-scale dataset with 500k samples to enable self-supervised learning, avoiding tedious annotations of the amodal masks and occluded regions. At inference, we devise a layer-wise deocclusion strategy to improve efficiency while maintaining the deocclusion quality. Extensive experiments on COCOA and various real-world scenes demonstrate the superior capability of PACO for scene deocclusion, surpassing the state of the arts by a large margin. Our method can also be extended to cross-domain scenes and novel categories that are not covered by the training set. Further, we demonstrate the deocclusion applicability of PACO in single-view 3D scene reconstruction and object recomposition. Project page: https://liuzhengzhe.github.io/Deocclude-Any-Object.github.io/.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Global Localization with Object-Level Semantics and Topology
    Liu, Yu
    Petillot, Yvan
    Lane, David
    Wang, Sen
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 4909 - 4915
  • [22] Object Bank: An Object-Level Image Representation for High-Level Visual Recognition
    Li-Jia Li
    Hao Su
    Yongwhan Lim
    Li Fei-Fei
    International Journal of Computer Vision, 2014, 107 : 20 - 39
  • [23] Object-Level Image Segmentation Using Low Level Cues
    Zhu, Hongyuan
    Zheng, Jianmin
    Cai, Jianfei
    Thalmann, Nadia M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (10) : 4019 - 4027
  • [24] Object Bank: An Object-Level Image Representation for High-Level Visual Recognition
    Li, Li-Jia
    Su, Hao
    Lim, Yongwhan
    Li Fei-Fei
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (01) : 20 - 39
  • [25] SafePicking: Learning Safe Object Extraction via Object-Level Mapping
    Wada, Kentaro
    James, Stephen
    Davison, Andrew J.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 10202 - 10208
  • [26] Learning to Complete Object Shapes for Object-level Mapping in Dynamic Scenes
    Xu, Binbin
    Davison, Andrew J.
    Leutenegger, Stefan
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2257 - 2264
  • [27] Online object-level SLAM with dual bundle adjustment
    Liu, Jiaqi
    Gao, Yongbin
    Jiang, Xiaoyan
    Fang, Zhijun
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25092 - 25105
  • [28] LoomIO: Object-Level Coordination in Distributed File Systems
    Hua, Yusheng
    Shi, Xuanhua
    He, Kang
    Jin, Hai
    Xie, Wei
    He, Ligang
    Chen, Yong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1799 - 1810
  • [29] Object-level Perception Sharing Among Connected Vehicles
    Ambrosin, Moreno
    Alvarez, Ignacio J.
    Buerkle, Cornelius
    Yang, Lily L.
    Oboril, Fabian
    Sastry, Manoj R.
    Sivanesan, Kathiravetpillai
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 1566 - 1573
  • [30] Object-Level Salience Detection by Progressively Enhanced Network
    Yuan, Wang
    Song, Haichuan
    Tan, Xin
    Chen, Chengwei
    Ding, Shouhong
    Ma, Lizhuang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 371 - 382