Amodal instance segmentation with dual guidance from contextual and shape priors

被引:0
|
作者
Zhan, Jiao [1 ]
Luo, Yarong [1 ]
Guo, Chi [1 ,2 ]
Wu, Yejun [3 ]
Yang, Bohan [1 ]
Wang, Jingrong [1 ]
Liu, Jingnan [1 ]
机构
[1] Wuhan Univ, GNSS Res Ctr, Wuhan 430072, Hubei, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Hubei, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
基金
中国博士后科学基金;
关键词
Instance segmentation; Amodal instance segmentation; Pixel affinity; Contextual dependency;
D O I
10.1016/j.asoc.2024.112602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human perception possesses the remarkable ability to mentally reconstruct the complete structure of occluded objects, which has inspired researchers to pursue amodal instance segmentation for a more comprehensive understanding of the scene. Previous works have shown promising results, but they often capture the contextual dependencies in an unsupervised way, which can lead to undesirable contextual dependencies and unreasonable feature representations. To tackle this problem, we propose a Pixel Affinity-Parsing (PAP) module trained with the Pixel Affinity Loss (PAL). Embedded into CNN, the PAP module can leverage learned contextual priors to guide the network to explicitly distinguish different relationships between pixels, thus capturing the intraclass and inter-class contextual dependencies in a non-local and supervised way. This process helps to yield robust feature representations to prevent the network from misjudging. To demonstrate the effectiveness of the PAP module, we design an effective Pixel Affinity-Parsing Network (PAPNet). Notably, PAPNet also introduces shape priors to guide the amodal mask refinement process, thus preventing implausible shapes in the predicted masks. Consequently, with the dual guidance of contextual and shape priors, PAPNet can reconstruct the full shape of occluded objects accurately and reasonably. Experimental results demonstrate that the proposed PAPNet outperforms existing state-of-the-art methods on multiple amodal datasets. Specifically, on the KINS dataset, PAPNet achieves 37.1% AP, 60.6% AP50 and 39.8% AP75, surpassing C2F-Seg by 0.6%, 2.4% and 2.8%. On the D2SA dataset, PAPNet achieves 71.70% AP, 85.98% AP50 and 77.10% AP75, surpassing PGExp by 0.75% and 0.33% in AP50 and AP75, and being comparable to PGExp in AP. On the COCOA-cls dataset, PAPNet achieves 41.29% AP, 60.95% AP50 and 46.17% AP75, surpassing PGExp by 3.74%, 3.21% and 4.76%. On the CWALT dataset, PAPNet achieves 72.51% AP, 85.02% AP50 and 80.47% AP75, surpassing VRSPNet by 5.38%, 0.07% and 5.35%. The code is available at https://github.com/jiaoZ7688/PAP-Net.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Bayesian Shape Models with Shape Priors for MRI Brain Segmentation
    Garcia, Hernan F.
    Alvarez, Mauricio A.
    Orozco, Alvaro
    ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT II, 2014, 8888 : 851 - 860
  • [32] MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors
    Erdil, Ertunc
    Yildirim, Sinan
    Cetin, Mujdat
    Tasdizen, Tolga
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 411 - 419
  • [33] Graph cut segmentation with nonlinear shape priors
    Malcolm, James
    Rathi, Yogesh
    Tannenbaum, Allen
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2061 - 2064
  • [34] Amodal Instance Segmentation of Thin Objects with Large Overlaps by Seed-to-Mask Extending
    Kanke, Ryohei
    Takahashi, Masanobu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (07) : 908 - 911
  • [35] Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors
    Du, Heming
    Yu, Xin
    Hussain, Farookh
    Armin, Mohammad Ali
    Petersson, Lars
    Li, Weihao
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4260 - 4269
  • [36] Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation
    Trung-Nghia Le
    Tam V. Nguyen
    Minh-Triet Tran
    Machine Vision and Applications, 2022, 33
  • [37] Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation
    Le, Trung-Nghia
    Nguyen, Tam, V
    Tran, Minh-Triet
    MACHINE VISION AND APPLICATIONS, 2022, 33 (02)
  • [38] Application of amodal segmentation for shape reconstruction and occlusion recovery in occluded tomatoes
    Yang, Jing
    Deng, Hanbing
    Zhang, Yufeng
    Zhou, Yuncheng
    Miao, Teng
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [39] UNILATERAL HIP JOINT SEGMENTATION WITH SHAPE PRIORS LEARNED FROM MISSING DATA
    Chandra, Shekhar
    Xia, Yinq
    Engstrom, Craig
    Schwarz, Raphael
    Lauer, Lars
    Crozier, Stuart
    Salvado, Olivier
    Fripp, Jurgen
    2012 9TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2012, : 1711 - 1714
  • [40] Nonlinear Shape Manifolds as Shape Priors in Level Set Segmentation and Tracking
    Prisacariu, Victor Adrian
    Reid, Ian
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,