Amodal instance segmentation with dual guidance from contextual and shape priors

被引:0
|
作者
Zhan, Jiao [1 ]
Luo, Yarong [1 ]
Guo, Chi [1 ,2 ]
Wu, Yejun [3 ]
Yang, Bohan [1 ]
Wang, Jingrong [1 ]
Liu, Jingnan [1 ]
机构
[1] Wuhan Univ, GNSS Res Ctr, Wuhan 430072, Hubei, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Hubei, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
基金
中国博士后科学基金;
关键词
Instance segmentation; Amodal instance segmentation; Pixel affinity; Contextual dependency;
D O I
10.1016/j.asoc.2024.112602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human perception possesses the remarkable ability to mentally reconstruct the complete structure of occluded objects, which has inspired researchers to pursue amodal instance segmentation for a more comprehensive understanding of the scene. Previous works have shown promising results, but they often capture the contextual dependencies in an unsupervised way, which can lead to undesirable contextual dependencies and unreasonable feature representations. To tackle this problem, we propose a Pixel Affinity-Parsing (PAP) module trained with the Pixel Affinity Loss (PAL). Embedded into CNN, the PAP module can leverage learned contextual priors to guide the network to explicitly distinguish different relationships between pixels, thus capturing the intraclass and inter-class contextual dependencies in a non-local and supervised way. This process helps to yield robust feature representations to prevent the network from misjudging. To demonstrate the effectiveness of the PAP module, we design an effective Pixel Affinity-Parsing Network (PAPNet). Notably, PAPNet also introduces shape priors to guide the amodal mask refinement process, thus preventing implausible shapes in the predicted masks. Consequently, with the dual guidance of contextual and shape priors, PAPNet can reconstruct the full shape of occluded objects accurately and reasonably. Experimental results demonstrate that the proposed PAPNet outperforms existing state-of-the-art methods on multiple amodal datasets. Specifically, on the KINS dataset, PAPNet achieves 37.1% AP, 60.6% AP50 and 39.8% AP75, surpassing C2F-Seg by 0.6%, 2.4% and 2.8%. On the D2SA dataset, PAPNet achieves 71.70% AP, 85.98% AP50 and 77.10% AP75, surpassing PGExp by 0.75% and 0.33% in AP50 and AP75, and being comparable to PGExp in AP. On the COCOA-cls dataset, PAPNet achieves 41.29% AP, 60.95% AP50 and 46.17% AP75, surpassing PGExp by 3.74%, 3.21% and 4.76%. On the CWALT dataset, PAPNet achieves 72.51% AP, 85.02% AP50 and 80.47% AP75, surpassing VRSPNet by 5.38%, 0.07% and 5.35%. The code is available at https://github.com/jiaoZ7688/PAP-Net.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Improving abdominal image segmentation with overcomplete shape priors
    Sadikine, Amine
    Badic, Bogdan
    Tasu, Jean-Pierre
    Noblet, Vincent
    Ballet, Pascal
    Visvikis, Dimitris
    Conze, Pierre-Henri
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 113
  • [42] Learning With Explicit Shape Priors for Medical Image Segmentation
    You, Xin
    He, Junjun
    Yang, Jie
    Gu, Yun
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (02) : 927 - 940
  • [43] Level Set Segmentation with Both Shape and Intensity Priors
    Chen, Siqi
    Radke, Richard J.
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 763 - 770
  • [44] Hedgehog Shape Priors for Multi-object Segmentation
    Isack, Hossam
    Veksler, Olga
    Sonka, Milan
    Boykov, Yuri
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2434 - 2442
  • [45] Building and enforcing shape priors for segmentation of alloy micrographs
    Huffman, Landis M.
    Simmons, Jeff P.
    De Graef, Marc
    Pollak, Ilya
    COMPUTATIONAL IMAGING XI, 2013, 8657
  • [46] Interactive graph cut based segmentation with shape priors
    Freedman, D
    Zhang, T
    2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 755 - 762
  • [47] Nonparametric Joint Shape and Feature Priors for Image Segmentation
    Erdil, Ertunc
    Ghani, Muhammad Usman
    Rada, Lavdie
    Argunsah, Ali Ozgur
    Unay, Devrim
    Tasdizen, Tolga
    Cetin, Mujdat
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5312 - 5323
  • [48] Nonlinear dynamical shape priors for level set segmentation
    Cremers, Daniel
    JOURNAL OF SCIENTIFIC COMPUTING, 2008, 35 (2-3) : 132 - 143
  • [49] Optimal Multiple Surface Segmentation With Shape and Context Priors
    Song, Qi
    Bai, Junjie
    Garvin, Mona K.
    Sonka, Milan
    Buatti, John M.
    Wu, Xiaodong
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2013, 32 (02) : 376 - 386
  • [50] Automatic segmentation of overlapping fish using shape priors
    Clausen, Sigmund
    Greiner, Katharina
    Andersen, Odd
    Lie, Knut-Andreas
    Schulerud, Helene
    Kavlil, Tom
    IMAGE ANALYSIS, PROCEEDINGS, 2007, 4522 : 11 - +