Dense Unsupervised Learning for Video Segmentation

被引:0
|
作者
Araslanov, Nikita [1 ]
Schaub-Meyer, Simone [1 ]
Roth, Stefan [1 ,2 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] hessian AI, Darmstadt, Germany
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train such a model results in a degenerate solution. We propose to prevent this with a simple regularisation scheme, accommodating the equivariance property of the segmentation task to similarity transformations. Our training objective admits efficient implementation and exhibits fast training convergence. On established VOS benchmarks, our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Unsupervised Network Learning for Cell Segmentation
    Han, Liang
    Yin, Zhaozheng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 282 - 292
  • [32] Metric Learning for Unsupervised Phoneme Segmentation
    Qiao, Yu
    Minematsu, Nobuaki
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1060 - 1063
  • [33] Unsupervised learning of dense hierarchical appearance representations
    Scalzo, Fabien
    Piater, Justus H.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 395 - +
  • [34] Unsupervised video object segmentation by spatiotemporal graphical model
    Guo, Lijun
    Cheng, Tingting
    Huang, Yuanjie
    Zhao, Jieyu
    Zhang, Rong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (01) : 1037 - 1053
  • [35] Warp Consistency for Unsupervised Learning of Dense Correspondences
    Truong, Prune
    Danelljan, Martin
    Yu, Fisher
    Van Gool, Luc
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10326 - 10336
  • [36] Guided Slot Attention for Unsupervised Video Object Segmentation
    Lee, Minhyeok
    Cho, Suhwan
    Lee, Dogyoon
    Park, Chaewon
    Lee, Jungho
    Lee, Sangyoun
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 3807 - 3816
  • [37] Unsupervised video object segmentation by spatiotemporal graphical model
    Lijun Guo
    Tingting Cheng
    Yuanjie Huang
    Jieyu Zhao
    Rong Zhang
    Multimedia Tools and Applications, 2017, 76 : 1037 - 1053
  • [38] Unsupervised segmentation of defocused video based on matting model
    Li, Hongliang
    Ngan, King Ngi
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1825 - +
  • [39] Asymmetric Attention Fusion for Unsupervised Video Object Segmentation
    Jiang, Hongfan
    Wu, Xiaojun
    Xu, Tianyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 170 - 182
  • [40] Dual Prototype Attention for Unsupervised Video Object Segmentation
    Cho, Suhwan
    Lee, Minhyeok
    Lee, Seunghoon
    Lee, Dogyoon
    Choi, Heeseung
    Kim, Ig-Jae
    Lee, Sangyoun
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19238 - 19247