Multi-Class Video Co-Segmentation with a Generative Multi-Video Model

被引:47
|
作者
Chiu, Wei-Chen [1 ]
Fritz, Mario [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
D O I
10.1109/CVPR.2013.48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video data provides a rich source of information that is available to us today in large quantities e. g. from on-line resources. Tasks like segmentation benefit greatly from the analysis of spatio-temporal motion patterns in videos and recent advances in video segmentation has shown great progress in exploiting these addition cues. However, observing a single video is often not enough to predict meaningful segmentations and inference across videos becomes necessary in order to predict segmentations that are consistent with objects classes. Therefore the task of video co-segmentation is being proposed, that aims at inferring segmentation from multiple videos. But current approaches are limited to only considering binary foreground/background segmentation and multiple videos of the same object. This is a clear mismatch to the challenges that we are facing with videos from online resources or consumer videos. We propose to study multi-class video co-segmentation where the number of object classes is unknown as well as the number of instances in each frame and video. We achieve this by formulating a non-parametric bayesian model across videos sequences that is based on a new videos segmentation prior as well as a global appearance model that links segments of the same class. We present the first multi-class video co-segmentation evaluation. We show that our method is applicable to real video data from online resources and outperforms state-of-the-art video segmentation and image co-segmentation baselines.
引用
收藏
页码:321 / 328
页数:8
相关论文
共 50 条
  • [41] Supervised Nonparametric Multimodal Topic Models for Multi-class Video Classification
    Xue, Jianfei
    Eguchi, Koji
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2019, 7 (02): : 80 - 91
  • [42] HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy
    Borgli, Hanna
    Thambawita, Vajira
    Smedsrud, Pia H.
    Hicks, Steven
    Jha, Debesh
    Eskeland, Sigrun L.
    Randel, Kristin Ranheim
    Pogorelov, Konstantin
    Lux, Mathias
    Nguyen, Duc Tien Dang
    Johansen, Dag
    Griwodz, Carsten
    Stensland, Hakon K.
    Garcia-Ceja, Enrique
    Schmidt, Peter T.
    Hammer, Hugo L.
    Riegler, Michael A.
    Halvorsen, Pal
    de Lange, Thomas
    SCIENTIFIC DATA, 2020, 7 (01)
  • [43] Multi-video Object Synopsis Integrating Optimal View Switching
    Zhang, Zhensong
    Nie, Yongwei
    Sun, Hanqiu
    Lai, Qiuxia
    Li, Guiqing
    IGGRAPH ASIA 2017 TECHNICAL BRIEFS (SA'17), 2017,
  • [44] HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy
    Hanna Borgli
    Vajira Thambawita
    Pia H. Smedsrud
    Steven Hicks
    Debesh Jha
    Sigrun L. Eskeland
    Kristin Ranheim Randel
    Konstantin Pogorelov
    Mathias Lux
    Duc Tien Dang Nguyen
    Dag Johansen
    Carsten Griwodz
    Håkon K. Stensland
    Enrique Garcia-Ceja
    Peter T. Schmidt
    Hugo L. Hammer
    Michael A. Riegler
    Pål Halvorsen
    Thomas de Lange
    Scientific Data, 7
  • [45] Video Object Discovery and Co-Segmentation with Extremely Weak Supervision
    Wang, Le
    Hua, Gang
    Sukthankar, Rahul
    Xue, Jianru
    Niu, Zhenxing
    Zheng, Nanning
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (10) : 2074 - 2088
  • [46] Video Object Co-segmentation by Regulated Maximum Weight Cliques
    Zhang, Dong
    Javed, Omar
    Shah, Mubarak
    COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 551 - 566
  • [47] MvsGCN: A Novel Graph Convolutional Network for Multi-video Summarization
    Wu, Jiaxin
    Zhong, Sheng-hua
    Liu, Yan
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 827 - 835
  • [48] MULTI-CLASS SEMANTIC SEGMENTATION OF FACES
    Khan, Khalil
    Mauro, Massimo
    Leonardi, Riccardo
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 827 - 831
  • [49] Pricing model for Internet-based multi-class Video-On-Demand (VOD) services
    Kim, Whan-Seon
    TELECOMMUNICATION SYSTEMS, 2006, 33 (04) : 317 - 331
  • [50] Pricing model for internet-based multi-class Video-On-Demand(VOD) services
    Whan-Seon Kim
    Telecommunication Systems, 2006, 33 : 317 - 331