Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking

被引:17
|
作者
Yan, Yan [1 ]
Xu, Chenliang [2 ]
Cai, Dawen [3 ]
Corso, Jason J. [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
[3] Univ Michigan, Dept Cell & Dev Biol, Biophys, Ann Arbor, MI 48109 USA
关键词
D O I
10.1109/CVPR.2017.115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained activity understanding in videos has attracted considerable recent attention with a shift from action classification to detailed actor and action understanding that provides compelling results for perceptual needs of cutting-edge autonomous systems. However, current methods for detailed understanding of actor and action have significant limitations: they require large amounts of finely labeled data, and they fail to capture any internal relationship among actors and actions. To address these issues, in this paper, we propose a novel, robust multi-task ranking model for weakly-supervised actor-action segmentation where only video-level tags are given for training samples. Our model is able to share useful information among different actors and actions while learning a ranking matrix to select representative supervoxels for actors and actions respectively. Final segmentation results are generated by a conditional random field that considers various ranking scores for video parts. Extensive experimental results on the Actor-Action Dataset (A2D) demonstrate that the proposed approach outperforms the state-of-the-art weakly supervised methods and performs as well as the topperforming fully supervised method.
引用
收藏
页码:1022 / 1031
页数:10
相关论文
共 50 条
  • [31] Robust Visual Tracking via Multi-Task Sparse Learning
    Zhang, Tianzhu
    Ghanem, Bernard
    Liu, Si
    Ahuja, Narendra
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2042 - 2049
  • [32] Multi-proposal collaboration and multi-task training for weakly-supervised video moment retrieval
    Zhang, Bolin
    Yang, Chao
    Jiang, Bin
    Komamizu, Takahiro
    Ide, Ichiro
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [33] Prototypical Transformer for Weakly Supervised Action Segmentation
    Lin, Tao
    Chang, Xiaobin
    Sun, Wei
    Zheng, Weishi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 195 - 206
  • [34] Ranking Performance Measures in Multi-Task Agencies
    Christensen, Peter O.
    Sabac, Florin
    Tian, Jie
    ACCOUNTING REVIEW, 2010, 85 (05): : 1545 - 1575
  • [35] Multi-task ranking SVM for image cosegmentation
    Liang, Xianpeng
    Zhu, Lin
    Huang, De-Shuang
    NEUROCOMPUTING, 2017, 247 : 126 - 136
  • [36] A Multi-Task Dense Network with Self-Supervised Learning for Retinal Vessel Segmentation
    Tu, Zhonghao
    Zhou, Qian
    Zou, Hua
    Zhang, Xuedong
    ELECTRONICS, 2022, 11 (21)
  • [37] Multi-task learning for gland segmentation
    Rezazadeh, Iman
    Duygulu, Pinar
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (01) : 1 - 9
  • [38] MULTI-TASK SELF-SUPERVISED VISUAL REPRESENTATION LEARNING FOR MONOCULAR ROAD SEGMENTATION
    Cho, Jaehoon
    Kim, Youngjung
    Jung, Hyungjoo
    Oh, Changjae
    Youn, Jaesung
    Sohn, Kwanghoon
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [39] Multi-Task Learning for Subspace Segmentation
    Wang, Yu
    Wipf, David
    Ling, Qing
    Chen, Wei
    Wassell, Ian
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1209 - 1217
  • [40] Multi-task learning for gland segmentation
    Iman Rezazadeh
    Pinar Duygulu
    Signal, Image and Video Processing, 2023, 17 : 1 - 9